I'm currently working on enhancing the reasoning capabilities of large language models by synthetic data generation and reinforcement learning.
-
Seoul National University
- Seoul
- https://symoon11.github.io/
Highlights
- Pro
Pinned Loading
-
snu-mllab/parsimonious-blackbox-attack
snu-mllab/parsimonious-blackbox-attack PublicOfficial TensorFlow implementation of "Parsimonious Black-Box Adversarial Attacks via Efficient Combinatorial Optimization" (ICML 2019)
-
snu-mllab/EDAC
snu-mllab/EDAC PublicOfficial PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
-
snu-mllab/DCPG
snu-mllab/DCPG PublicOfficial PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)
-
snu-mllab/Achievement-Distillation
snu-mllab/Achievement-Distillation PublicOfficial PyTorch implementation of "Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning" (NeurIPS 2023)
-
snu-mllab/guided-rest
snu-mllab/guided-rest PublicOfficial implementation of "Learning to Better Search with Language Models via Guided Reinforced Self-Training" (NeurIPS 2025)
Python 6
-
If the problem persists, check the GitHub status page or contact support.



