Openings
I am actively seeking self-motivated students with strong mathematical or programming skills for the following positions:
Research Interests
My recent research focuses on sequential decision-making problems and their applications in Embodied AI. Specifically, my research interests include both the theory and applications of reinforcement learning, game theory, nonconvex optimization, large language models (LLMs), robotics, generative models (e.g., diffusion models), and econometrics.
Selected Recent Publications [Full List]
[LLM Reasoning] Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models. (Preprint) [PDF] [Code] [Media Coverage | Lead Story]
[Multi-Objective RL] Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning. (Preprint) [PDF]
[Robust LLM] ROPO: Robust Preference Optimization for Large Language Models. International Conference on Machine Learning (ICML), 2025 [PDF]
[RL] On the Value of Myopic Behavior in Policy Reuse. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025 [PDF]
[LLM] Online Preference Alignment for Language Models via Count-based Exploration. International Conference on Learning Representations (ICLR Spotlight), 2025 [PDF] [Code] [Media Coverage]
[Robust RL] Tackling Data Corruption in Offline Reinforcement Learning via Sequence Modeling. International Conference on Learning Representations (ICLR), 2025 [PDF]
[RL & Diffusion] Forward KL Regularized Preference Optimization for Aligning Diffusion Policies. AAAI Conference on Artificial Intelligence (AAAI), 2025 [PDF]
[RL & Econ] Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach. Journal of Machine Learning Research (JMLR), 2024 [PDF]
[Multi-Objective LLM] Rewards-in-Context: Multi-Objective Alignment of Foundation Models with Dynamic Preference Adjustment. International Conference on Machine Learning (ICML), 2024 [PDF] [Code]
[Risk-Sensitive RL] Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning. International Conference on Machine Learning (ICML Spotlight), 2024 [PDF]
[Multi-Objective LLM] Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards. Annual Meeting of the Association for Computational Linguistics (ACL main), 2024 [PDF] [Code]
[RL & Game] Posterior Sampling for Competitive RL: Function Approximation and Partial Observation. Advances in Neural Information Processing Systems (NeurIPS), 2023 [PDF]
[RL] Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics. International Conference on Learning Representations (ICLR), 2023 [PDF]
[Optimization] Gradient-Variation Bound for Online Convex Optimization with Constraints. AAAI Conference on Artificial Intelligence (AAAI), 2023 [PDF]
[RL & Game] Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning. International Conference on Machine Learning (ICML), 2022 [PDF] [Code]
[Optimization] In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle. International Conference on Management of Data (SIGMOD), 2022 [PDF] [Extended Version]
[RL & Game] On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game. International Conference on Machine Learning (ICML), 2021 [PDF]
[RL & Game] Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions. International Conference on Machine Learning (ICML), 2021 [PDF]
[Image Rendering] Stylized Neural Painting. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021 [PDF] [Code] [Project]
[Safe RL] Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss. Advances in Neural Information Processing Systems (NeurIPS), 2020 [PDF]
[Compressed Sensing] Robust One-Bit Recovery via ReLU Generative Networks: Near-Optimal Statistical Rate and Global Landscape Analysis. International Conference on Machine Learning (ICML), 2020 [PDF]
Grant
Academic Service Conference