Recent Publications
- Unveil conditional diffusion models with classifier-free guidance: A sharp statistical theoryarXiv preprint arXiv:2403.11968, 2024
- Unveiling the statistical foundations of chain-of-thought prompting methodsarXiv preprint arXiv:2408.14511, 2024
- On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and GamesAdvances in Neural Information Processing Systems, 2024
- Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic FeedbacksarXiv preprint arXiv:2307.14085, 2023
- A two-timescale stochastic algorithm framework for bilevel optimization: Complexity analysis and application to actor-criticSIAM Journal on Optimization, 2023
- Is pessimism provably efficient for offline rl?In International Conference on Machine Learning, 2021
- A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing ObservationsarXiv preprint arXiv:2303.11187, 2023
- Can We Find Nash Equilibria at a Linear Rate in Markov Games?In The Eleventh International Conference on Learning Representations, 2022
- GEC: A posterior sampling framework for interactive decision makingarXiv preprint arXiv:2211.01962, 2022
- Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision ProcessesIn The Eleventh International Conference on Learning Representations, 2022
- On function approximation in reinforcement learning: Optimism in the face of large state spacesAdvances in Neural Information Processing Systems, 2020
- Provably Efficient Reinforcement Learning with Linear Function ApproximationMathematics of Operations Research, 2023
- Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated EquilibriumMathematics of Operations Research, 2023
- Strategic decision-making in the presence of information asymmetry: Provably efficient rl with algorithmic instrumentsarXiv preprint arXiv:2208.11040, 2022
- Sequential information design: Markov persuasion process and its efficient reinforcement learningarXiv preprint arXiv:2202.10678, 2022
- Multi-agent reinforcement learning: A selective overview of theories and algorithmsHandbook of reinforcement learning and control, 2021