Recent Publications

  1. Unveiling induction heads: Provable training dynamics and feature learning in transformers
    Siyu Chen, Heejune Sheen, Tianhao Wang, and Zhuoran Yang
    arXiv preprint arXiv:2409.10559, 2024
  2. Training dynamics of multi-head softmax attention for in-context learning: Emergence, convergence, and optimality
    Siyu Chen, Heejune Sheen, Tianhao Wang, and Zhuoran Yang
    arXiv preprint arXiv:2402.19442, 2024
  3. Unveil conditional diffusion models with classifier-free guidance: A sharp statistical theory
    Hengyu Fu, Zhuoran Yang, Mengdi Wang, and Minshuo Chen
    arXiv preprint arXiv:2403.11968, 2024
  4. Unveiling the statistical foundations of chain-of-thought prompting methods
    Xinyang Hu, Fengzhuo Zhang, Siyu Chen, and Zhuoran Yang
    arXiv preprint arXiv:2408.14511, 2024
  5. On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games
    Awni Altabaa, and Zhuoran Yang
    Advances in Neural Information Processing Systems, 2024
  6. Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks
    Siyu Chen, Mengdi Wang, and Zhuoran Yang
    arXiv preprint arXiv:2307.14085, 2023
  7. A two-timescale stochastic algorithm framework for bilevel optimization: Complexity analysis and application to actor-critic
    Mingyi Hong, Hoi-To Wai, Zhaoran Wang, and Zhuoran Yang
    SIAM Journal on Optimization, 2023
  8. Is pessimism provably efficient for offline rl?
    Ying Jin, Zhuoran Yang, and Zhaoran Wang
    In International Conference on Machine Learning, 2021
  9. A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations
    Siyu Chen, Yitan Wang, Zhaoran Wang, and Zhuoran Yang
    arXiv preprint arXiv:2303.11187, 2023
  10. Can We Find Nash Equilibria at a Linear Rate in Markov Games?
    Zhuoqing Song, Jason D Lee, and Zhuoran Yang
    In The Eleventh International Conference on Learning Representations, 2022
  11. GEC: A posterior sampling framework for interactive decision making
    Han Zhong, Wei Xiong, Sirui Zheng, Liwei Wang, Zhaoran Wang, Zhuoran Yang, and Tong Zhang
    arXiv preprint arXiv:2211.01962, 2022
  12. Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
    Miao Lu, Yifei Min, Zhaoran Wang, and Zhuoran Yang
    In The Eleventh International Conference on Learning Representations, 2022
  13. On function approximation in reinforcement learning: Optimism in the face of large state spaces
    Zhuoran Yang, Chi Jin, Zhaoran Wang, Mengdi Wang, and Michael I Jordan
    Advances in Neural Information Processing Systems, 2020
  14. Provably Efficient Reinforcement Learning with Linear Function Approximation
    Chi Jin, Zhuoran Yang, Zhaoran Wang, and Michael I Jordan
    Mathematics of Operations Research, 2023
  15. Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
    Qiaomin Xie, Yudong Chen, Zhaoran Wang, and Zhuoran Yang
    Mathematics of Operations Research, 2023
  16. Strategic decision-making in the presence of information asymmetry: Provably efficient rl with algorithmic instruments
    Mengxin Yu, Zhuoran Yang, and Jianqing Fan
    arXiv preprint arXiv:2208.11040, 2022
  17. Sequential information design: Markov persuasion process and its efficient reinforcement learning
    Jibang Wu, Zixuan Zhang, Zhe Feng, Zhaoran Wang, Zhuoran Yang, Michael I Jordan, and Haifeng Xu
    arXiv preprint arXiv:2202.10678, 2022
  18. Multi-agent reinforcement learning: A selective overview of theories and algorithms
    Kaiqing Zhang, Zhuoran Yang, and Tamer Başar
    Handbook of reinforcement learning and control, 2021