Recent Publications | Zhuoran Yang

Unveiling induction heads: Provable training dynamics and feature learning in transformers

Siyu Chen, Heejune Sheen, Tianhao Wang, and Zhuoran Yang

arXiv preprint arXiv:2409.10559, 2024

HTML Tweet
Training dynamics of multi-head softmax attention for in-context learning: Emergence, convergence, and optimality

Siyu Chen, Heejune Sheen, Tianhao Wang, and Zhuoran Yang

arXiv preprint arXiv:2402.19442, 2024

HTML Tweet
Unveil conditional diffusion models with classifier-free guidance: A sharp statistical theory

Hengyu Fu, Zhuoran Yang, Mengdi Wang, and Minshuo Chen

arXiv preprint arXiv:2403.11968, 2024

HTML
Unveiling the statistical foundations of chain-of-thought prompting methods

Xinyang Hu, Fengzhuo Zhang, Siyu Chen, and Zhuoran Yang

arXiv preprint arXiv:2408.14511, 2024

HTML
On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games

Awni Altabaa, and Zhuoran Yang

Advances in Neural Information Processing Systems, 2024

HTML
Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks

Siyu Chen, Mengdi Wang, and Zhuoran Yang

arXiv preprint arXiv:2307.14085, 2023

HTML
A two-timescale stochastic algorithm framework for bilevel optimization: Complexity analysis and application to actor-critic

Mingyi Hong, Hoi-To Wai, Zhaoran Wang, and Zhuoran Yang

SIAM Journal on Optimization, 2023

HTML
Is pessimism provably efficient for offline rl?

Ying Jin, Zhuoran Yang, and Zhaoran Wang

In International Conference on Machine Learning, 2021

HTML
A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations

Siyu Chen, Yitan Wang, Zhaoran Wang, and Zhuoran Yang

arXiv preprint arXiv:2303.11187, 2023

HTML
Can We Find Nash Equilibria at a Linear Rate in Markov Games?

Zhuoqing Song, Jason D Lee, and Zhuoran Yang

In The Eleventh International Conference on Learning Representations, 2022

HTML
GEC: A posterior sampling framework for interactive decision making

Han Zhong, Wei Xiong, Sirui Zheng, Liwei Wang, Zhaoran Wang, Zhuoran Yang, and Tong Zhang

arXiv preprint arXiv:2211.01962, 2022

HTML
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes

Miao Lu, Yifei Min, Zhaoran Wang, and Zhuoran Yang

In The Eleventh International Conference on Learning Representations, 2022

HTML
On function approximation in reinforcement learning: Optimism in the face of large state spaces

Zhuoran Yang, Chi Jin, Zhaoran Wang, Mengdi Wang, and Michael I Jordan

Advances in Neural Information Processing Systems, 2020

HTML
Provably Efficient Reinforcement Learning with Linear Function Approximation

Chi Jin, Zhuoran Yang, Zhaoran Wang, and Michael I Jordan

Mathematics of Operations Research, 2023

HTML
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium

Qiaomin Xie, Yudong Chen, Zhaoran Wang, and Zhuoran Yang

Mathematics of Operations Research, 2023

HTML
Strategic decision-making in the presence of information asymmetry: Provably efficient rl with algorithmic instruments

Mengxin Yu, Zhuoran Yang, and Jianqing Fan

arXiv preprint arXiv:2208.11040, 2022

HTML
Sequential information design: Markov persuasion process and its efficient reinforcement learning

Jibang Wu, Zixuan Zhang, Zhe Feng, Zhaoran Wang, Zhuoran Yang, Michael I Jordan, and Haifeng Xu

arXiv preprint arXiv:2202.10678, 2022

HTML
Multi-agent reinforcement learning: A selective overview of theories and algorithms

Kaiqing Zhang, Zhuoran Yang, and Tamer Başar

Handbook of reinforcement learning and control, 2021

HTML