news
| Feb 20, 2026 | Invited talk at Microsoft on Generative Learning via Adversarial Reward Estimation. |
|---|---|
| Aug 25, 2025 | I am excited to return as a part-time research intern at Microsoft in Fall 2025. The paper from my spring internship, Teaching Language Models to Gather Information Proactively, has been accepted to EMNLP-Findings 2025. I will keep working on post-training LLMs to align with human preferences :) |
| May 15, 2025 | Our paper R2D2: Remembering, Replaying and Dynamic Decision Making with a Reflective Agentic Memory has been accepted to ACL 2025! |
| May 04, 2025 | We’re so excited to have our tutorial on Creative Planning in Large Language Models presented at NAACL 2025. |