Oct 05, 2025 | π¨ Thrilled to introduce AgentFlow β a trainable, tool-integrated agentic framework that directly optimizes agents within the system in an online fashion using Flow-GRPO ππ«, achieving superior tool use π and long-horizon reasoning π§ across diverse domains! |
Oct 02, 2025 | π Excited to share VideoScore2 β bringing RL to generative video evaluation with scoring and rich reasoning traces! |
Aug 22, 2025 | π VerlTool is approaching 550+ stars on GitHub! Itβs becoming a key framework for Tool-Agent RL training. |
Aug 18, 2025 | π VerlToolβs tech report (co-first author) is out! Please see on Hugging Face Daily Paper! |
Aug 15, 2025 | π Starting my CS PhD Journey at Teaxs A&M University! |
Aug 13, 2025 | π€ Our paper GReF (co-first author) has been accepted to CIKM 2026. This recommendation reranking LLM project was conducted during my internship at Kuaishou. |
Jun 11, 2025 | ποΈ I start my exciting journey as a research intern at Stanford University, working with Prof. James Zou, Prof. Yejin Choi. |