Zhuofeng Li

Texas A&M University; Stanford University


Hi there 👋. My name is Zhuofeng Li. I’m a CS Ph.D. student at Texas A&M University, advised by Prof. Yu Zhang. I am also a visiting student at Stanford University, working with Prof. Yejin Choi and Prof. James Zou. Previously, I interned at TIGER-Lab at the University of Waterloo, working with Prof. Wenhu Chen.

My research focuses on LLM/VLM post-training, including reasoning, alignment, evaluation, and applications. Recently, I have been particularly focused on agentic reinforcement learning. As part of this direction, I built AgentFlow (1.1K+⭐) and VerlTool (650+⭐) as a core contributor to push the boundaries of agentic reasoning.

🤝 I am actively seeking research collaborations and internship opportunities in LLM/VLM post-training, reinforcement learning, agents, and other exciting directions.

Feel free to reach out to me at zhuofengli12345@gmail.com.

news

Oct 19, 2025 🔥 Thrilled to announce that AgentFlow has been accepted as a NeurIPS 2025 Workshop Oral (Top-3) and ranked #2 Paper of the Day on Hugging Face 🤗!
Oct 05, 2025 🚀 Thrilled to introduce AgentFlow, a trainable, tool-integrated agentic framework that optimizes agents directly within the system in an online fashion using Flow-GRPO 🌀💫, achieving superior tool use 🛠 and long-horizon reasoning 🧠 across diverse domains!
Oct 02, 2025 🚀 Excited to share VideoScore2, which brings RL to generative video evaluation with scoring and rich reasoning traces!
Aug 22, 2025 🎉 VerlTool is approaching 550 stars on GitHub! It’s becoming a key framework for Tool-Agent RL training.
Aug 18, 2025 🎉 VerlTool’s tech report (co-first author) is out! Check it out on Hugging Face Daily Papers!

selected publications

*Co-first Author

  1. NeurIPS 2025 Oral
    In-The-Flow Agentic System Optimization for Effective Planning and Tool Use
    Zhuofeng Li*, Haoxiang Zhang*, Seungju Han, and 6 more authors
    In NeurIPS 2025 Workshop, Oct 2025
  2. arXiv
    VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
    Dongfu Jiang*, Yi Lu*, Zhuofeng Li*, and 9 more authors
    arXiv preprint, Sep 2025
  3. CIKM 2025
    GReF: A Unified Generative Framework for Efficient Reranking via Ordered Multi-token Prediction
    Zhijie Lin*, Zhuofeng Li*, Chenglei Dai, and 5 more authors
    In Proceedings of the 34th ACM International Conference on Information and Knowledge Management, Nov 2025
  4. TMLR 2025
    Avoiding Structural Pitfalls: Self-Supervised Low-Rank Feature Tuning for Graph Test-Time Adaptation
    Haoxiang Zhang*, Zhuofeng Li*, Qiannan Zhang, and 3 more authors
    In Transactions on Machine Learning Research (TMLR), Oct 2025
  5. arXiv
    VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation
    Wentao Ma, Weiming Ren, Yiming Jia, and 4 more authors
    arXiv preprint, May 2025
  6. arXiv
    StructEval: Benchmarking LLMs’ Capabilities to Generate Structural Outputs
    Jialin Yang, Dongfu Jiang, Lipeng He, and 8 more authors
    arXiv preprint, May 2025
  7. NeurIPS 2024
    TEG-DB: A Comprehensive Dataset and Benchmark of Textual-Edge Graphs
    Zhuofeng Li, Zixing Gou, Xiangnan Zhang, and 6 more authors
    In Advances in Neural Information Processing Systems, Dec 2024
  8. CIKM 2024
    Learning from Novel Knowledge: Continual Few-Shot Knowledge Graph Completion
    Zhuofeng Li, Haoxiang Zhang, Qiannan Zhang, and 2 more authors
    In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, Oct 2024