Zhuofeng Li

Texas A&M University; Stanford University;

prof_pic.jpg

Hi there ๐Ÿ‘‹. My name is Zhuofeng Li. Iโ€™m a CS Ph.D. student at Texas A&M University, advised by Prof. Yu Zhang. I am also a visiting student at Stanford University working with Prof. Yejin Choi and Prof. James Zou. Previously, I interned TIGER-Lab at University of Waterloo working with Prof. Wenhu Chen.

My research lies in LLM/VLM-post training, including reasoning, alignment, evaluation, and applications. Recently, I am particularly focus on agentic reinforcement learning. As part of this direction, I as the core contributor build VerlTool (500+ stars), a unified and extensible framework for tool-agent RL training.

I am actively seeking research collaborations and intern opportunities in LLM/VLM post-training, reinforcement learning, agents, and other exciting directions.

Feel free to reach out me through zhuofengli12345@gmail.com

news

Oct 05, 2025 ๐Ÿšจ Thrilled to introduce AgentFlow โ€” a trainable, tool-integrated agentic framework that directly optimizes agents within the system in an online fashion using Flow-GRPO ๐ŸŒ€๐Ÿ’ซ, achieving superior tool use ๐Ÿ›  and long-horizon reasoning ๐Ÿง  across diverse domains!
Oct 02, 2025 ๐Ÿš€ Excited to share VideoScore2 โ€” bringing RL to generative video evaluation with scoring and rich reasoning traces!
Aug 22, 2025 ๐ŸŽ‰ VerlTool is approaching 550+ stars on GitHub! Itโ€™s becoming a key framework for Tool-Agent RL training.
Aug 18, 2025 ๐ŸŽ‰ VerlToolโ€™s tech report (co-first author) is out! Please see on Hugging Face Daily Paper!
Aug 15, 2025 ๐Ÿš€ Starting my CS PhD Journey at Teaxs A&M University!

selected publications

*Co-first Author

  1. Arxiv
    agentflow.png
    In-The-Flow Agentic System Optimization for Effective Planning and Tool Use
    Zhuofeng Li * , Haoxiang Zhang * , Seungju Han , and 6 more authors
    In arxiv preprint, Oct 2025
  2. Arxiv
    verltool.png
    VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
    Dongfu Jiang * , Yi Lu * , Zhuofeng Li * , and 9 more authors
    In arxiv preprint, Sep 2025
  3. CIKM 2025
    gref.png
    GReF: A Unified Generative Framework for Efficient Reranking via Ordered Multi-token Prediction
    Zhijie Lin * , Zhuofeng Li * , Chenglei Dai , and 5 more authors
    In Proceedings of the 34th ACM International Conference on Information and Knowledge Management, Nov 2025
  4. TMLR 2025
    goat.png
    Avoiding Structural Pitfalls: Self-Supervised Low-Rank Feature Tuning for Graph Test-Time Adaptation
    Haoxiang Zhang * , Zhuofeng Li * , Qiannan Zhang , and 3 more authors
    In Transactions on Machine Learning Research (TMLR), Oct 2025, Oct 2025
  5. Arxiv
    video_eval_pro.png
    VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation
    Wentao Ma , Weiming Ren , Yiming Jia , and 4 more authors
    In arxiv preprint, May 2025
  6. Arxiv
    structeval.png
    StructEval: Benchmarking LLMsโ€™ Capabilities to Generate Structural Outputs
    Jialin Yang , Dongfu Jiang , Lipeng He , and 8 more authors
    In arxiv preprint, May 2025
  7. NeurIPS 2024
    tegdb.png
    Teg-db: A comprehensive dataset and benchmark of textual-edge graphs
    Zhuofeng Li , Zixing Gou , Xiangnan Zhang , and 6 more authors
    In Advances in Neural Information Processing Systems, Dec 2024
  8. CIKM 2024
    cfkgc.png
    Learning from novel knowledge: Continual few-shot knowledge graph completion
    Zhuofeng Li , Haoxiang Zhang , Qiannan Zhang , and 2 more authors
    In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, Oct 2024