Zhuofeng Li

Texas A&M University; Stanford University;

prof_pic.jpg

Hi there šŸ‘‹. My name is Zhuofeng Li. I’m a CS Ph.D. student at Texas A&M University, advised by Prof. Yu Zhang. I am also a visiting student at Stanford University working with Prof. Yejin Choi and Prof. James Zou. Previously, I interned TIGER-Lab at University of Waterloo working with Prof. Wenhu Chen.

My research lies in LLM/VLM-post training, including reasoning, alignment, evaluation, and applications. Recently, I am particularly focus on agentic reinforcement learning.

As part of this direction, I lead / co-lead OpenResearcher (Adopted by NVIDIA’s Nemotron family of models & Top 3 Trending Dataset on Hugging Face & 400+⭐), AgentFlow (ICLR 26 Oral & 1.6K+⭐) and VerlTool (850+⭐) to push the boundaries of agentic reasoning.

šŸ¤ I am actively seeking research collaborations and intern opportunities in LLM/VLM post-training, reinforcement learning, agents, and other exciting directions. Feel Feel free to reach out me through zhuofengli12345@gmail.com

Github / Linkedin / Twitter / Google Scholar / Email / WeChat

news

Mar 24, 2026 šŸ”„ New! OpenResearcher paper is now available, highlighting practical insights into deep research pipeline design. Already adopted by NVIDIA’s Nemotron family of models!
Feb 09, 2026 We’ve released OpenResearcher — a fully open, state-of-the-art deep research agent (30B-A3B). Model, data, code — everything is open. Try demo now!
Jan 27, 2026 Two papers accepted at ICLR 2026: AgentFlow (Oral) on in-the-flow agentic system optimization, and ImagenWorld on stress-testing image generation models!
Dec 13, 2025 šŸ† Excited that AgentFlow won Best Paper Runner-up at the Effective Reasoning Workshop!
Dec 12, 2025 šŸ”„ Thrilled to share that StructEval has been acceptecd to TMLR!

selected publications

*Co-first Author

  1. Arxiv
    openresearcher.png
    OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
    Zhuofeng Li * , Dongfu Jiang * , Xueguang Ma * , and 7 more authors
    In arxiv preprint, Mar 2026
  2. ICLR 2026 Oral
    agentflow.png
    In-The-Flow Agentic System Optimization for Effective Planning and Tool Use
    Zhuofeng Li * , Haoxiang Zhang * , Seungju Han , and 6 more authors
    In ICLR 2026 Oral & NeurIPS 2025 Workshop Best Paper Runner-up, Oct 2025
  3. Arxiv
    verltool.png
    VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
    Dongfu Jiang * , Yi Lu * , Zhuofeng Li * , and 9 more authors
    In arxiv preprint, Sep 2025
  4. CIKM 2025
    gref.png
    GReF: A Unified Generative Framework for Efficient Reranking via Ordered Multi-token Prediction
    Zhijie Lin * , Zhuofeng Li * , Chenglei Dai , and 5 more authors
    In Proceedings of the 34th ACM International Conference on Information and Knowledge Management, Nov 2025
  5. TMLR 2025
    goat.png
    Avoiding Structural Pitfalls: Self-Supervised Low-Rank Feature Tuning for Graph Test-Time Adaptation
    Haoxiang Zhang * , Zhuofeng Li * , Qiannan Zhang , and 3 more authors
    In Transactions on Machine Learning Research (TMLR), Oct 2025, Oct 2025
  6. ICLR 2026
    imagenworld.png
    ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks
    Samin Mahdizadeh Sani , Max Ku , Nima Jamali , and 23 more authors
    In ICLR 2026, Jan 2026
  7. Arxiv
    video_eval_pro.png
    VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation
    Wentao Ma , Weiming Ren , Yiming Jia , and 4 more authors
    In arxiv preprint, May 2025
  8. TMLR 2025
    structeval.png
    StructEval: Benchmarking LLMs’ Capabilities to Generate Structural Outputs
    Jialin Yang , Dongfu Jiang , Lipeng He , and 8 more authors
    In Transactions on Machine Learning Research (TMLR), Dec 2025, May 2025
  9. NeurIPS 2024
    tegdb.png
    Teg-db: A comprehensive dataset and benchmark of textual-edge graphs
    Zhuofeng Li , Zixing Gou , Xiangnan Zhang , and 6 more authors
    In Advances in Neural Information Processing Systems, Dec 2024
  10. CIKM 2024
    cfkgc.png
    Learning from novel knowledge: Continual few-shot knowledge graph completion
    Zhuofeng Li , Haoxiang Zhang , Qiannan Zhang , and 2 more authors
    In Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, Oct 2024