Zhenfei Yin
- Papers: 26
Authored papers (26)
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
arXiv 2026
TodoEvolve: Learning to Architect Agent Planning Systems
arXiv 2026
SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents
arXiv 2026
Think3D: Thinking with Space for Spatial Reasoning
arXiv 2026
LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning
arXiv 2026
StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction
arXiv 2026
TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents
arXiv 2026
MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems
arXiv 2025
VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning
arXiv 2025
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
arXiv 2025
From Word to World: Can Large Language Models be Implicit Text-based World Models?
arXiv 2025
VeriGUI: Verifiable Long-Chain GUI Dataset
arXiv 2025
Interleaving Reasoning for Better Text-to-Image Generation
arXiv 2025
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
arXiv 2025
CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards
arXiv 2025
BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset
arXiv 2025
SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines
arXiv 2025
MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
arXiv 2025
OASIS: Open Agent Social Interaction Simulations with One Million Agents
arXiv 2024
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
arXiv 2024
B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens
arXiv 2024
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
arXiv 2024
Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System
arXiv 2024
ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models
arXiv 2023
Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models
arXiv 2023
Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy
arXiv 2022
Frequent co-authors
10 from 26 papers