0

Zhenfei Yin

Papers
26

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
26papers

Authored papers

26

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

arXiv 2026

2026

TodoEvolve: Learning to Architect Agent Planning Systems

arXiv 2026

2026

SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents

arXiv 2026

2026

Think3D: Thinking with Space for Spatial Reasoning

arXiv 2026

2026

LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

arXiv 2026

2026

StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

arXiv 2026

2026

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

arXiv 2026

2026

MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems

arXiv 2025

2025

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning

arXiv 2025

2025

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

arXiv 2025

2025

From Word to World: Can Large Language Models be Implicit Text-based World Models?

arXiv 2025

2025

VeriGUI: Verifiable Long-Chain GUI Dataset

arXiv 2025

2025

Interleaving Reasoning for Better Text-to-Image Generation

arXiv 2025

2025

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning

arXiv 2025

2025

CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards

arXiv 2025

2025

BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset

arXiv 2025

2025

SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines

arXiv 2025

2025

MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems

arXiv 2025

2025

OASIS: Open Agent Social Interaction Simulations with One Million Agents

arXiv 2024

2024

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model

arXiv 2024

2024

B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens

arXiv 2024

2024

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

arXiv 2024

2024

Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System

arXiv 2024

2024

ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models

arXiv 2023

2023

Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models

arXiv 2023

2023

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 26 papers