Minghao Liu
- Papers: 23
Authored papers (23)
EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies
arXiv 2026
SIN-Bench: Tracing Native Evidence Chains in Long-Context Multimodal Scientific Interleaved Literature
arXiv 2026
Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization
arXiv 2026
YuE: Scaling Open Foundation Models for Long-Form Music Generation
arXiv 2025
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
arXiv 2025
MetaOcc: Surround-View 4D Radar and Camera Fusion Framework for 3D Occupancy Prediction with Dual Training Strategies
arXiv 2025
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models
arXiv 2025
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
arXiv 2025
Reverse-Engineered Reasoning for Open-Ended Generation
arXiv 2025
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs
arXiv 2025
COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes
arXiv 2025
NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents
arXiv 2025
CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization
arXiv 2025
A^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning
arXiv 2025
Objaverse++: Curated 3D Object Dataset with Quality Annotations
arXiv 2025
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values
arXiv 2025
Towards Personalized Deep Research: Benchmarks and Evaluations
arXiv 2025
Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures
arXiv 2025
VeriGUI: Verifiable Long-Chain GUI Dataset
arXiv 2025
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
CVPR 2025
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series
arXiv 2024
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
arXiv 2024
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model
arXiv 2024
Frequent co-authors
10 from 23 papers