Minghao Liu

Papers
23

Authored papers

23

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies

arXiv 2026

2026

SIN-Bench: Tracing Native Evidence Chains in Long-Context Multimodal Scientific Interleaved Literature

arXiv 2026

2026

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

arXiv 2026

2026

YuE: Scaling Open Foundation Models for Long-Form Music Generation

arXiv 2025

2025

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix

arXiv 2025

2025

MetaOcc: Surround-View 4D Radar and Camera Fusion Framework for 3D Occupancy Prediction with Dual Training Strategies

arXiv 2025

2025

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

arXiv 2025

2025

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

arXiv 2025

2025

Reverse-Engineered Reasoning for Open-Ended Generation

arXiv 2025

2025

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

arXiv 2025

2025

COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes

arXiv 2025

2025

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

arXiv 2025

2025

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

arXiv 2025

2025

A^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning

arXiv 2025

2025

Objaverse++: Curated 3D Object Dataset with Quality Annotations

arXiv 2025

2025

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

arXiv 2025

2025

Towards Personalized Deep Research: Benchmarks and Evaluations

arXiv 2025

2025

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures

arXiv 2025

2025

VeriGUI: Verifiable Long-Chain GUI Dataset

arXiv 2025

2025

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

CVPR 2025 1

2024

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

arXiv 2024

2024

AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions

arXiv 2024

2024

A Comparative Study on Reasoning Patterns of OpenAI's o1 Model

arXiv 2024

2024

Affiliations

No known affiliations.

Frequent co-authors

10

from 23 papers