Zizhen Li
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8LongCLI-Bench: A Preliminary Benchmark and Study for Long-horizon Agentic Programming in Command-Line Interfaces
arXiv 2026
MeepleLM: A Virtual Playtester Simulating Diverse Subjective Experiences
arXiv 2026
World Craft: Agentic Framework to Create Visualizable Worlds via Text
arXiv 2026
Sekai: A Video Dataset towards World Exploration
arXiv 2025
InMind: Evaluating LLMs in Capturing and Applying Individual Human Reasoning Styles
arXiv 2025
MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models
mdk12-bench-a-multi-discipline-benchmark-for
ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy
arXiv 2025
ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges
ICCV 2025
Affiliations
Frequent co-authors
10from 8 papers