Pengfei Zhou
- Papers: 9
Authored papers
ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents
arXiv 2026
EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models
arXiv 2025
Neural-Driven Image Editing
arXiv 2025
REPA Works Until It Doesn't: Early-Stopped, Holistic Alignment Supercharges Diffusion Training
arXiv 2025
MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
arXiv 2025
ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges
ICCV 2025
OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation
CVPR 2025
CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning
arXiv 2024
Frequent co-authors
10 from 9 papers