Zhendong Mao
- Papers: 25
Authored papers (25)
Lance: Unified Multimodal Modeling by Multi-Task Synergy
arXiv 2026
Stream-T1: Test-Time Scaling for Streaming Video Generation
arXiv 2026
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
arXiv 2026
DeepResearch Bench II: Diagnosing Deep Research Agents via Rubrics from Expert Report
arXiv 2026
NativeTok: Native Visual Tokenization for Improved Image Generation
arXiv 2026
FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents
arXiv 2026
WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora
arXiv 2026
Wiki Live Challenge: Challenging Deep Research Agents with Expert-Level Wikipedia Articles
arXiv 2026
A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces
arXiv 2026
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
arXiv 2025
From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding
arXiv 2025
D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation
arXiv 2025
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
ICCV 2025
MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation
arXiv 2025
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
ICCV 2025
Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability
arXiv 2025
RealCustom++: Representing Images as Real-Word for Real-Time Customization
arXiv 2024
Benchmarking Large Language Models on Controllable Generation under Diversified Instructions
arXiv 2024
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
CVPR 2024 1
Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation
arXiv 2023
ExpertPrompting: Instructing Large Language Models to be Distinguished Experts
arXiv 2023
Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization
Improving Chinese Spelling Check by Character Pronunciation Prediction: The Effects of Adaptivity and Granularity
arXiv 2022
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
arXiv 2022
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
CVPR 2021 1
Affiliations
Frequent co-authors
10 from 25 papers