Zhendong Mao

Papers
25

Authored papers

25

Lance: Unified Multimodal Modeling by Multi-Task Synergy

arXiv 2026

Stream-T1: Test-Time Scaling for Streaming Video Generation

arXiv 2026

Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation

arXiv 2026

DeepResearch Bench II: Diagnosing Deep Research Agents via Rubrics from Expert Report

arXiv 2026

NativeTok: Native Visual Tokenization for Improved Image Generation

arXiv 2026

FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents

arXiv 2026

WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora

arXiv 2026

Wiki Live Challenge: Challenging Deep Research Agents with Expert-Level Wikipedia Articles

arXiv 2026

A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces

arXiv 2026

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

arXiv 2025

From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding

arXiv 2025

D$^2$iT: Dynamic Diffusion Transformer for Accurate Image Generation

arXiv 2025

RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models

ICCV 2025

MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation

arXiv 2025

LongAnimation: Long Animation Generation with Dynamic Global-Local Memory

ICCV 2025

Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability

arXiv 2025

RealCustom++: Representing Images as Real-Word for Real-Time Customization

arXiv 2024

Benchmarking Large Language Models on Controllable Generation under Diversified Instructions

arXiv 2024

RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization

CVPR 2024

Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation

arXiv 2023

ExpertPrompting: Instructing Large Language Models to be Distinguished Experts

arXiv 2023

Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization

2023

Improving Chinese Spelling Check by Character Pronunciation Prediction: The Effects of Adaptivity and Granularity

arXiv 2022

ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting

arXiv 2022

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

CVPR 2021

Affiliations

No known affiliations.

Frequent co-authors

10

from 25 papers