Dong Zhang
- Papers: 19
Authored papers (19)
MiMo-V2-Flash Technical Report
arXiv 2026
A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code
arXiv 2025
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining
arXiv 2025
MiMo-VL Technical Report
arXiv 2025
VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions
arXiv 2025
Sparser Block-Sparse Attention via Token Permutation
arXiv 2025
ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow Matching
arXiv 2025
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems
arXiv 2025
Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents
arXiv 2025
SpeechAlign: Aligning Speech Generation to Human Preferences
arXiv 2024
GroundingGPT: Language Enhanced Multi-modal Grounding Model
arXiv 2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
arXiv 2024
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
arXiv 2024
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments
arXiv 2024
InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance
arXiv 2024
Aligning Medical Images with General Knowledge from Large Language Models
arXiv 2024
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
arXiv 2024
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models
arXiv 2023
SeqXGPT: Sentence-Level AI-Generated Text Detection
arXiv 2023
Frequent co-authors
10 from 19 papers