Dawei Zhu
- Papers
- 19
Cite
Notes
Only stored in your browser.
Authored papers
19MiMo-V2-Flash Technical Report
arXiv 2026
PaperBanana: Automating Academic Illustration for AI Scientists
arXiv 2026
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining
arXiv 2025
MiMo-VL Technical Report
arXiv 2025
A Comprehensive Survey on Long Context Language Modeling
arXiv 2025
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
arXiv 2025
A Survey on Latent Reasoning
arXiv 2025
LongAttn: Selecting Long-context Training Data via Token-level Attention
arXiv 2025
AFRIDOC-MT: Document-level MT Corpus for African Languages
arXiv 2025
InternLM-Law: An Open Source Chinese Legal Large Language Model
arXiv 2024
Fine-Grained and Multi-Dimensional Metrics for Document-Level Machine Translation
arXiv 2024
The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models
arXiv 2024
More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression
arXiv 2024
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?
arXiv 2024
LongEmbed: Extending Embedding Models for Long Context Retrieval
arXiv 2024
Large Language Models are not Fair Evaluators
arXiv 2023
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
arXiv 2023
ConFiguRe: Exploring Discourse-level Chinese Figures of Speech
COLING 2022 10
Analysing the Noise Model Error for Realistic Noisy Label Data
arXiv 2021
Affiliations
Frequent co-authors
10from 19 papers