Nan Du
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10AXLearn: Modular Large Model Training on Heterogeneous Infrastructure
arXiv 2025
S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
arXiv 2025
Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding
arXiv 2025
MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity
arXiv 2024
Self-playing Adversarial Language Game Enhances LLM Reasoning
arXiv 2024
Are Large Language Models Good Prompt Optimizers?
arXiv 2024
DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining
doremi-optimizing-data-mixtures-speeds-up
On Diversified Preferences of Large Language Model Alignment
arXiv 2023
ReAct: Synergizing Reasoning and Acting in Language Models
arXiv 2022
ST-MoE: Designing Stable and Transferable Sparse Expert Models
arXiv 2022
Affiliations
Frequent co-authors
10from 10 papers