Nan Du

Papers: 10

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

arXiv 2025

AXLearn: Modular Large Model Training on Heterogeneous Infrastructure

arXiv 2025

Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding

arXiv 2025

MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity

arXiv 2024

Self-playing Adversarial Language Game Enhances LLM Reasoning

arXiv 2024

Are Large Language Models Good Prompt Optimizers?

arXiv 2024

On Diversified Preferences of Large Language Model Alignment

arXiv 2023

DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining

2023

ReAct: Synergizing Reasoning and Acting in Language Models

arXiv 2022

ST-MoE: Designing Stable and Transferable Sparse Expert Models

arXiv 2022

Affiliations

No known affiliations.

Frequent co-authors

from 10 papers

Jian Li

Pengyu Cheng

Ruotian Ma

Tianhao Hu

Xiaolong Li

Xin Zhou

Yong Dai

Adams Wei Yu

Bang Zhang

Barret Zoph

founder

1 shared paper