Longxu Dou
Authored papers
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
arXiv 2025
FlowReasoner: Reinforcing Query-Level Meta-Agents
arXiv 2025
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
arXiv 2025
Diffusion Language Models are Super Data Learners
arXiv 2025
Efficient Process Reward Model Training via Active Learning
arXiv 2025
Sailor: Open Language Models for South-East Asia
arXiv 2024
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
arXiv 2024
RegMix: Data Mixture as Regression for Language Model Pre-training
arXiv 2024
SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types
arXiv 2024
From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning
arXiv 2023