Xuefeng Li
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling
arXiv 2026
daVinci-Dev: Agent-native Mid-training for Software Engineering
arXiv 2026
daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently
arXiv 2026
Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles
arXiv 2025
Generative AI Act II: Test Time Scaling Drives Cognition Engineering
arXiv 2025
OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling
arXiv 2025
LIMI: Less is More for Agency
arXiv 2025
LIMR: Less is More for RL Scaling
arXiv 2025
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?
arXiv 2024
OpenResearcher: Unleashing AI for Accelerated Scientific Research
arXiv 2024
Evaluating Mathematical Reasoning Beyond Accuracy
arXiv 2024
Reformatted Alignment
arXiv 2024
MathPile: A Billion-Token-Scale Pretraining Corpus for Math
arXiv 2023
Affiliations
Frequent co-authors
10from 13 papers