Xinwei Long
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4SSRL: Self-Search Reinforcement Learning
arXiv 2025
A Survey of Reinforcement Learning for Large Reasoning Models
arXiv 2025
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning
arXiv 2023
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers