Linfeng Song
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation
arXiv 2025
DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning
arXiv 2025
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
arXiv 2025
Don't Get Lost in the Trees: Streamlining LLM Reasoning by Overcoming Tree Search Exploration Pitfalls
arXiv 2025
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
arXiv 2024
Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal
arXiv 2024
The Trickle-down Impact of Reward (In-)consistency on RLHF
arXiv 2023
Enhanced Aspect-Based Sentiment Analysis Models with Progressive Self-supervised Attention Learning
arXiv 2021
Affiliations
Frequent co-authors
10from 8 papers