Siheng Li
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9RePO: Replay-Enhanced Policy Optimization
arXiv 2025
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
arXiv 2025
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability
arXiv 2024
Large Language Models Can Self-Improve in Long-context Reasoning
arXiv 2024
A Survey on the Honesty of Large Language Models
arXiv 2024
Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast
arXiv 2024
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation
arXiv 2024
LLM2: Let Large Language Models Harness System 2 Reasoning
arXiv 2024
Question Answering as Programming for Solving Time-Sensitive Questions
arXiv 2023
Affiliations
Frequent co-authors
10from 9 papers