Jiahao Xu
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11Free(): Learning to Forget in Malloc-Only Reasoning Models
arXiv 2026
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
arXiv 2025
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
arXiv 2025
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
arXiv 2025
The End of Manual Decoding: Towards Truly End-to-End Language Models
arXiv 2025
DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning
arXiv 2025
FanChuan: A Multilingual and Graph-Structured Benchmark For Parody Detection and Analysis
arXiv 2025
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
arXiv 2024
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability
arXiv 2024
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training
arXiv 2024
Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding
arXiv 2024
Affiliations
Frequent co-authors
10from 11 papers