Jiahao Xu

Papers: 11

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

11papers

Authored papers

Free(): Learning to Forget in Malloc-Only Reasoning Models

arXiv 2026

2026

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

arXiv 2025

2025

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

arXiv 2025

2025

The End of Manual Decoding: Towards Truly End-to-End Language Models

arXiv 2025

2025

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards

arXiv 2025

2025

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

arXiv 2025

2025

FanChuan: A Multilingual and Graph-Structured Benchmark For Parody Detection and Analysis

arXiv 2025

2025

(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts

arXiv 2024

2024

Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability

arXiv 2024

2024

Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

arXiv 2024

2024

Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding

arXiv 2024

2024

Affiliations

No known affiliations.

Frequent co-authors

from 11 papers

Tian Liang

Zhaopeng Tu

Haitao Mi

Zhiwei He

Dong Yu

Rui Wang

Wenxuan Wang

Dongyang Ma

Lihui Chen

Linfeng Song