Zhipeng Chen
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration
arXiv 2026
Seed1.5-VL Technical Report
arXiv 2025
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
arXiv 2025
An Empirical Study on Eliciting and Improving R1-like Reasoning Models
arXiv 2025
Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models
arXiv 2025
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
arXiv 2025
Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models
arXiv 2025
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
arXiv 2024
Towards Effective and Efficient Continual Pre-training of Large Language Models
arXiv 2024
YuLan: An Open-source Large Language Model
arXiv 2024
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models
arXiv 2024
A Survey of Large Language Models
arXiv 2023
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models
arXiv 2023
A Sentence Cloze Dataset for Chinese Machine Reading Comprehension
COLING 2020 8
Affiliations
Frequent co-authors
10from 14 papers