Zhenpeng Su
- Papers: 9
Authored papers
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment
arXiv 2026
CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning
arXiv 2025
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
arXiv 2025
LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference
arXiv 2025
Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
arXiv 2024
MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts
arXiv 2024
CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts
arXiv 2024
HC3 Plus: A Semantic-Invariant Human ChatGPT Comparison Corpus
arXiv 2023
T5-SR: A Unified Seq-to-Seq Decoding Strategy for Semantic Parsing
arXiv 2023
Affiliations
Frequent co-authors
10 from 9 papers