Weijie Liu
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation
arXiv 2026
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
arXiv 2026
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
arXiv 2026
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
arXiv 2025
EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control
arXiv 2025
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
arXiv 2024
MemLong: Memory-Augmented Retrieval for Long Text Modeling
arXiv 2024
CSL: A Large-scale Chinese Scientific Literature Dataset
COLING 2022 10
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities
arXiv 2022
Affiliations
Frequent co-authors
10from 9 papers