Cite
Notes
Only stored in your browser.
Attribution
Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models
arXiv 2026
Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging
arXiv 2025
from 2 papers
Sirui Han
Yike Guo
Dapeng Wu
Hao Wang
Hongming Piao
Jiacheng Wang
Kaixiong Gong
Lujun Li
Wei Li
Xiangyu Yue