Cite
Notes
Only stored in your browser.
Attribution
Post-Trained MoE Can Skip Half Experts via Self-Distillation
arXiv 2026
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
arXiv 2025
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
from 3 papers
Guohao Dai
Huazhong Yang
Tianyu Fu
Yu Wang
Bingning Wang
Bowen Zhou
professor
Enshu Liu
Fan Yang
Ganqu Cui
researcher
Junlin Yang