Zeyu Huang
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6A Controllable Examination for Long-Context Language Models
arXiv 2025
Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training
arXiv 2024
Layerwise Recurrent Router for Mixture-of-Experts
arXiv 2024
Unlocking Continual Learning Abilities in Language Models
arXiv 2024
Unlocking Emergent Modularity in Large Language Models
arXiv 2023
Mixture of Attention Heads: Selecting Attention Heads Per Token
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers