Weigao Sun
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts
arXiv 2025
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
arXiv 2025
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
arXiv 2025
CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models
arXiv 2025
Liger: Linearizing Large Language Models to Gated Recurrent Structures
arXiv 2025
Native Hybrid Attention for Efficient Sequence Modeling
arXiv 2025
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
arXiv 2024
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
arXiv 2024
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention
arXiv 2024
Scaling Laws for Linear Complexity Language Models
arXiv 2024
CO2: Efficient Distributed Training with Full Communication-Computation Overlap
arXiv 2024
Linear Attention Sequence Parallelism
arXiv 2024
Affiliations
Frequent co-authors
10from 12 papers