Weigao Sun

Papers: 12

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

arXiv 2025

Liger: Linearizing Large Language Models to Gated Recurrent Structures

arXiv 2025

CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models

arXiv 2025

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

arXiv 2025

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

arXiv 2025

Native Hybrid Attention for Efficient Sequence Modeling

arXiv 2025

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

arXiv 2024

LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

arXiv 2024

Linear Attention Sequence Parallelism

arXiv 2024

CO2: Efficient Distributed Training with Full Communication-Computation Overlap

arXiv 2024

Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention

arXiv 2024

Scaling Laws for Linear Complexity Language Models

arXiv 2024

Affiliations

No known affiliations.

Frequent co-authors

from 12 papers

Yu Cheng

Dong Li

Xuyang Shen

Yiran Zhong

Zhen Qin

Xiaoye Qu

Disen Lan

Jiaxi Hu

Jusen Du

Tong Zhu