Chenyang Song
5 papers
Authored papers
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity
arXiv 2025
Cost-Optimal Grouped-Query Attention for Long-Context Modeling
arXiv 2025
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models
arXiv 2024
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
arXiv 2024
ConPET: Continual Parameter-Efficient Tuning for Large Language Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers