Daize Dong
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
arXiv 2024
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
arXiv 2024
A Graph is Worth $K$ Words: Euclideanizing Graph using Pure Transformer
arXiv 2024
Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
arXiv 2024
SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers