Lili Yu
4 papers
Authored papers
Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity
arXiv 2025
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
arXiv 2024
Byte Latent Transformer: Patches Scale Better Than Tokens
arXiv 2024
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 4 papers