Fanxu Meng
5 papers
Authored papers
TransMLA: Multi-Head Latent Attention Is All You Need
arXiv 2025
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference
arXiv 2025
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
arXiv 2025
CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-Tuning
arXiv 2024
Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers