Fanxu Meng

Cite

Notes

Only stored in your browser.

Attribution

5papers

Authored papers

TransMLA: Multi-Head Latent Attention Is All You Need

arXiv 2025

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference

arXiv 2025

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

arXiv 2025

CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-Tuning

arXiv 2024

Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners

arXiv 2023

No known affiliations.

from 5 papers

Muhan Zhang

Xiaojuan Tang

Di Yin

Pingzhi Tang

Xing Sun

Bo Ke

Daohai Yu

Fan Jiang

Haodong Lin

Haotong Yang