Muhan Zhang
- Papers: 17
Authored papers (17)
TransMLA: Multi-Head Latent Attention Is All You Need
arXiv 2025
Griffin: Towards a Graph-Centric Relational Database Foundation Model
arXiv 2025
PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models
arXiv 2025
HD-PiSSA: High-Rank Distributed Orthogonal Adaptation
arXiv 2025
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference
arXiv 2025
What Affects the Effective Depth of Large Language Models?
arXiv 2025
CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-Tuning
arXiv 2024
Number Cookbook: Number Understanding of Language Models and How to Improve It
arXiv 2024
Unifying Generation and Prediction on Graphs with Latent Graph Diffusion
arXiv 2024
LooGLE: Can Long-Context Language Models Understand Long Contexts?
arXiv 2023
Neural Common Neighbor with Completion for Link Prediction
arXiv 2023
From Relational Pooling to Subgraph GNNs: A Universal Framework for More Expressive Graph Neural Networks
arXiv 2023
One for All: Towards Training One Graph Model for All Classification Tasks
arXiv 2023
VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs
arXiv 2023
Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners
arXiv 2023
On the Stability of Expressive Positional Encodings for Graphs
arXiv 2023
Decoupling the Depth and Scope of Graph Neural Networks
Frequent co-authors
10 from 17 papers