Cite
Notes
Only stored in your browser.
Attribution
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs
arXiv 2025
Hawk: Learning to Understand Open-World Video Anomalies
arXiv 2024
Inference Performance Optimization for Large Language Models on CPUs
from 3 papers
Changqing Li
Chen Meng
Cheng Fang
Duyi Wang
Hao Lu
Jiangbo Lu
Jiaqi Tang
Ke Ma
Lixing Shen
Pujiang He