Bin Guo

Cite

Notes

Only stored in your browser.

Attribution

3papers

Authored papers

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

arXiv 2025

Inference Performance Optimization for Large Language Models on CPUs

arXiv 2024

Hawk: Learning to Understand Open-World Video Anomalies

arXiv 2024

No known affiliations.

from 3 papers

Changqing Li

Chen Meng

Cheng Fang

Duyi Wang

Hao Lu

Jiangbo Lu

Jiaqi Tang

Ke Ma

Lixing Shen

Pujiang He