Zaoxing Liu

Cite

Notes

Only stored in your browser.

Attribution

2papers

Authored papers

NetPress: Dynamically Generated LLM Benchmarks for Network Applications

arXiv 2025

GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM

arXiv 2024

No known affiliations.

from 2 papers

Eric S. Wang

Francis Y. Yan

Geonhwa Jeong

Hao Kang

Jiajun Ruan

Kevin Hsieh

Qingru Zhang

Sadjad Fouladi

Souvik Kundu

Tuo Zhao