Zhen Zheng

Cite

Notes

Only stored in your browser.

Attribution

1papers

Authored papers

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

arXiv 2023

No known affiliations.

from 1 papers

Donglin Zhuang

Haojun Xia

Shuaiwen Leon Song

Wei Lin

Xiafei Qiu

Yong Li

Yuchao Li

Zhongzhu Zhou