Guanchu Wang

Cite

Notes

Only stored in your browser.

Attribution

4papers

Authored papers

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

arXiv 2025

KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches

arXiv 2024

Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model

winner-take-all-column-row-sampling-for

DIVISION: Memory Efficient Training via Dual Activation Precision

arXiv 2022

No known affiliations.

from 4 papers

Xia Hu

Shaochen Zhong

Zirui Liu

Hongyi Liu

Jiayi Yuan

Vipin Chaudhary

Yu-Neng Chuang

Zhaozhuo Xu

Zhimeng Jiang

Andrew Wen