Chi-Chih Chang
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7xKV: Cross-Layer KV-Cache Compression via Aligned Singular Vector Extraction
arXiv 2025
Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models
arXiv 2025
SplitReason: Learning To Offload Reasoning
arXiv 2025
UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs
arXiv 2025
TokenButler: Token Importance is Predictable
arXiv 2025
SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUs
arXiv 2025
Palu: Compressing KV-Cache with Low-Rank Projection
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers