Chien-Yu Lin
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5xKV: Cross-Layer KV-Cache Compression via Aligned Singular Vector Extraction
arXiv 2025
UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs
arXiv 2025
NanoFlow: Towards Optimal Large Language Model Serving Throughput
arXiv 2024
Palu: Compressing KV-Cache with Low-Rank Projection
arXiv 2024
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers