Zhongzhu Zhou
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization
arXiv 2026
Introspective Diffusion Language Models
arXiv 2026
Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
arXiv 2025
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning
arXiv 2024
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers