HyungJun Kim
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning
arXiv 2025
SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
arXiv 2024
QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference
arXiv 2024
OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers