Sukjin Hong
- Papers
- 3
Cite
Notes
Only stored in your browser.
3papers
Authored papers
3RILQ: Rank-Insensitive LoRA-based Quantization Error Compensation for Boosting 2-bit Large Language Model Accuracy
arXiv 2024
Token-Scaled Logit Distillation for Ternary Weight Generative Language Models
token-scaled-logit-distillation-for-ternary
Understanding and Improving Knowledge Distillation for Quantization-Aware Training of Large Transformer Encoders
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
8from 3 papers