Sukjin Hong

Cite

Notes

Only stored in your browser.

Attribution

3papers

Authored papers

RILQ: Rank-Insensitive LoRA-based Quantization Error Compensation for Boosting 2-bit Large Language Model Accuracy

arXiv 2024

Token-Scaled Logit Distillation for Ternary Weight Generative Language Models

token-scaled-logit-distillation-for-ternary

Understanding and Improving Knowledge Distillation for Quantization-Aware Training of Large Transformer Encoders

arXiv 2022

No known affiliations.

from 3 papers

Du-Seong Chang

Jungwook Choi

Minsoo Kim

Janghwan Lee

Sihwa Lee

Euijai Ahn

Geonho Lee

Wonyong Sung