Cite
Notes
Only stored in your browser.
Attribution
GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance
arXiv 2025
Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming
arXiv 2023
from 2 papers
Hyun Oh Song
Jinuk Kim
Clemens JS Schaefer
Jae W. Lee
Marwa El Halabi
Wonpyo Park
Yeonhong Park
Yeonwoo Jeong