Cite
Notes
Only stored in your browser.
Attribution
A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency
arXiv 2025
Quantune: Post-training Quantization of Convolutional Neural Networks using Extreme Gradient Boosting for Fast Deployment
arXiv 2022
from 2 papers
Byung-soo Kim
Chaelyn Lee
Misun Yu
Seokhun Jeon
Sihyeong Park
Sungryeol Jeon
TaeHo Kim
Yongin Kwon