Haotong Qin
- Papers
- 19
Cite
Notes
Only stored in your browser.
Authored papers
19WorldCache: Accelerating World Models for Free via Heterogeneous Token Caching
arXiv 2026
Fast-SAM3D: 3Dfy Anything in Images but Faster
arXiv 2026
Echo-Forcing: A Scene Memory Framework for Interactive Long Video Generation
arXiv 2026
Low-bit Model Quantization for Deep Neural Networks: A Survey
arXiv 2025
An Empirical Study of Qwen3 Quantization
arXiv 2025
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language Models
arXiv 2025
Target-Bench: Can World Models Achieve Mapless Path Planning with Semantic Targets?
arXiv 2025
Quantized Visual Geometry Grounded Transformer
arXiv 2025
First-Order Error Matters: Accurate Compensation for Quantized Large Language Models
arXiv 2025
FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing
arXiv 2025
Accurate LoRA-Finetuning Quantization of LLMs via Information Retention
arXiv 2024
Binarized Diffusion Model for Image Super-Resolution
arXiv 2024
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models
arXiv 2024
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
arXiv 2024
An empirical study of LLaMA3 quantization: from LLMs to MLLMs
arXiv 2024
BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models
arXiv 2024
BiBench: Benchmarking and Analyzing Network Binarization
arXiv 2023
How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges
arXiv 2023
BiBERT: Accurate Fully Binarized BERT
bibert-accurate-fully-binarized-bert
Affiliations
Frequent co-authors
10from 19 papers