Cite
Notes
Only stored in your browser.
Attribution
LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs
arXiv 2025
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
arXiv 2024
CursorCore: Assist Programming through Aligning Anything
from 3 papers
Yang Wang
Cheng Li
Hao Jiang
Jason Cong
Jicheng Wen
Li Lyna Zhang
Mao Yang
Qi Liu
Rui Li
Rui Ma