Cite
Notes
Only stored in your browser.
Attribution
WebLLM: A High-Performance In-Browser LLM Inference Engine
arXiv 2024
XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
from 3 papers
Charlie F. Ruan
Ruihang Lai
Tianqi Chen
Yilong Zhao
Aixin Liu
Bei Feng
Bin Wang
Bingxuan Wang
Bo Liu
researcher
Bohan Hou