Tianqi Chen
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving
arXiv 2025
Tilus: A Virtual Machine for Arbitrary Low-Precision GPGPU Computation in LLM Serving
arXiv 2025
Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation
arXiv 2025
Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions
arXiv 2025
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding
arXiv 2025
WebLLM: A High-Performance In-Browser LLM Inference Engine
arXiv 2024
XGrammar: Flexible and Efficient Structured Generation Engine for Large Language Models
arXiv 2024
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding
arXiv 2024
Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
arXiv 2024
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
arXiv 2024
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
arXiv 2023
Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling
arXiv 2023
XGBoost: A Scalable Tree Boosting System
arXiv 2016
Affiliations
Frequent co-authors
10from 13 papers