Tim Dettmers
- Papers: 12
Authored papers
SERA: Soft-Verified Efficient Repository Agents
arXiv 2026
OLMoE: Open Mixture-of-Experts Language Models
arXiv 2024
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
arXiv 2024
QLoRA: Efficient Finetuning of Quantized LLMs
NeurIPS 2023
Stable and low-precision training for large-scale vision-language models
NeurIPS 2023
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
arXiv 2023
SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient
Petals: Collaborative Inference and Fine-tuning of Large Models
arXiv 2022
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
arXiv 2022
Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models
arXiv 2022
8-bit Optimizers via Block-wise Quantization
Sparse Networks from Scratch: Faster Training without Losing Performance
Affiliations
Frequent co-authors
10 from 12 papers