Tim Dettmers

Papers: 12

Attribution

Affiliations & profile: Semantic Scholar

12 papers

Authored papers

SERA: Soft-Verified Efficient Repository Agents

arXiv 2026

2026

OLMoE: Open Mixture-of-Experts Language Models

arXiv 2024

2024

Scaling Retrieval-Based Language Models with a Trillion-Token Datastore

arXiv 2024

2024

Stable and low-precision training for large-scale vision-language models

NeurIPS 2023 11

2023

QLoRA: Efficient Finetuning of Quantized LLMs

NeurIPS 2023 11

2023

SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression

arXiv 2023

2023

SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient

2023

Petals: Collaborative Inference and Fine-tuning of Large Models

arXiv 2022

2022

LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale

arXiv 2022

2022

Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models

arXiv 2022

2022

8-bit Optimizers via Block-wise Quantization

2021

Sparse Networks from Scratch: Faster Training without Losing Performance

2019

Affiliations

No known affiliations.

Frequent co-authors

from 12 papers

Luke Zettlemoyer

professor

7 shared papers

Alexander Borzunov

3 shared papers

Ali Farhadi

CEO

Mike Lewis

Max Ryabinin

Noah A. Smith

Pang Wei Koh

Sewon Min

Weijia Shi

researcher

2 shared papers

Younes Belkada

2 shared papers