Sachin Kumar
- Papers: 10
Authored papers (10)
TESS 2: A Large-Scale Generalist Diffusion Language Model
arXiv 2025
FLEXITOKENS: Flexible Tokenization for Evolving Language Models
arXiv 2025
BLAB: Brutally Long Audio Bench
arXiv 2025
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
arXiv 2024
RewardBench: Evaluating Reward Models for Language Modeling
arXiv 2024
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models
arXiv 2024
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback
arXiv 2024
Overriding Safety protections of Open-source Models
arXiv 2024
Assessing Language Model Deployment with Risk Cards
arXiv 2023
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control
arXiv 2022
Frequent co-authors
10 from 10 papers
Noah A. Smith
Hannaneh Hajishirzi
professor
Yulia Tsvetkov
Faeze Brahman
researcher
Jacob Morrison
research-engineer
Khyathi Chandu
Nathan Lambert
researcher
Nouha Dziri
researcher
Orevaoghene Ahia
Valentin Hofmann