aditi raghunathan
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12One-step Language Modeling via Continuous Denoising
arXiv 2026
Base Models Look Human To AI Detectors
arXiv 2026
Terminal Wrench: A Dataset of 331 Reward-Hackable Environments and 3,632 Exploit Trajectories
arXiv 2026
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction
arXiv 2025
Jailbreaking in the Haystack
arXiv 2025
Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions
arXiv 2025
Repetition Improves Language Model Embeddings
arXiv 2024
Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic
arXiv 2024
Dissecting Adversarial Robustness of Multimodal LM Agents
arXiv 2024
Understanding Catastrophic Forgetting in Language Models via Implicit Inference
arXiv 2023
An Explanation of In-context Learning as Implicit Bayesian Inference
an-explanation-of-in-context-learning-as
The Pitfalls of Simplicity Bias in Neural Networks
NeurIPS 2020 12
Affiliations
Frequent co-authors
10from 12 papers
Chen Henry Wu
Ziqian Zhong
Daniel Fried
professor
Graham Neubig
professor
J. Zico Kolter
Jacob Mitchell Springer
Shashwat Saxena
Suhas Kotha
Alexander Robey
Amanda Bertsch