aditi raghunathan

Papers: 12

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

12papers

Authored papers

One-step Language Modeling via Continuous Denoising

arXiv 2026

2026

Base Models Look Human To AI Detectors

arXiv 2026

2026

Terminal Wrench: A Dataset of 331 Reward-Hackable Environments and 3,632 Exploit Trajectories

arXiv 2026

2026

Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

arXiv 2025

2025

Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions

arXiv 2025

2025

Jailbreaking in the Haystack

arXiv 2025

2025

Repetition Improves Language Model Embeddings

arXiv 2024

2024

Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic

arXiv 2024

2024

Dissecting Adversarial Robustness of Multimodal LM Agents

arXiv 2024

2024

Understanding Catastrophic Forgetting in Language Models via Implicit Inference

arXiv 2023

2023

An Explanation of In-context Learning as Implicit Bayesian Inference

an-explanation-of-in-context-learning-as

2021

The Pitfalls of Simplicity Bias in Neural Networks

NeurIPS 2020 12

2020

Affiliations

No known affiliations.

Frequent co-authors

from 12 papers

Chen Henry Wu

3 shared papers

Ziqian Zhong

3 shared papers

Daniel Fried

professor

2 shared papers

Graham Neubig

professor

2 shared papers

J. Zico Kolter

2 shared papers

Jacob Mitchell Springer

Shashwat Saxena

Suhas Kotha

Alexander Robey

Amanda Bertsch