Dan Jurafsky
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory
arXiv 2025
KTO: Model Alignment as Prospect Theoretic Optimization
arXiv 2024
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
arXiv 2024
ReFT: Representation Finetuning for Language Models
arXiv 2024
Dialect prejudice predicts AI decisions about people's character, employability, and criminality
arXiv 2024
Belief in the Machine: Investigating Epistemological Blind Spots of Language Models
arXiv 2024
Can Unconfident LLM Annotations Be Used for Confident Conclusions?
arXiv 2024
Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions
arXiv 2023
Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models
arXiv 2023
When and why vision-language models behave like bags-of-words, and what to do about it?
arXiv 2022
Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset
arXiv 2022
Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale
arXiv 2022
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding
arXiv 2022
Focus on what matters: Applying Discourse Coherence Theory to Cross Document Coreference
EMNLP 2021 11
With Little Power Comes Great Responsibility
EMNLP 2020 11
Automatically Neutralizing Subjective Bias in Text
arXiv 2019
Affiliations
Frequent co-authors
10from 16 papers
Federico Bianchi
James Zou
Mirac Suzgun
grad-student
Aryaman Arora
Christopher D. Manning
Christopher Potts
Daniel E. Ho
Esin Durmus
Mert Yuksekgonul
Myra Cheng