Dan Jurafsky

Papers: 16

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

16papers

Authored papers

Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory

arXiv 2025

2025

KTO: Model Alignment as Prospect Theoretic Optimization

arXiv 2024

2024

CausalGym: Benchmarking causal interpretability methods on linguistic tasks

arXiv 2024

2024

ReFT: Representation Finetuning for Language Models

arXiv 2024

2024

Dialect prejudice predicts AI decisions about people's character, employability, and criminality

arXiv 2024

2024

Belief in the Machine: Investigating Epistemological Blind Spots of Language Models

arXiv 2024

2024

Can Unconfident LLM Annotations Be Used for Confident Conclusions?

arXiv 2024

2024

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

arXiv 2023

2023

Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models

arXiv 2023

2023

When and why vision-language models behave like bags-of-words, and what to do about it?

arXiv 2022

2022

Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset

arXiv 2022

2022

Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding

arXiv 2022

2022

Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale

arXiv 2022

2022

Focus on what matters: Applying Discourse Coherence Theory to Cross Document Coreference

EMNLP 2021 11

2021

With Little Power Comes Great Responsibility

EMNLP 2020 11

2020

Automatically Neutralizing Subjective Bias in Text

arXiv 2019

2019

Affiliations

No known affiliations.

Frequent co-authors

from 16 papers

Federico Bianchi

5 shared papers

James Zou

5 shared papers

Mirac Suzgun

grad-student

4 shared papers

Aryaman Arora

2 shared papers

Christopher D. Manning

Christopher Potts

Daniel E. Ho

Esin Durmus

Mert Yuksekgonul

Myra Cheng