Peter Henderson

Papers: 14

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Temporally Extended Mixture-of-Experts Models

arXiv 2026

LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

arXiv 2025

Dynamic Risk Assessments for Offensive Cybersecurity Agents

arXiv 2025

FrontierCS: Evolving Challenges for Evolving Intelligence

arXiv 2025

Safety Alignment Should Be Made More Than Just a Few Tokens Deep

arXiv 2024

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

arXiv 2024

LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain

arXiv 2024

On Evaluating the Durability of Safeguards for Open-Weight LLMs

arXiv 2024

LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

2023

Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!

arXiv 2023

Visual Adversarial Examples Jailbreak Aligned Large Language Models

arXiv 2023

Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset

arXiv 2022

When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset

arXiv 2021

With Little Power Comes Great Responsibility

EMNLP 2020

Affiliations

No known affiliations.

Frequent co-authors

from 14 papers

Prateek Mittal

Xiangyu Qi

Daniel E. Ho

Boyi Wei

Lucia Zheng

Neel Guha

Tinghao Xie

Zeyu Shen

Aleksandra Korolova

Ashwinee Panda