Peter Henderson
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14Temporally Extended Mixture-of-Experts Models
arXiv 2026
LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?
arXiv 2025
Dynamic Risk Assessments for Offensive Cybersecurity Agents
arXiv 2025
FrontierCS: Evolving Challenges for Evolving Intelligence
arXiv 2025
Safety Alignment Should Be Made More Than Just a Few Tokens Deep
arXiv 2024
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors
arXiv 2024
LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain
arXiv 2024
On Evaluating the Durability of Safeguards for Open-Weight LLMs
arXiv 2024
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
legalbench-a-collaboratively-built-benchmark
Visual Adversarial Examples Jailbreak Aligned Large Language Models
arXiv 2023
Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
arXiv 2023
Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset
arXiv 2022
When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset
arXiv 2021
With Little Power Comes Great Responsibility
EMNLP 2020 11
Affiliations
Frequent co-authors
10from 14 papers