Long Phan

Center for AI Safety (CAIS) researcher; co-organizer of MMLU-Pro and Humanity's Last Exam benchmarks.

Role: researcher
Currently at: Center for AI Safety (CAIS)
Twitter: twitter.com/justinphan3110
GitHub: github.com/justinphan3110
Scholar: scholar.google.com/citations
Papers: 12

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: scholar.google.com/citations

Attribution policy →

12papers

Authored papers

Humanity's Last Exam

preprint

2025

TextQuests: How Good are LLMs at Text-Based Video Games?

arXiv 2025

2025

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

arXiv 2024

2024

Improving Alignment and Robustness with Circuit Breakers

arXiv 2024

2024

Tamper-Resistant Safeguards for Open-Weight LLMs

arXiv 2024

2024

Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation

arXiv 2024

2024

Representation Engineering: A Top-Down Approach to AI Transparency

arXiv 2023

2023

ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation

NAACL (ACL) 2022 7

2022

Enriching Biomedical Knowledge for Low-resource Language Through Large-Scale Translation

arXiv 2022

2022

MTet: Multi-domain Translation for English and Vietnamese

arXiv 2022

2022

SPBERT: An Efficient Pre-training BERT on SPARQL Queries for Question Answering over Knowledge Graphs

arXiv 2021

2021

CoTexT: Multi-task Learning with Code-Text Transformer

ACL (NLP4Prog) 2021 8

2021

Affiliations

Currently at

Center for AI Safety (CAIS)

researcher · non profit

Frequent co-authors

from 12 papers

Dan Hendrycks

director

6 shared papers

Hieu Tran

6 shared papers

Andy Zou

founder

5 shared papers

Mantas Mazeika

researcher

4 shared papers

Hieu Nguyen

3 shared papers

Nathaniel Li

grad-student

3 shared papers

Trieu H. Trinh

3 shared papers

Alice Gatti

researcher

2 shared papers

Bo Li

2 shared papers

Dawn Song

professor

2 shared papers