Long Phan
Center for AI Safety (CAIS) researcher; co-organizer of MMLU-Pro and Humanity's Last Exam benchmarks.
- Role
- researcher
- Currently at
- Center for AI Safety (CAIS)
- twitter.com/justinphan3110
- Scholar
- scholar.google.com/citations
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12Humanity's Last Exam
preprint
TextQuests: How Good are LLMs at Text-Based Video Games?
arXiv 2025
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
arXiv 2024
Tamper-Resistant Safeguards for Open-Weight LLMs
arXiv 2024
Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation
arXiv 2024
Improving Alignment and Robustness with Circuit Breakers
arXiv 2024
Representation Engineering: A Top-Down Approach to AI Transparency
arXiv 2023
ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation
NAACL (ACL) 2022 7
MTet: Multi-domain Translation for English and Vietnamese
arXiv 2022
Enriching Biomedical Knowledge for Low-resource Language Through Large-Scale Translation
arXiv 2022
SPBERT: An Efficient Pre-training BERT on SPARQL Queries for Question Answering over Knowledge Graphs
arXiv 2021
CoTexT: Multi-task Learning with Code-Text Transformer
ACL (NLP4Prog) 2021 8
Affiliations
Frequent co-authors
10from 12 papers
Dan Hendrycks
director
Hieu Tran
Andy Zou
founder
Mantas Mazeika
researcher
Hieu Nguyen
Nathaniel Li
grad-student
Trieu H. Trinh
Alice Gatti
researcher
Bo Li
Dawn Song
professor