Seungju Han
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Representation Bending for Large Language Model Safety
arXiv 2025
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
arXiv 2025
Verifying the Verifiers: Unveiling Pitfalls and Potentials in Fact Verifiers
arXiv 2025
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
arXiv 2024
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models
arXiv 2024
Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding
arXiv 2024
AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text
arXiv 2024
CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos
ICCV 2023 1
SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models
arXiv 2023
Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few Utterances
NAACL 2022 7
Affiliations
Frequent co-authors
10from 10 papers
Yejin Choi
professor
Youngjae Yu
Nouha Dziri
researcher
Allyson Ettinger
Liwei Jiang
Ximing Lu
Ashkan Yousefpour
Jack Hessel
researcher
Kavel Rao
Niloofar Mireshghallah