Nicolas Flammarion
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13HalluHard: A Hard Multi-Turn Hallucination Benchmark
arXiv 2026
OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents
arXiv 2025
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
arXiv 2024
Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs
arXiv 2024
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks
arXiv 2024
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
arXiv 2024
Towards Unified Benchmark and Models for Multi-Modal Perceptual Metrics
arXiv 2024
Does Refusal Training in LLMs Generalize to the Past Tense?
arXiv 2024
Is In-Context Learning Sufficient for Instruction Following in LLMs?
arXiv 2024
Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings
transferable-adversarial-robustness-for
Why Do We Need Weight Decay in Modern Deep Learning?
arXiv 2023
A Modern Look at the Relationship between Sharpness and Generalization
arXiv 2023
SGD with Large Step Sizes Learns Sparse Features
arXiv 2022
Affiliations
Frequent co-authors
10from 13 papers