Nicolas Flammarion

Papers: 13

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

13papers

Authored papers

HalluHard: A Hard Multi-Turn Hallucination Benchmark

arXiv 2026

2026

OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents

arXiv 2025

2025

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

arXiv 2024

2024

Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs

arXiv 2024

2024

Does Refusal Training in LLMs Generalize to the Past Tense?

arXiv 2024

2024

Is In-Context Learning Sufficient for Instruction Following in LLMs?

arXiv 2024

2024

Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning

arXiv 2024

2024

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks

arXiv 2024

2024

Towards Unified Benchmark and Models for Multi-Modal Perceptual Metrics

arXiv 2024

2024

Why Do We Need Weight Decay in Modern Deep Learning?

arXiv 2023

2023

A Modern Look at the Relationship between Sharpness and Generalization

arXiv 2023

2023

Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings

transferable-adversarial-robustness-for

2023

SGD with Large Step Sizes Learns Sparse Features

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

from 13 papers

Maksym Andriushchenko

Francesco Croce

Hao Zhao

Aditya Varre

Florian Tramer

Agatha Duzan

Alexander Robey

Carmela Troncoso

Dongyang Fan

Edgar Dobriban