Kristian Kersting
- Papers
- 37
Cite
Notes
Only stored in your browser.
Authored papers
37METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding
arXiv 2025
Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models
arXiv 2025
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding
arXiv 2025
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming
arXiv 2024
T-FREE: Subword Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings
arXiv 2024
LlavaGuard: An Open VLM-based Framework for Safeguarding Vision Datasets and Models
arXiv 2024
SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs
arXiv 2024
Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You
arXiv 2024
Answer Set Networks: Casting Answer Set Programming into Deep Learning
arXiv 2024
Right on Time: Revising Time Series Models by Constraining their Explanations
arXiv 2024
Checkmating One, by Using Many: Combining Mixture of Experts with MCTS to Improve in Chess
arXiv 2024
AtMan: Understanding Transformer Predictions Through Memory Efficient Attention Manipulation
NeurIPS 2023 11
Probabilistic Circuits That Know What They Don't Know
arXiv 2023
V-LoL: A Diagnostic Dataset for Visual Logical Learning
arXiv 2023
Class Attribute Inference Attacks: Inferring Sensitive Class Information by Diffusion-Based Attribute Manipulations
arXiv 2023
Fair Diffusion: Instructing Text-to-Image Generation Models on Fairness
arXiv 2023
Be Careful What You Smooth For: Label Smoothing Can Be a Privacy Shield but Also a Catalyst for Model Inversion Attacks
arXiv 2023
Vision Relation Transformer for Unbiased Scene Graph Generation
ICCV 2023 1
Defending Our Privacy With Backdoors
arXiv 2023
Leveraging Diffusion-Based Image Variations for Robust Training on Poisoned Data
arXiv 2023
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models
CVPR 2023 1
Does CLIP Know My Face?
arXiv 2022
Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis
arXiv 2022
A Typology for Exploring the Mitigation of Shortcut Behavior
arXiv 2022
ILLUME: Rationalizing Vision-Language Models through Human Interactions
arXiv 2022
Rickrolling the Artist: Injecting Backdoors into Text Encoders for Text-to-Image Synthesis
ICCV 2023 1
Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content?
arXiv 2022
Revision Transformers: Instructing Language Models to Change their Values
arXiv 2022
Speaking Multiple Languages Affects the Moral Bias of Language Models
arXiv 2022
Inferring Offensiveness In Images From Natural Language Supervision
inferring-offensiveness-in-images-from-1
Adaptive Rational Activations to Boost Deep Reinforcement Learning
arXiv 2021
Interactively Providing Explanations for Transformer Language Models
interactively-generating-explanations-for-1
Large Pre-trained Language Models Contain Human-like Biases of What is Right and Wrong to Do
arXiv 2021
To Trust or Not To Trust Prediction Scores for Membership Inference Attacks
arXiv 2021
TUDataset: A collection of benchmark datasets for learning with graphs
arXiv 2020
Making deep neural networks right for the right scientific reasons by interacting with their explanations
arXiv 2020
Padé Activation Units: End-to-end Learning of Flexible Activation Functions in Deep Networks
ICLR 2020 1
Affiliations
Frequent co-authors
10from 37 papers
Patrick Schramowski
Manuel Brack
Felix Friedrich
Dominik Hintersdorf
Lukas Struppek
Björn Deiseroth
Wolfgang Stammer
Devendra Singh Dhami
Alejandro Molina
Alexander Fraser