Zeynep Akata
- Papers
- 38
Cite
Notes
Only stored in your browser.
Authored papers
38Explaining CLIP Zero-shot Predictions Through Concepts
arXiv 2026
FINER: MLLMs Hallucinate under Fine-grained Negative Queries
arXiv 2026
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models
arXiv 2025
Understanding the Limits of Lifelong Knowledge Editing in LLMs
arXiv 2025
DeLoRA: Decoupling Angles and Strength in Low-rank Adaptation
arXiv 2025
Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models
arXiv 2025
Time Series Representations for Classification Lie Hidden in Pretrained Vision Transformers
arXiv 2025
FLAIR: VLM with Fine-grained Language-informed Image Representations
CVPR 2025 1
ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
arXiv 2024
A Practitioner's Guide to Continual Multimodal Pretraining
arXiv 2024
EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
arXiv 2024
ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections
arXiv 2024
DataDream: Few-shot Guided Dataset Generation
arXiv 2024
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
CVPR 2025 1
Post-hoc Probabilistic Vision-Language Models
arXiv 2024
Vision-by-Language for Training-Free Compositional Image Retrieval
arXiv 2023
If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection
arXiv 2023
Bridging the Gap between Model Explanations in Partially Annotated Multi-label Classification
CVPR 2023 1
Image-free Classifier Injection for Zero-Shot Classification
ICCV 2023 1
Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model
arXiv 2023
Video-adverb retrieval with compositional adverb-action embeddings
arXiv 2023
Waffling around for Performance: Visual Classification with Random Words and Broad Concepts
ICCV 2023 1
ProbVLM: Probabilistic Adapter for Frozen Vision-Language Models
ICCV 2023 1
Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation
arXiv 2023
Iterative Superquadric Recomposition of 3D Objects from Multiple Views
ICCV 2023 1
PDiscoNet: Semantically consistent part discovery for fine-grained recognition
ICCV 2023 1
DeViL: Decoding Vision features into Language
arXiv 2023
Text-to-feature diffusion for audio-visual few-shot learning
arXiv 2023
BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks
arXiv 2022
Temporal and cross-modal attention for audio-visual zero-shot learning
arXiv 2022
PlanT: Explainable Planning Transformers via Object-Level Representations
arXiv 2022
Audio Retrieval with Natural Language Queries
arXiv 2021
Learning Graph Embeddings for Compositional Zero-shot Learning
CVPR 2021 1
Keep CALM and Improve Visual Feature Attribution
ICCV 2021 10
Audio Retrieval with Natural Language Queries: A Benchmark Study
arXiv 2021
Evaluation for Weakly Supervised Object Localization: Protocol, Metrics, and Datasets
arXiv 2020
Generalized Zero- and Few-Shot Learning via Aligned Variational Autoencoders
arXiv 2018
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
multimodal-explanations-justifying-decisions-1
Affiliations
Frequent co-authors
10from 38 papers
Shyamgopal Karthik
A. Sophia Koepke
Karsten Roth
Stephan Alaniz
Massimiliano Mancini
Jae Myung Kim
Otniel-Bogdan Mercea
Thomas Hummel
Cordelia Schmid
Luca Eyring