Zeynep Akata

Understanding the Limits of Lifelong Knowledge Editing in LLMs

arXiv 2025

Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models

arXiv 2025

Time Series Representations for Classification Lie Hidden in Pretrained Vision Transformers

arXiv 2025

DeLoRA: Decoupling Angles and Strength in Low-rank Adaptation

arXiv 2025

ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization

arXiv 2024

ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections

arXiv 2024

Post-hoc Probabilistic Vision-Language Models

arXiv 2024

FLAIR: VLM with Fine-grained Language-informed Image Representations

CVPR 2025

A Practitioner's Guide to Continual Multimodal Pretraining

arXiv 2024

DataDream: Few-shot Guided Dataset Generation

arXiv 2024

EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval

arXiv 2024

COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training

CVPR 2025

Bridging the Gap between Model Explanations in Partially Annotated Multi-label Classification

CVPR 2023

Image-free Classifier Injection for Zero-Shot Classification

ICCV 2023

Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation

arXiv 2023

Iterative Superquadric Recomposition of 3D Objects from Multiple Views

ICCV 2023

PDiscoNet: Semantically consistent part discovery for fine-grained recognition

ICCV 2023

DeViL: Decoding Vision features into Language

arXiv 2023

Text-to-feature diffusion for audio-visual few-shot learning

arXiv 2023

Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model

arXiv 2023

Video-adverb retrieval with compositional adverb-action embeddings

arXiv 2023

Vision-by-Language for Training-Free Compositional Image Retrieval

arXiv 2023

Waffling around for Performance: Visual Classification with Random Words and Broad Concepts

ICCV 2023

ProbVLM: Probabilistic Adapter for Frozen Vision-Language Models

ICCV 2023

If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection

arXiv 2023

Temporal and cross-modal attention for audio-visual zero-shot learning

arXiv 2022

PlanT: Explainable Planning Transformers via Object-Level Representations

arXiv 2022

BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks

arXiv 2022

Learning Graph Embeddings for Compositional Zero-shot Learning

CVPR 2021

Keep CALM and Improve Visual Feature Attribution

ICCV 2021

Audio Retrieval with Natural Language Queries: A Benchmark Study

arXiv 2021

Audio Retrieval with Natural Language Queries

arXiv 2021