Patrick Pérez
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15One View Is Enough! Monocular Training for In-the-Wild Novel View Generation
arXiv 2026
High-Fidelity Simultaneous Speech-To-Speech Translation
arXiv 2025
Vision-Speech Models: Teaching Speech Models to Converse about Images
arXiv 2025
CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion
arXiv 2025
ARC-Encoder: learning compressed text representations for large language models
arXiv 2025
Moshi: a speech-text foundation model for real-time dialogue
arXiv 2024
Fine-Tuning CLIP's Last Visual Projector: A Few-Shot Cornucopia
arXiv 2024
Three Pillars improving Vision Foundation Model Distillation for Lidar
CVPR 2024 1
CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation
arXiv 2023
PØDA: Prompt-driven Zero-shot Domain Adaptation
arXiv 2022
Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-modal Distillation
arXiv 2022
OCTET: Object-aware Counterfactual Explanations
CVPR 2023 1
Localizing Objects with Self-Supervised Transformers and no Labels
arXiv 2021
OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning
obow-online-bag-of-visual-words-generation
Zero-Shot Semantic Segmentation
zero-shot-semantic-segmentation
Affiliations
Frequent co-authors
10from 15 papers