Paolo Rota
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation
arXiv 2026
EarthMind: Towards Multi-Granular and Multi-Sensor Earth Observation with Large Multimodal Models
arXiv 2025
On Large Multimodal Models as Open-World Image Classifiers
ICCV 2025
Visual Text Processing: A Comprehensive Review and Unified Evaluation
arXiv 2025
Dense Motion Captioning
arXiv 2025
Video-BrowseComp: Benchmarking Agentic Video Research on Open Web
arXiv 2025
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
CVPR 2025 1
Test-Time Zero-Shot Temporal Action Localization
CVPR 2024 1
Automatic benchmarking of large multimodal models via iterative experiment programming
arXiv 2024
Vocabulary-free Image Classification and Semantic Segmentation
arXiv 2024
Vocabulary-free Image Classification
vocabulary-free-image-classification
The Unreasonable Effectiveness of Large Language-Vision Models for Source-free Video Domain Adaptation
ICCV 2023 1
Cluster-level pseudo-labelling for source-free cross-domain facial expression recognition
arXiv 2022
Affiliations
Frequent co-authors
10from 13 papers