Lorenzo Baraldi
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering
arXiv 2026
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
CVPR 2025 1
RAID: A Dataset for Testing the Adversarial Robustness of AI-Generated Image Detectors
arXiv 2025
LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning
arXiv 2025
Hyperbolic Safety-Aware Vision-Language Models
CVPR 2025 1
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation
ICCV 2025
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities
arXiv 2024
Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization
arXiv 2024
BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues
arXiv 2024
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
CVPR 2025 1
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models
arXiv 2023
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning
ICCV 2023 1
Evaluating Synthetic Pre-Training for Handwriting Processing Tasks
arXiv 2023
Affiliations
Frequent co-authors
10from 13 papers