Ivan Laptev
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11AGORA: Adversarial Generation Of Real-time Animatable 3D Gaussian Head Avatars
arXiv 2025
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
arXiv 2025
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding
arXiv 2025
Mitigating Object Hallucination via Concentric Causal Attention
arXiv 2024
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
CVPR 2025 1
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
CVPR 2023 1
PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation
arXiv 2023
Learning to Answer Visual Questions from Web Videos
arXiv 2022
Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
arXiv 2022
TubeDETR: Spatio-Temporal Video Grounding with Transformers
CVPR 2022 1
Cross-task weakly supervised learning from instructional videos
cross-task-weakly-supervised-learning-from-1
Affiliations
Frequent co-authors
10from 11 papers