Abhinav Shrivastava
- Papers
- 19
Authored papers: 19
Scale Space Diffusion
arXiv 2026
Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory
arXiv 2026
UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders
arXiv 2026
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
arXiv 2025
CoLLM: A Large Language Model for Composed Image Retrieval
CVPR 2025 1
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
CVPR 2024 1
Measuring Style Similarity in Diffusion Models
arXiv 2024
QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos
arXiv 2024
LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
arXiv 2024
Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization
arXiv 2023
EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS
arXiv 2023
Align and Attend: Multimodal Summarization with Dual Contrastive Losses
CVPR 2023 1
HNeRV: A Hybrid Neural Representation for Videos
CVPR 2023 1
Do text-free diffusion models learn discriminative visual representations?
arXiv 2023
$BT^2$: Backward-compatible Training with Basis Transformation
arXiv 2022
NIRVANA: Neural Implicit Representations of Videos with Adaptive Networks and Autoregressive Patch-wise Modeling
CVPR 2023 1
Teaching Matters: Investigating the Role of Supervision in Vision Transformers
CVPR 2023 1
Towards Discovery and Attribution of Open-world GAN Generated Images
ICCV 2021 10
Affiliations
Frequent co-authors
10 from 19 papers