Pavlo Molchanov
- Papers
- 25
Cite
Notes
Only stored in your browser.
Authored papers
25Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing
arXiv 2026
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
arXiv 2026
C-RADIOv4 (Tech Report)
arXiv 2026
Scaling RL to Long Videos
arXiv 2025
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
arXiv 2025
FeatSharp: Your Vision Model Features, Sharper
arXiv 2025
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
arXiv 2025
Universal Deep Research: Bring Your Own Model and Strategy
arXiv 2025
Scaling Vision Pre-Training to 4K Resolution
CVPR 2025 1
Fast-dLLM v2: Efficient Block-Diffusion LLM
arXiv 2025
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
arXiv 2025
LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement
arXiv 2025
NVILA: Efficient Frontier Visual Language Models
CVPR 2025 1
DoRA: Weight-Decomposed Low-Rank Adaptation
arXiv 2024
RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models
arXiv 2024
VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
CVPR 2025 1
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
arXiv 2024
Hymba: A Hybrid-head Architecture for Small Language Models
arXiv 2024
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation
arXiv 2024
LITA: Language Instructed Temporal-Localization Assistant
arXiv 2024
Entropy-Regularized Process Reward Model
arXiv 2024
Compact Language Models via Pruning and Knowledge Distillation
arXiv 2024
VILA: On Pre-training for Visual Language Models
CVPR 2024 1
FasterViT: Fast Vision Transformers with Hierarchical Attention
arXiv 2023
Global Context Vision Transformers
arXiv 2022
Affiliations
Frequent co-authors
10from 25 papers