Pavlo Molchanov

Papers: 25

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

arXiv 2026

2026

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

arXiv 2026

2026

C-RADIOv4 (Tech Report)

arXiv 2026

2026

Scaling RL to Long Videos

arXiv 2025

2025

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

arXiv 2025

2025

FeatSharp: Your Vision Model Features, Sharper

arXiv 2025

2025

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

arXiv 2025

2025

Fast-dLLM v2: Efficient Block-Diffusion LLM

arXiv 2025

2025

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

arXiv 2025

2025

Universal Deep Research: Bring Your Own Model and Strategy

arXiv 2025

2025

LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement

arXiv 2025

2025

Scaling Vision Pre-Training to 4K Resolution

CVPR 2025

2025

NVILA: Efficient Frontier Visual Language Models

CVPR 2025

2024

DoRA: Weight-Decomposed Low-Rank Adaptation

arXiv 2024

2024

RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models

arXiv 2024

2024

VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge

CVPR 2025

2024

MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models

arXiv 2024

2024

Hymba: A Hybrid-head Architecture for Small Language Models

arXiv 2024

2024

EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation

arXiv 2024

2024

LITA: Language Instructed Temporal-Localization Assistant

arXiv 2024

2024

Entropy-Regularized Process Reward Model

arXiv 2024

2024

Compact Language Models via Pruning and Knowledge Distillation

arXiv 2024

2024

VILA: On Pre-training for Visual Language Models

CVPR 2024

2023

FasterViT: Fast Vision Transformers with Hierarchical Attention

arXiv 2023

2023

Global Context Vision Transformers

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

from 25 papers

Jan Kautz

Hongxu Yin

Andrew Tao

Greg Heinrich

Song Han

Bryan Catanzaro

Shizhe Diao

Yao Lu

Yonggan Fu

Zhijian Liu