Boyi Li

Papers: 14

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

14papers

Authored papers

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

arXiv 2026

2026

V_1: Unifying Generation and Self-Verification for Parallel Reasoners

arXiv 2026

2026

Toward Cognitive Supersensing in Multimodal Large Language Model

arXiv 2026

2026

Describe Anything: Detailed Localized Image and Video Captioning

ICCV 2025

2025

Scaling Vision Pre-Training to 4K Resolution

CVPR 2025 1

2025

FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos

arXiv 2025

2025

Adaptive Graph Pruning for Multi-Agent Communication

arXiv 2025

2025

Atlas: Multi-Scale Attention Improves Long Context Image Modeling

arXiv 2025

2025

Extrapolated Urban View Synthesis Benchmark

ICCV 2025

2024

Wolf: Captioning Everything with a World Summarization Framework

arXiv 2024

2024

Interactive Task Planning with Language Models

arXiv 2023

2023

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models

arXiv 2023

2023

Language-driven Semantic Segmentation

language-driven-semantic-segmentation

2022

On Feature Normalization and Data Augmentation

CVPR 2021 1

2020

Affiliations

No known affiliations.

Frequent co-authors

from 14 papers

Trevor Darrell

professor

Long Lian

Marco Pavone

Song Han

Adam Yala

Baifeng Shi

Boris Ivanovic

Hongxu Yin

Jitendra Malik

Jan Kautz