Zhiqi Li

Papers: 14

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline

arXiv 2026

2026

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

arXiv 2025

2025

AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs

arXiv 2025

2025

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

arXiv 2025

2025

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

arXiv 2025

2025

Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding

arXiv 2024

2024

Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

CVPR 2024

2024

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

arXiv 2024

2024

Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting

arXiv 2024

2024

FB-BEV: BEV Representation from Forward-Backward View Transformations

ICCV 2023

2023

Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection

2023

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

arXiv 2023

2023

InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

CVPR 2023

2022

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers

CVPR 2022

2021

Affiliations

No known affiliations.

Frequent co-authors

from 14 papers

Tong Lu

Wenhai Wang

Zhiding Yu

Guo Chen

Jifeng Dai

Xizhou Zhu

Yu Qiao

Guilin Liu

Jan Kautz

Jose M. Alvarez