Zhiqi Li
- Papers: 14
Authored papers (14)
Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline
arXiv 2026
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models
arXiv 2025
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
arXiv 2025
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots
arXiv 2025
AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs
arXiv 2025
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
arXiv 2024
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications
CVPR 2024
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
arXiv 2024
Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting
arXiv 2024
FB-BEV: BEV Representation from Forward-Backward View Transformations
ICCV 2023
Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection
DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
arXiv 2023
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
CVPR 2023
Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers
CVPR 2022
Affiliations
Frequent co-authors
10 from 14 papers