Song Bai
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13Monocular Normal Estimation via Shading Sequence Estimation
arXiv 2026
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
arXiv 2025
MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes
arXiv 2025
Liquid: Language Models are Scalable Multi-modal Generators
arXiv 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
arXiv 2024
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing
CVPR 2024 1
General Object Foundation Model for Images and Videos at Scale
CVPR 2024 1
Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning
arXiv 2022
PLA: Language-Driven Open-Vocabulary 3D Scene Understanding
CVPR 2023 1
Is synthetic data from generative models ready for image recognition?
arXiv 2022
An Empirical Study of End-to-End Temporal Action Detection
CVPR 2022 1
DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion
CVPR 2022 1
TransMix: Attend to Mix for Vision Transformers
CVPR 2022 1
Affiliations
Frequent co-authors
10from 13 papers