Song Bai

Papers: 13

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

13papers

Authored papers

Monocular Normal Estimation via Shading Sequence Estimation

arXiv 2026

2026

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

arXiv 2025

2025

MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes

arXiv 2025

2025

Liquid: Language Models are Scalable Multi-modal Generators

arXiv 2024

2024

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

arXiv 2024

2024

DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing

CVPR 2024 1

2023

General Object Foundation Model for Images and Videos at Scale

CVPR 2024 1

2023

Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning

arXiv 2022

2022

PLA: Language-Driven Open-Vocabulary 3D Scene Understanding

CVPR 2023 1

2022

Is synthetic data from generative models ready for image recognition?

arXiv 2022

2022

An Empirical Study of End-to-End Temporal Action Detection

CVPR 2022 1

2022

DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion

CVPR 2022 1

2021

TransMix: Attend to Mix for Vision Transformers

CVPR 2022 1

2021

Affiliations

No known affiliations.

Frequent co-authors

from 13 papers

Xiang Bai

Chuhui Xue

Junfeng Wu

Wenqing Zhang

Yi Jiang

Zehuan Yuan

Chang Liu

Philip H. S. Torr

Philip Torr

Shuyang Sun