Shanshan Zhao
- Papers: 9
Authored papers
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
arXiv 2025
Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images
arXiv 2025
Ovis2.5 Technical Report
arXiv 2025
Ovis-U1 Technical Report
arXiv 2025
Caption Anything: Interactive Image Description with Diverse Multimodal Controls
arXiv 2023
Hierarchical Point-based Active Learning for Semi-supervised Point Cloud Semantic Segmentation
ICCV 2023
ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding
PNT-Edge: Towards Robust Edge Detection with Noisy Labels by Learning Pixel-level Noise Transitions
arXiv 2023
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting
CVPR 2023
Frequent co-authors
10 from 9 papers