Yin Cui
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14SAGE: Scalable Agentic 3D Scene Generation for Embodied AI
arXiv 2026
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
arXiv 2025
Describe Anything: Detailed Localized Image and Video Captioning
ICCV 2025
Cosmos World Foundation Model Platform for Physical AI
arXiv 2025
Wolf: Captioning Everything with a World Summarization Framework
arXiv 2024
VideoGLUE: Video General Understanding Evaluation of Foundation Models
arXiv 2023
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
NeurIPS 2021 12
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
open-vocabulary-object-detection-via-vision
Spatiotemporal Contrastive Video Representation Learning
CVPR 2021 1
Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation
CVPR 2021 1
Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset
ECCV 2020 8
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization
spinenet-learning-scale-permuted-backbone-for-1
The iMaterialist Fashion Attribute Dataset
arXiv 2019
The iNaturalist Species Classification and Detection Dataset
the-inaturalist-species-classification-and-1
Affiliations
Frequent co-authors
10from 14 papers