Shiyi Lan
- Papers: 12
Authored papers
Cosmos World Foundation Model Platform for Physical AI
arXiv 2025
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control
arXiv 2025
Generalized Trajectory Scoring for End-to-end Multimodal Planning
arXiv 2025
Play to Generalize: Learning to Reason Through Game Play
arXiv 2025
ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation
arXiv 2025
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
arXiv 2024
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
arXiv 2024
Fully Attentional Networks with Self-emerging Token Labeling
FocalFormer3D : Focusing on Hard Instance for 3D Object Detection
arXiv 2023
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding
arXiv 2023
SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation
arXiv 2023
M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers
arXiv 2021
Frequent co-authors
10 from 12 papers