Shiyi Lan

Papers: 12

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

12papers

Authored papers

Cosmos World Foundation Model Platform for Physical AI

arXiv 2025

2025

Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control

arXiv 2025

2025

Generalized Trajectory Scoring for End-to-end Multimodal Planning

arXiv 2025

2025

Play to Generalize: Learning to Reason Through Game Play

arXiv 2025

2025

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

arXiv 2025

2025

OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning

arXiv 2024

2024

Fully Attentional Networks with Self-emerging Token Labeling

fully-attentional-networks-with-self-emerging

2024

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

arXiv 2024

2024

FocalFormer3D : Focusing on Hard Instance for 3D Object Detection

arXiv 2023

2023

Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding

arXiv 2023

2023

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

arXiv 2023

2023

M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers

arXiv 2021

2021

Affiliations

No known affiliations.

Frequent co-authors

from 12 papers

Jose M. Alvarez

Zuxuan Wu

Zhiding Yu

Huan Ling

Sanja Fidler

Alice Luo

Dieter Fox

Francesco Ferroni

Hanzi Mao

Jan Kautz