Yuan Gao
- Papers
- 19
Cite
Notes
Only stored in your browser.
Authored papers
19DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing
arXiv 2026
Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario Analysis
arXiv 2025
Valley2: Exploring Multimodal Models with Scalable Vision-Language Design
arXiv 2025
NeuralOM: Neural Ocean Model for Subseasonal-to-Seasonal Simulation
arXiv 2025
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation
arXiv 2025
Target-Bench: Can World Models Achieve Mapless Path Planning with Semantic Targets?
arXiv 2025
From Words to Collisions: LLM-Guided Evaluation and Adversarial Generation of Safety-Critical Driving Scenarios
arXiv 2025
OneForecast: A Universal Framework for Global and Regional Weather Forecasting
arXiv 2025
DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
arXiv 2024
Bypass Back-propagation: Optimization-based Structural Pruning for Large Language Models via Policy Gradient
arXiv 2024
OneRestore: A Universal Restoration Framework for Composite Degradation
arXiv 2024
VideoTetris: Towards Compositional Text-to-Video Generation
arXiv 2024
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models
arXiv 2024
LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating
arXiv 2024
Imaging foundation model for universal enhancement of non-ideal measurement CT
arXiv 2024
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
arXiv 2023
Composable Text Controls in Latent Space with ODEs
arXiv 2022
Partial FC: Training 10 Million Identities on a Single Machine
arXiv 2020
Unity: A General Platform for Intelligent Agents
arXiv 2018
Affiliations
Frequent co-authors
10from 19 papers