Wei Yin
- Papers: 16
Authored papers (16)
HorizonStream: Long-Horizon Attention for Streaming 3D Reconstruction
arXiv 2026
Epona: Autoregressive Diffusion World Model for Autonomous Driving
ICCV 2025
3D and 4D World Modeling: A Survey
arXiv 2025
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving
CVPR 2025
LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging
arXiv 2025
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization
arXiv 2025
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
arXiv 2024
Depth Any Video with Scalable Synthetic Data
arXiv 2024
Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation
GIM: Learning Generalizable Image Matcher From Internet Videos
arXiv 2024
ComDrive: Comfort-Oriented End-to-End Autonomous Driving
arXiv 2024
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
arXiv 2024
OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity
arXiv 2024
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration
ICCV 2025
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
arXiv 2024
LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment
arXiv 2024
Frequent co-authors: 10 (from 16 papers)