Zhiwen Fan
Authored papers (16)
VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
arXiv 2025
Generative AI for Autonomous Driving: Frontiers and Opportunities
arXiv 2025
Steepest Descent Density Control for Compact 3D Gaussian Splatting
CVPR 2025
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling
arXiv 2025
X$^{2}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction
ICCV 2025
Can Test-Time Scaling Improve World Foundation Model?
arXiv 2025
InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds
arXiv 2024
Large Spatial Model: End-to-end Unposed Images to Semantic 3D
arXiv 2024
LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild
arXiv 2024
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS
arXiv 2023
Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields
CVPR 2024
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts
arXiv 2023
Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts
ICCV 2023
NeuralLift-360: Lifting An In-the-wild 2D Photo to A 3D Object with 360° Views
arXiv 2022
M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
arXiv 2022
Neural Implicit Dictionary via Mixture-of-Expert Training
arXiv 2022
Frequent co-authors
10 from 16 papers