Jianfei Cai
- Papers
- 32
Cite
Notes
Only stored in your browser.
Authored papers
32Feed-Forward 3D Scene Modeling: A Problem-Driven Perspective
arXiv 2026
MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE
arXiv 2026
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
arXiv 2025
Unified Camera Positional Encoding for Controlled Video Generation
arXiv 2025
Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis
CVPR 2025 1
SPINE: Token-Selective Test-Time Reinforcement Learning with Entropy-Band Regularization
arXiv 2025
PanFlow: Decoupled Motion Control for Panoramic Video Generation
arXiv 2025
OpenView: Empowering MLLMs with Out-of-view VQA
arXiv 2025
MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
arXiv 2024
MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views
arXiv 2024
PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
CVPR 2025 1
Fast Feedforward 3D Gaussian Splatting Compression
arXiv 2024
T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching
arXiv 2024
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI
arXiv 2024
Point-PRC: A Prompt Learning Based Regulation Framework for Generalizable Point Cloud Analysis
arXiv 2024
Explicit Correspondence Matching for Generalizable Neural Radiance Fields
arXiv 2023
ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces
ICCV 2023 1
Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning
ICCV 2023 1
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models
arXiv 2023
Stitchable Neural Networks
CVPR 2023 1
Stitched ViTs are Flexible Vision Backbones
arXiv 2023
Unifying Flow, Stereo and Depth Estimation
arXiv 2022
MARLIN: Masked Autoencoder for facial video Representation LearnINg
CVPR 2023 1
Object-Compositional Neural Implicit Surfaces
arXiv 2022
EcoFormer: Energy-Saving Attention with Linear Complexity
arXiv 2022
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
arXiv 2022
Fast Vision Transformers with HiLo Attention
arXiv 2022
Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields
arXiv 2022
GMFlow: Learning Optical Flow via Global Matching
CVPR 2022 1
Mesa: A Memory-saving Training Framework for Transformers
arXiv 2021
Scalable Vision Transformers with Hierarchical Pooling
ICCV 2021 10
Pluralistic Image Completion
pluralistic-image-completion-1
Affiliations
Frequent co-authors
10from 32 papers