0

Jianfei Cai

Papers
32

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
32papers

Authored papers

32

Feed-Forward 3D Scene Modeling: A Problem-Driven Perspective

arXiv 2026

2026

MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE

arXiv 2026

2026

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

arXiv 2025

2025

Unified Camera Positional Encoding for Controlled Video Generation

arXiv 2025

2025

Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis

CVPR 2025 1

2025

SPINE: Token-Selective Test-Time Reinforcement Learning with Entropy-Band Regularization

arXiv 2025

2025

PanFlow: Decoupled Motion Control for Panoramic Video Generation

arXiv 2025

2025

OpenView: Empowering MLLMs with Out-of-view VQA

arXiv 2025

2025

MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images

arXiv 2024

2024

MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views

arXiv 2024

2024

PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting

CVPR 2025 1

2024

Fast Feedforward 3D Gaussian Splatting Compression

arXiv 2024

2024

T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching

arXiv 2024

2024

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI

arXiv 2024

2024

Point-PRC: A Prompt Learning Based Regulation Framework for Generalizable Point Cloud Analysis

arXiv 2024

2024

Explicit Correspondence Matching for Generalizable Neural Radiance Fields

arXiv 2023

2023

ObjectSDF++: Improved Object-Compositional Neural Implicit Surfaces

ICCV 2023 1

2023

Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning

ICCV 2023 1

2023

QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models

arXiv 2023

2023

Stitchable Neural Networks

CVPR 2023 1

2023

Stitched ViTs are Flexible Vision Backbones

arXiv 2023

2023

Unifying Flow, Stereo and Depth Estimation

arXiv 2022

2022

MARLIN: Masked Autoencoder for facial video Representation LearnINg

CVPR 2023 1

2022

Object-Compositional Neural Implicit Surfaces

arXiv 2022

2022

EcoFormer: Energy-Saving Attention with Linear Complexity

arXiv 2022

2022

Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning

arXiv 2022

2022

Fast Vision Transformers with HiLo Attention

arXiv 2022

2022

Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields

arXiv 2022

2022

GMFlow: Learning Optical Flow via Global Matching

CVPR 2022 1

2021

Mesa: A Memory-saving Training Framework for Transformers

arXiv 2021

2021

Scalable Vision Transformers with Hierarchical Pooling

ICCV 2021 10

2021

Pluralistic Image Completion

pluralistic-image-completion-1

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 32 papers