Guanbin Li
- Papers: 18
Authored papers
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
arXiv 2026
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation
arXiv 2026
3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians
arXiv 2025
DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering
CVPR 2025 1
TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis
arXiv 2025
Rethinking Query-based Transformer for Continual Image Segmentation
MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments
arXiv 2024
MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection
arXiv 2024
WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models
arXiv 2024
Identity-Preserving Talking Face Generation with Landmark and Appearance Priors
CVPR 2023 1
SCoDA: Domain Adaptive Shape Completion for Real Scans
CVPR 2023 1
SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training
ICCV 2023 1
Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts
ICCV 2023 1
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
ICCV 2023 1
Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training
arXiv 2022
Align, Reason and Learn: Enhancing Medical Vision-and-Language Pre-training with Knowledge
arXiv 2022
Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting
CVPR 2021 1
Efficient Crowd Counting via Structured Knowledge Transfer
arXiv 2020
Affiliations
Frequent co-authors
10 from 18 papers