Tiejun Huang
- Papers
- 18
Cite
Notes
Only stored in your browser.
Authored papers
18RoboBrain 2.5: Depth in Sight, Time in Mind
arXiv 2026
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics
arXiv 2025
Emu3.5: Native Multimodal Models are World Learners
arXiv 2025
OmniGen2: Exploration to Advanced Multimodal Generation
arXiv 2025
MLVU: Benchmarking Multi-task Long Video Understanding
CVPR 2025 1
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models
arXiv 2024
Emu3: Next-Token Prediction is All You Need
arXiv 2024
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
CVPR 2025 1
Efficient Multimodal Learning from Data-centric Perspective
arXiv 2024
OmniGen: Unified Image Generation
CVPR 2025 1
Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?
arXiv 2024
GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation
arXiv 2023
SegVol: Universal and Interactive Volumetric Medical Image Segmentation
arXiv 2023
Hard-aware Instance Adaptive Self-training for Unsupervised Cross-domain Semantic Segmentation
arXiv 2023
SegGPT: Segmenting Everything In Context
arXiv 2023
Generative Multimodal Models are In-Context Learners
CVPR 2024 1
SVIT: Scaling up Visual Instruction Tuning
arXiv 2023
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
CVPR 2022 1
Affiliations
Frequent co-authors
10from 18 papers