Xiaowei Hu
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14GLM-5: from Vibe Coding to Agentic Engineering
arXiv 2026
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
arXiv 2026
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
arXiv 2025
EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning
arXiv 2025
GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical Reasoning
arXiv 2025
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI
arXiv 2024
SILT: Shadow-aware Iterative Label Tuning for Learning to Detect Shadows from Noisy Labels
ICCV 2023 1
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
CVPR 2023 1
GLIPv2: Unifying Localization and Vision-Language Understanding
arXiv 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
arXiv 2022
GIT: A Generative Image-to-text Transformer for Vision and Language
arXiv 2022
Demystify Transformers & Convolutions in Modern Image Deep Networks
arXiv 2022
VinVL: Revisiting Visual Representations in Vision-Language Models
CVPR 2021 1
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA
arXiv 2021
Affiliations
Frequent co-authors
10from 14 papers