Artificial intelligence (AI) has significant potential in healthcare applications, but its training and deployment faces challenges due to healthcare's diverse data, complex tasks, and the need to preserve privacy. Foundation models that perform well on medical tasks and require less task-specific tuning data are critical to accelerate the development of healthcare AI applications. We introduce MedGemma, a collection of medical vision-language foundation models based on Gemma 3 4B and 27B. MedGemma demonstrates advanced medical understanding and reasoning on images and text, significantly exceeding the performance of similar-sized generative models and approaching the performance of task-specific models, while maintaining the general capabilities of the Gemma 3 base models. For out-of-distribution tasks, MedGemma achieves 2.6-10% improvement on medical multimodal question answering, 15.5-18.1% improvement on chest X-ray finding classification, and 10.8% improvement on agentic evaluations compared to the base models. Fine-tuning MedGemma further improves performance in subdomains, reducing errors in electronic health record information retrieval by 50% and reaching comparable performance to existing specialized state-of-the-art methods for pneumothorax classification and histopathology patch classification. We additionally introduce MedSigLIP, a medically-tuned vision encoder derived from SigLIP. MedSigLIP powers the visual understanding capabilities of MedGemma and as an encoder achieves comparable or better performance than specialized medical image encoders. Taken together, the MedGemma collection provides a strong foundation of medical image and text capabilities, with potential to significantly accelerate medical research and development of downstream applications. The MedGemma collection, including tutorials and model weights, can be found at https://goo.gle/medgemma.
MedGemma Technical Report
Artificial intelligence (AI) has significant potential in healthcare applications, but its training and deployment faces challenges due to healthcare's diverse data, complex tasks, and the need to preserve privacy.
- Year
- 2025
- Venue
- arXiv 2025
- Authors
- 81
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2507.05201v3ARXIV-DEFAULT
- TL;DR
- Semantic Scholar
Abstract
Authors
81Sebastian BorgeaudYossi MatiasAndrew SellergrenSahar KazemzadehTiam JaroensriAtilla KiralyMadeleine TraverseTimo KohlbergerShawn XuFayaz JamilCían HughesCharles LauJustin ChenFereshteh MahvarLiron YatzivTiffany ChenBram SterlingStefanie Anna BabySusanna Maria BabyJeremy LaiSamuel SchmidgallLu YangKejia ChenPer BjornssonShashir ReddyRyan BrushKenneth PhilbrickMercy AsieduInes MezerregHoward HuHoward YangRicha TiwariSunny JansenPreeti SinghYun LiuShekoofeh AziziAishwarya KamathJohan FerretShreya PathakNino VieillardRamona MerhejSarah PerrinTatiana MatejovicovaAlexandre RaméMorgane RiviereLouis RouillardThomas MesnardGeoffrey CideronJean-bastien GrillSabela RamosEdouard YvinecMichelle CasbonElena BuchatskayaJean-Baptiste AlayracDmitry LepikhinVlad FeinbergAlek AndreevCassidy HardinRobert DadashiLéonard HussenotArmand JoulinOlivier BachemKatherine ChouAvinatan HassidimKavi GoelClement FarabetJoelle BarralTris WarkentinJonathon ShlensDavid FleetVictor CotrutaOmar SansevieroGus MartinsPhoebe KirkAnand RaoShravya ShettyDavid F. SteinerCan KirmizibayrakRory PilgrimDaniel GoldenLin Yang