MedGemma 4B IT
MultimodalMedGemma is a collection of Gemma 3 variants trained for medical text processing and image understanding. MedGemma 4B uses a SigLIP image encoder that was specifically pre-trained on diverse de-identified medical data, including chest X-rays, dermatological images, ophthalmological images, and histopathological slides. Its LLM component is trained on a diverse medical dataset including radiological images, histopathological fragments, ophthalmological images, and dermatological images. MedGemma is a multimodal model primarily evaluated on single-image tasks. It has not been tested for multi-turn applications and may be more sensitive to specific prompts than its predecessor Gemma 3. Developers should consider biases in validation data and data contamination issues when using MedGemma.
Key Specifications
Timeline
Technical Specifications
Benchmark Results
Model performance metrics across various tests and benchmarks
Other Tests
License & Metadata
Similar Models
All ModelsGemini 1.5 Flash 8B
Gemma 3n E2B
Gemma 3n E2B Instructed
Gemma 3 4B
Gemma 3n E4B Instructed LiteRT Preview
Gemma 3n E2B Instructed LiteRT (Preview)
Gemma 3n E4B Instructed
Gemma 3n E4B
Recommendations are based on similarity of characteristics: developer organization, multimodality, parameter size, and benchmark performance. Choose a model to compare or go to the full catalog to browse all available AI models.