A diverse set of Google Gemma models optimized for specific tasks and platforms, ranging from medical analysis to captioning.
Image Captioning with PaliGemma 2 Vision Language Model
Image Captioning with T5Gemma2 encoder-decoder img-txt Model