Build Powerful Multimodal RAG with Gemini: Unlock Visual & Text Data Integration
Learn how to create advanced Multimodal Retrieval-Augmented Generation (RAG) systems using Gemini. Enhance your AI's ability to process and generate based on both images and text seamlessly.

Google for Developers
95.9K views • May 16, 2024

About this video
The saying "a picture is worth a thousand words" encapsulates the immense potential of visual data. But most retrieval-augmented generation (RAG) applications rely only on text. This session applies RAG to multimodal use cases. It focuses on embeddings and attributed question answering to retrieve data. We'll begin with a high-level architecture and quickly dive into a practical demo. Attendees will learn to create powerful LLM-based workflows and embed them in existing applications.
Speakers: Shilpa Kancharla, Jeff Nelson
Resources:
Try Gemini in Vertex AI → https://goo.gle/3Vttolh
Watch more:
Check out all the AI videos at Google I/O 2024 → https://goo.gle/io24-ai-yt
Check out all the Cloud videos at Google I/O 2024 → https://goo.gle/io24-cloud-yt
Subscribe to Google Developers → https://goo.gle/developers
#GoogleIO
Event: Google I/O 2024
Products Mentioned: Gemini
Video Information
Views: 95.9K
Likes: 1.9K
Duration: 34:22
Published: May 16, 2024
User Reviews: 4.7 (19)