Build Powerful Multimodal RAG with Gemini: Unlock Visual & Text Data Integration 📊

Learn how to create advanced Multimodal Retrieval-Augmented Generation (RAG) systems using Gemini. Enhance your AI's ability to process and generate based on both images and text seamlessly.

Google for Developers
95.9K views • May 16, 2024

About this video

The saying "a picture is worth a thousand words" encapsulates the immense potential of visual data. But most retrieval-augmented generation (RAG) applications rely only on text. This session applies RAG to multimodal use cases. It focuses on embeddings and attributed question answering to retrieve data. We'll begin with a high-level architecture and quickly dive into a practical demo. Attendees will learn to create powerful LLM-based workflows and embed them in existing applications.
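As a rough illustration of the retrieval idea the session describes, the sketch below ranks text and image items in one shared embedding space and keeps each item's source so an answer can be attributed. It is not the speakers' demo code. The `Item` class, the `retrieve` helper, the file names, and the three-dimensional vectors are all hypothetical; a real system would obtain embeddings from a multimodal embedding model rather than hand-written numbers.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class Item:
    source: str             # where the content came from, kept for attribution
    modality: str           # "text" or "image"
    embedding: List[float]  # toy vector (assumption); normally produced by an embedding model


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve(query_embedding: List[float], index: List[Item], k: int = 2) -> List[Item]:
    """Return the k items most similar to the query, regardless of modality."""
    ranked = sorted(
        index,
        key=lambda item: cosine(query_embedding, item.embedding),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical index mixing text chunks and images in one vector space.
INDEX = [
    Item("report.pdf#page=3", "text", [0.9, 0.1, 0.0]),
    Item("chart_q1.png", "image", [0.8, 0.2, 0.1]),
    Item("team_photo.jpg", "image", [0.0, 0.1, 0.9]),
]

query = [1.0, 0.0, 0.0]  # stand-in for an embedded user question
hits = retrieve(query, INDEX)
sources = [hit.source for hit in hits]  # passed to the model so it can cite them
```

In a full pipeline, the retrieved items and their `source` values would be sent to the model along with the question, which is what lets the generated answer point back to the text passage or image it relied on.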

Speakers: Shilpa Kancharla, Jeff Nelson

Resources:
Try Gemini in Vertex AI → https://goo.gle/3Vttolh

Watch more:
Check out all the AI videos at Google I/O 2024 → https://goo.gle/io24-ai-yt
Check out all the Cloud videos at Google I/O 2024 → https://goo.gle/io24-cloud-yt

Subscribe to Google Developers → https://goo.gle/developers

#GoogleIO


Event: Google I/O 2024
Products Mentioned: Gemini

Video Information

Views: 95.9K
Likes: 1.9K
Duration: 34:22
Published: May 16, 2024

User Reviews

4.7
(19)
