EmbeddingGemma: Top Open Model for On-Device Embeddings
Explore EmbeddingGemma, a 308M parameter model enabling high-quality generative AI directly on your device. đ

Google for Developers
114.0K views âą Sep 4, 2025

About this video
Discover EmbeddingGemma, a state-of-the-art 308 million parameter text embedding model designed to power generative AI experiences directly on your hardware. Ideal for mobile-first Al, EmbeddingGemma brings powerful capabilities to your applications, enabling features like semantic search, information retrieval, and custom classification â all while running efficiently on-device.
In this video, Alice Lisak and Lucas Gonzalez from the Gemma team introduce EmbeddingGemma and explain how it works. Learn how you can run this model on less than 200MB of RAM with quantization, customize its output dimensions with Matryoshka Representation Learning (MRL), and
build powerful offline Al features.
Resources:
Learn about EmbeddingGemma â https://developers.googleblog.com/en/introducing-embeddinggemma
EmbeddingGemma documentation â https://ai.google.dev/gemma/docs/embeddinggemma
Gemma Cookbook â https://github.com/google-gemini/gemma-cookbook
Quickstart RAG notebook â https://github.com/google-gemini/gemma-cookbook/blob/main/Gemma/%5BGemma_3%5DRAG_with_EmbeddingGemma.ipynb
Discover Gemma models â https://deepmind.google/models/gemma
Chapters
0:00 - Intro
0:26 - Model overview
1:18 - Model features
2:29 - RAG
2:54 - Website embedding demo
3:23 - Tools and platforms
3:41 - Conclusion
Subscribe to Google for Developers â https://goo.gle/developers
Speaker:Alice Lisak Lucas Gonzalez
Products Mentioned: Google AI, Gemma,Generative AI
In this video, Alice Lisak and Lucas Gonzalez from the Gemma team introduce EmbeddingGemma and explain how it works. Learn how you can run this model on less than 200MB of RAM with quantization, customize its output dimensions with Matryoshka Representation Learning (MRL), and
build powerful offline Al features.
Resources:
Learn about EmbeddingGemma â https://developers.googleblog.com/en/introducing-embeddinggemma
EmbeddingGemma documentation â https://ai.google.dev/gemma/docs/embeddinggemma
Gemma Cookbook â https://github.com/google-gemini/gemma-cookbook
Quickstart RAG notebook â https://github.com/google-gemini/gemma-cookbook/blob/main/Gemma/%5BGemma_3%5DRAG_with_EmbeddingGemma.ipynb
Discover Gemma models â https://deepmind.google/models/gemma
Chapters
0:00 - Intro
0:26 - Model overview
1:18 - Model features
2:29 - RAG
2:54 - Website embedding demo
3:23 - Tools and platforms
3:41 - Conclusion
Subscribe to Google for Developers â https://goo.gle/developers
Speaker:Alice Lisak Lucas Gonzalez
Products Mentioned: Google AI, Gemma,Generative AI
Tags and Topics
Browse our collection to discover more content in these categories.
Video Information
Views
114.0K
Likes
4.5K
Duration
4:13
Published
Sep 4, 2025
User Reviews
4.7
(22) Related Trending Topics
LIVE TRENDSRelated trending topics. Click any trend to explore more videos.