Build a Retrieval-Augmented Generation Chatbot in 5 Minutes

In under 5 minutes and with only 100 lines of Python code, Rohan Rao, senior solutions architect at NVIDIA, demos how large language models (LLMs) can be dev...

NVIDIA Developer2.3M views4:08

🔥 Related Trending Topics

LIVE TRENDS

This video may be related to current global trending topics. Click any trend to explore more videos about what's hot right now!

THIS VIDEO IS TRENDING!

This video is currently trending in Thailand under the topic 'สภาพอากาศ'.

About this video

In under 5 minutes and with only 100 lines of Python code, Rohan Rao, senior solutions architect at NVIDIA, demos how large language models (LLMs) can be developed and deployed for AI chatbot applications—without needing your own GPU infrastructure. This demo showcases how to design and create an enterprise-grade retrieval-augmented generation (RAG) pipeline using NVIDIA AI Foundation models. With NVIDIA AI Foundation endpoints, all embedding and generation tasks are handled seamlessly, removing the need for dedicated GPUs. Step-by-Step guide: 0:31 - Experiment with OS LLMs on NVIDIA AI Foundation Endpoints 1:11 – Overview of RAG Pipeline Components: Custom Data Loader, Text Embedding Model, Vector Database, LLM 1:30 – Use the LangChain Connector 1:47 – Generate an API Key for NGC 2:03 – Build the Chat UI 2:13 – Add Custom Data Connector 2:25 – Access the Text Embedding Model with API Calls 2:34 – Deploy the Vector Database to Index Embeddings 2:40 – Create or Load a Vector Store 2:51 – Use FAISS Library to Store Chunks 2:55 – Connect Your RAG Pipeline Using Streamlit Developer resources: ▫️ Deploy and test NVIDIA generative AI pipeline examples on GitHub: https://nvda.ws/41gNtfJ ▫️ Read the Mastering LLM Techniques series: https://nvda.ws/3YOdUKN ▫️ Deploy, test, and extend this RAG application example on GitHub: https://github.com/NVIDIA/GenerativeAIExamples/tree/main/community/5_mins_rag_no_gpu ▫️ Join the NVIDIA Developer Program: https://nvda.ws/3OhiXfl ▫️ Read and subscribe to the NVIDIA Technical Blog: https://nvda.ws/3XHae9F #largelanguagemodels #retrievalaugmentedgeneration #llm #rag #generativeai #langchain #aichatbot #ragtutorial #chatbotapp #buildingchatbots

Video Information

Views
2.3M

Total views since publication

Likes
1.2K

User likes and reactions

Duration
4:08

Video length

Published
Jan 24, 2024

Release date

Quality
hd

Video definition

Captions
Available

Subtitles enabled