Build a Retrieval-Augmented Generation Chatbot in 5 Minutes
About this video
In under 5 minutes and with only 100 lines of Python code, Rohan Rao, senior solutions architect at NVIDIA, demos how large language models (LLMs) can be developed and deployed for AI chatbot applications—without needing your own GPU infrastructure.
This demo shows how to design and build an enterprise-grade retrieval-augmented generation (RAG) pipeline using NVIDIA AI Foundation models. With NVIDIA AI Foundation endpoints, both embedding and generation run as hosted API calls, so no dedicated GPUs are needed.
Step-by-step guide:
0:31 – Experiment with open-source LLMs on NVIDIA AI Foundation Endpoints
1:11 – Overview of RAG Pipeline Components: Custom Data Loader, Text Embedding Model, Vector Database, LLM
1:30 – Use the LangChain Connector
1:47 – Generate an API Key for NGC
2:03 – Build the Chat UI
2:13 – Add Custom Data Connector
2:25 – Access the Text Embedding Model with API Calls
2:34 – Deploy the Vector Database to Index Embeddings
2:40 – Create or Load a Vector Store
2:51 – Use FAISS Library to Store Chunks
2:55 – Connect Your RAG Pipeline Using Streamlit
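The retrieval part of the steps above (chunk the documents, embed the chunks, index them, then fetch the closest chunks for a question) can be sketched in plain Python. This is a minimal illustration, not the code from the video: the demo uses NVIDIA AI Foundation endpoints through the LangChain connector for embeddings and generation, FAISS for the vector store, and Streamlit for the UI. Here a bag-of-words counter stands in for the embedding model, an in-memory list stands in for FAISS, and every name (`chunk_text`, `embed`, `ToyVectorStore`, `build_prompt`) is hypothetical.

```python
import math
import re
from collections import Counter


def chunk_text(text, chunk_size=40, overlap=10):
    """Split text into overlapping windows of words (sizes are arbitrary defaults)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(words):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last window already reached the end of the text
        start += step
    return chunks


def embed(text):
    """Toy stand-in for an embedding model: a sparse bag-of-words vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class ToyVectorStore:
    """In-memory stand-in for a vector database such as FAISS."""

    def __init__(self):
        self._items = []  # list of (chunk, vector) pairs

    def add(self, chunks):
        for chunk in chunks:
            self._items.append((chunk, embed(chunk)))

    def search(self, query, k=2):
        """Return the k chunks most similar to the query."""
        query_vec = embed(query)
        ranked = sorted(
            self._items, key=lambda item: cosine(query_vec, item[1]), reverse=True
        )
        return [chunk for chunk, _ in ranked[:k]]


def build_prompt(question, contexts):
    """Assemble the augmented prompt that would be sent to the LLM."""
    context_block = "\n".join(f"- {c}" for c in contexts)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context_block}\n"
        f"Question: {question}"
    )


if __name__ == "__main__":
    store = ToyVectorStore()
    store.add(
        [
            "FAISS stores dense vectors for similarity search.",
            "Streamlit renders the chat user interface.",
        ]
    )
    question = "Which library stores vectors?"
    print(build_prompt(question, store.search(question, k=1)))
```

In a real pipeline, `embed` would be replaced by a call to a hosted embedding model, `ToyVectorStore` by a FAISS index, and the string returned by `build_prompt` would be sent to the LLM endpoint for generation.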
Developer resources:
▫️ Deploy and test NVIDIA generative AI pipeline examples on GitHub: https://nvda.ws/41gNtfJ
▫️ Read the Mastering LLM Techniques series: https://nvda.ws/3YOdUKN
▫️ Deploy, test, and extend this RAG application example on GitHub: https://github.com/NVIDIA/GenerativeAIExamples/tree/main/community/5_mins_rag_no_gpu
▫️ Join the NVIDIA Developer Program: https://nvda.ws/3OhiXfl
▫️ Read and subscribe to the NVIDIA Technical Blog: https://nvda.ws/3XHae9F
#largelanguagemodels #retrievalaugmentedgeneration #llm #rag #generativeai #langchain #aichatbot #ragtutorial #chatbotapp #buildingchatbots
Video Information
Views: 2.3M
Likes: 1.2K
Duration: 4:08
Published: Jan 24, 2024
Quality: HD
Captions: Available