NExT-GPT: First Any-to-Any Multimodal LLM π€
NExT-GPT is the first end-to-end MM-LLM capable of processing diverse inputs like text, images, video, and audio in any combination.

AI Bites
7.2K views β’ Sep 28, 2023

About this video
NExT-GPT is the first end-to-end Multimodal Large Language Model (MM-LLM) that can take inputs in arbitrary combinations of text, image, video, and audio and generate outputs in any of the same modalities. In short, it is the first any-to-any MM-LLM model.
In this video I go through some of the NExT-GPT model architecture, the proposed alignment techniques like Encoding-side LLM-centric Alignment,
Decoding-side Instruction-following Alignment, Modality-switching Instruction Tuning, and the MosIT dataset.
Hope it's useful. Please leave your comments for any clarifications.
Website: https://next-gpt.github.io
Paper Link: https://arxiv.org/pdf/2309.05519
Code: https://github.com/NExT-GPT/NExT-GPT
MY KEY LINKS
YouTube: https://www.youtube.com/@AIBites
Twitter: https://twitter.com/ai_bitesβ
Patreon: https://www.patreon.com/ai_bitesβ
Github: https://github.com/ai-bitesβ
π π π MY SOFTWARE TOOLS π π π
βοΈ Notion - https://affiliate.notion.so/aibites-yt
βοΈ Notion AI - https://affiliate.notion.so/ys9rqzv2vdd8
πΉ OBS Studio for video editing - https://obsproject.com
πΌ Manim for some animations - https://www.manim.community
π΅ My music - https://www.bensound.com and
π π π BOOKS I HAVE READ, REFER AND RECOMMEND π π π
π Deep Learning by Ian Goodfellow - https://amzn.to/3Wnyixv
π Pattern Recognition and Machine Learning by Christopher M. Bishop - https://amzn.to/3ZVnQQA
π Machine Learning: A Probabilistic Perspective by Kevin Murphy - https://amzn.to/3kAqThb
π Multiple View Geometry in Computer Vision by R Hartley and A Zisserman - https://amzn.to/3XKVOWi
WHO AM I?
I am a Machine Learning Researcher/practitioner who has seen the grind of academia and start-ups equally. I started my career as a software engineer 15 years ago. Because of my love for Mathematics (coupled with a glimmer of luck), I graduated with a Master's in Computer Vision and Robotics in 2016 when the now happening AI revolution just started. Life has changed for the better ever since.
#machinelearning #deeplearning #aibites
In this video I go through some of the NExT-GPT model architecture, the proposed alignment techniques like Encoding-side LLM-centric Alignment,
Decoding-side Instruction-following Alignment, Modality-switching Instruction Tuning, and the MosIT dataset.
Hope it's useful. Please leave your comments for any clarifications.
Website: https://next-gpt.github.io
Paper Link: https://arxiv.org/pdf/2309.05519
Code: https://github.com/NExT-GPT/NExT-GPT
MY KEY LINKS
YouTube: https://www.youtube.com/@AIBites
Twitter: https://twitter.com/ai_bitesβ
Patreon: https://www.patreon.com/ai_bitesβ
Github: https://github.com/ai-bitesβ
π π π MY SOFTWARE TOOLS π π π
βοΈ Notion - https://affiliate.notion.so/aibites-yt
βοΈ Notion AI - https://affiliate.notion.so/ys9rqzv2vdd8
πΉ OBS Studio for video editing - https://obsproject.com
πΌ Manim for some animations - https://www.manim.community
π΅ My music - https://www.bensound.com and
π π π BOOKS I HAVE READ, REFER AND RECOMMEND π π π
π Deep Learning by Ian Goodfellow - https://amzn.to/3Wnyixv
π Pattern Recognition and Machine Learning by Christopher M. Bishop - https://amzn.to/3ZVnQQA
π Machine Learning: A Probabilistic Perspective by Kevin Murphy - https://amzn.to/3kAqThb
π Multiple View Geometry in Computer Vision by R Hartley and A Zisserman - https://amzn.to/3XKVOWi
WHO AM I?
I am a Machine Learning Researcher/practitioner who has seen the grind of academia and start-ups equally. I started my career as a software engineer 15 years ago. Because of my love for Mathematics (coupled with a glimmer of luck), I graduated with a Master's in Computer Vision and Robotics in 2016 when the now happening AI revolution just started. Life has changed for the better ever since.
#machinelearning #deeplearning #aibites
Tags and Topics
Browse our collection to discover more content in these categories.
Video Information
Views
7.2K
Likes
160
Duration
9:56
Published
Sep 28, 2023
User Reviews
4.6
(1) Related Trending Topics
LIVE TRENDSRelated trending topics. Click any trend to explore more videos.
Trending Now