NExT-GPT: First Any-to-Any Multimodal LLM πŸ€–

NExT-GPT is the first end-to-end MM-LLM capable of processing diverse inputs like text, images, video, and audio in any combination.

NExT-GPT: First Any-to-Any Multimodal LLM πŸ€–
AI Bites
7.2K views β€’ Sep 28, 2023
NExT-GPT: First Any-to-Any Multimodal LLM πŸ€–

About this video

NExT-GPT is the first end-to-end Multimodal Large Language Model (MM-LLM) that can take inputs in arbitrary combinations of text, image, video, and audio and generate outputs in any of the same modalities. In short, it is the first any-to-any MM-LLM model.

In this video I go through some of the NExT-GPT model architecture, the proposed alignment techniques like Encoding-side LLM-centric Alignment,
Decoding-side Instruction-following Alignment, Modality-switching Instruction Tuning, and the MosIT dataset.

Hope it's useful. Please leave your comments for any clarifications.

Website: https://next-gpt.github.io
Paper Link: https://arxiv.org/pdf/2309.05519
Code: https://github.com/NExT-GPT/NExT-GPT

MY KEY LINKS
YouTube: https://www.youtube.com/@AIBites
Twitter: https://twitter.com/ai_bites​
Patreon: https://www.patreon.com/ai_bites​
Github: https://github.com/ai-bites​

πŸ›  πŸ›  πŸ›  MY SOFTWARE TOOLS πŸ›  πŸ›  πŸ› 
✍️ Notion - https://affiliate.notion.so/aibites-yt
✍️ Notion AI - https://affiliate.notion.so/ys9rqzv2vdd8
πŸ“Ή OBS Studio for video editing - https://obsproject.com
πŸ“Ό Manim for some animations - https://www.manim.community
🎡 My music - https://www.bensound.com and

πŸ“š πŸ“š πŸ“š BOOKS I HAVE READ, REFER AND RECOMMEND πŸ“š πŸ“š πŸ“š
πŸ“– Deep Learning by Ian Goodfellow - https://amzn.to/3Wnyixv
πŸ“™ Pattern Recognition and Machine Learning by Christopher M. Bishop - https://amzn.to/3ZVnQQA
πŸ“— Machine Learning: A Probabilistic Perspective by Kevin Murphy - https://amzn.to/3kAqThb
πŸ“˜ Multiple View Geometry in Computer Vision by R Hartley and A Zisserman - https://amzn.to/3XKVOWi

WHO AM I?
I am a Machine Learning Researcher/practitioner who has seen the grind of academia and start-ups equally. I started my career as a software engineer 15 years ago. Because of my love for Mathematics (coupled with a glimmer of luck), I graduated with a Master's in Computer Vision and Robotics in 2016 when the now happening AI revolution just started. Life has changed for the better ever since.

#machinelearning #deeplearning #aibites

Tags and Topics

Browse our collection to discover more content in these categories.

Video Information

Views

7.2K

Likes

160

Duration

9:56

Published

Sep 28, 2023

User Reviews

4.6
(1)
Rate:

Related Trending Topics

LIVE TRENDS

Related trending topics. Click any trend to explore more videos.