NExT-GPT: Versatile Multimodal AI Model
NExT-GPT is the first end-to-end MM-LLM capable of processing and generating text, images, videos, and audio in any combination.

Hao Fei
16.2K views • Sep 11, 2023

About this video
NExT-GPT is the first end-to-end MM-LLM that perceives input and generates output in arbitrary combinations (any-to-any) of text, image, video, audio, and beyond.
Webpage: https://next-gpt.github.io
Paper: https://arxiv.org/pdf/2309.05519
Code: https://github.com/NExT-GPT/NExT-GPT
Video Information
Views: 16.2K
Likes: 83
Duration: 2:41
Published: Sep 11, 2023
User Reviews: 4.1 (3)