NExT-GPT: Versatile Multimodal AI Model 🤖

NExT-GPT is the first end-to-end multimodal large language model (MM-LLM) capable of processing and generating text, images, videos, and audio in any combination.

Hao Fei
16.2K views • Sep 11, 2023

About this video

NExT-GPT is the first end-to-end MM-LLM that perceives input and generates output in arbitrary (any-to-any) combinations of text, image, video, audio, and beyond.
Webpage: https://next-gpt.github.io
Paper: https://arxiv.org/pdf/2309.05519
Code: https://github.com/NExT-GPT/NExT-GPT

Video Information

Views: 16.2K
Likes: 83
Duration: 2:41
Published: Sep 11, 2023

