AI Learns to Sumo Wrestle (deep reinforcement learning)
AI vs AI Sumo! https://brilliant.org/AIWarehouse/ If you want to learn more about AI and deep reinforcement learning (how Albert is trained), there are amazi...

AI Warehouse
728.0K views β’ Oct 1, 2025

About this video
AI vs AI Sumo!
https://brilliant.org/AIWarehouse/
If you want to learn more about AI and deep reinforcement learning (how Albert is trained), there are amazing courses teaching those exact concepts on Brilliant! You can use my link to get 20% off! I've personally gone through the course "Introduction to Neural Networks", and it's one of the best courses on Neural Networks I've ever seen. They're paying us to promote them, but they're genuinely a great service, I've had a Brilliant account for over 5 years and can't recommend it enough :)
In this video Albert and Kai, two AI Warehouse agents fight. The AI were trained using Deep Reinforcement Learning, a method of Machine Learning which involves rewarding the agent for doing something correctly, and punishing it for doing anything incorrectly. The agents actions are controlled by Neural Networks that are updated after each attempt in order to try to give the AI more rewards and less punishments over time
Thank you for watching :D
Current Subscribers: 741,510
https://brilliant.org/AIWarehouse/
If you want to learn more about AI and deep reinforcement learning (how Albert is trained), there are amazing courses teaching those exact concepts on Brilliant! You can use my link to get 20% off! I've personally gone through the course "Introduction to Neural Networks", and it's one of the best courses on Neural Networks I've ever seen. They're paying us to promote them, but they're genuinely a great service, I've had a Brilliant account for over 5 years and can't recommend it enough :)
In this video Albert and Kai, two AI Warehouse agents fight. The AI were trained using Deep Reinforcement Learning, a method of Machine Learning which involves rewarding the agent for doing something correctly, and punishing it for doing anything incorrectly. The agents actions are controlled by Neural Networks that are updated after each attempt in order to try to give the AI more rewards and less punishments over time
Thank you for watching :D
Current Subscribers: 741,510
Tags and Topics
Browse our collection to discover more content in these categories.
Video Information
Views
728.0K
Likes
20.1K
Duration
10:16
Published
Oct 1, 2025
User Reviews
4.8
(145) Related Trending Topics
LIVE TRENDSRelated trending topics. Click any trend to explore more videos.
Trending Now