Mastering Q-Learning: Train AI to Play Video Games Without Pre-Built World Models 🎮
Discover how Q-Learning enables AI agents to learn and achieve objectives in video game environments without prior world modeling. This tutorial breaks down the concepts and practical steps!

Siraj Raval
366.0K views • Dec 2, 2017

About this video
Can we train an AI to complete it's objective in a video game world without needing to build a model of the world before hand? The answer is yes using Q learning! I'll go through several use cases and show some python code of how Q learning works.
Code for this video:
https://github.com/llSourcell/Q_Learning_Explained/
Adnan's Winning code:
https://github.com/AdnanZahid/ReinforcementLearning
Alberto's runner up code:
https://github.com/alberduris
Please Subscribe! And like. And comment. That's what keeps me going.
Want more inspiration & education? Connect with me:
Twitter: https://twitter.com/sirajraval
Facebook: https://www.facebook.com/sirajology
More learning resources:
http://mnemstudio.org/path-finding-q-learning-tutorial.htm
https://ocw.mit.edu/courses/aeronautics-and-astronautics/16-410-principles-of-autonomy-and-decision-making-fall-2010/lecture-notes/MIT16_410F10_lec23.pdf
http://uhaweb.hartford.edu/compsci/ccli/projects/QLearning.pdf
https://medium.com/@m.alzantot/deep-reinforcement-learning-demysitifed-episode-2-policy-iteration-value-iteration-and-q-978f9e89ddaa
https://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a-html/node19.html
http://cs.stanford.edu/people/karpathy/reinforcejs/gridworld_dp.html
https://www.quora.com/How-is-policy-iteration-different-from-value-iteration
http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching_files/DP.pdf
Join us in the Wizards Slack channel:
http://wizards.herokuapp.com/
And please support me on Patreon:
https://www.patreon.com/user?u=3191693 Instagram: https://www.instagram.com/sirajraval/ Instagram: https://www.instagram.com/sirajraval/
Signup for my newsletter for exciting updates in the field of AI:
https://goo.gl/FZzJ5w
Hit the Join button above to sign up to become a member of my channel for access to exclusive content! Join my AI community: http://chatgptschool.io/ Sign up for my AI Sports betting Bot, WagerGPT! (500 spots available): https://www.wagergpt.xyz
Code for this video:
https://github.com/llSourcell/Q_Learning_Explained/
Adnan's Winning code:
https://github.com/AdnanZahid/ReinforcementLearning
Alberto's runner up code:
https://github.com/alberduris
Please Subscribe! And like. And comment. That's what keeps me going.
Want more inspiration & education? Connect with me:
Twitter: https://twitter.com/sirajraval
Facebook: https://www.facebook.com/sirajology
More learning resources:
http://mnemstudio.org/path-finding-q-learning-tutorial.htm
https://ocw.mit.edu/courses/aeronautics-and-astronautics/16-410-principles-of-autonomy-and-decision-making-fall-2010/lecture-notes/MIT16_410F10_lec23.pdf
http://uhaweb.hartford.edu/compsci/ccli/projects/QLearning.pdf
https://medium.com/@m.alzantot/deep-reinforcement-learning-demysitifed-episode-2-policy-iteration-value-iteration-and-q-978f9e89ddaa
https://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a-html/node19.html
http://cs.stanford.edu/people/karpathy/reinforcejs/gridworld_dp.html
https://www.quora.com/How-is-policy-iteration-different-from-value-iteration
http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching_files/DP.pdf
Join us in the Wizards Slack channel:
http://wizards.herokuapp.com/
And please support me on Patreon:
https://www.patreon.com/user?u=3191693 Instagram: https://www.instagram.com/sirajraval/ Instagram: https://www.instagram.com/sirajraval/
Signup for my newsletter for exciting updates in the field of AI:
https://goo.gl/FZzJ5w
Hit the Join button above to sign up to become a member of my channel for access to exclusive content! Join my AI community: http://chatgptschool.io/ Sign up for my AI Sports betting Bot, WagerGPT! (500 spots available): https://www.wagergpt.xyz
Tags and Topics
Browse our collection to discover more content in these categories.
Video Information
Views
366.0K
Likes
7.0K
Duration
9:27
Published
Dec 2, 2017
User Reviews
4.8
(73) Related Trending Topics
LIVE TRENDSRelated trending topics. Click any trend to explore more videos.
No specific trending topics match this video yet.
Explore All Trends