Master Reinforcement Learning With These 3 Projects
Too locked in to realize my hair was sticking up most the time Resources: https://github.com/ALucek/three-RL-projects https://gymnasium.farama.org/ https://...

Adam Lucek
13.7K views โข Oct 17, 2024

About this video
Too locked in to realize my hair was sticking up most the time
Resources:
https://github.com/ALucek/three-RL-projects
https://gymnasium.farama.org/
https://huggingface.co/learn/deep-rl-course/
Chapters:
00:00 - Intro
00:52 - What Is Reinforcement Learning?
03:48 - Q-Learning: Introduction
05:33 - Q-Learning: Environment Setup
07:46 - Q-Learning: Hyperparameters Explained
11:09 - Q-Learning: Defining Rewards
13:18 - Q-Learning: e-greedy strategy
14:04 - Q-Learning: The Bellman Equation
17:18 - Q-Learning: Training Script
18:07 - Q-Learning: Visualization & Evaluation
20:33 - Deep Q Networks: Introduction
21:28 - Deep Q Networks: Environment Setup
24:22 - Deep Q Networks: Neural Network Setup
25:38 - Deep Q Networks: Agent Setup
26:01 - Deep Q Networks: Experience Replay
28:15 - Deep Q Networks: Q-Target Stabilization
31:15 - Deep Q Networks: Double DQN
32:58 - Deep Q Networks: Hyperparameters & Training
34:03 - Deep Q Networks: Visualization & Evaluation
36:41 - Value-Based vs Policy Based Reinforcement Learning
38:14 - Proximal Policy Optimization: Introduction
41:45 - Proximal Policy Optimization: Environment Setup
43:39 - Proximal Policy Optimization: Image Preprocessing & Stacking
46:55 - Proximal Policy Optimization: Neural Network Architecture
37:53 - Proximal Policy Optimization: Surrogate Objective Function
50:47 - Proximal Policy Optimization: Value Function Loss & Entropy Bonus
51:46 - Proximal Policy Optimization: Creating the Model
53:39 - Proximal Policy Optimization: Training
54:46 - Proximal Policy Optimization: Evaluation & Visualization
57:27 - Tie Back to LLM RLHF
#ai #machinelearning #programming
Resources:
https://github.com/ALucek/three-RL-projects
https://gymnasium.farama.org/
https://huggingface.co/learn/deep-rl-course/
Chapters:
00:00 - Intro
00:52 - What Is Reinforcement Learning?
03:48 - Q-Learning: Introduction
05:33 - Q-Learning: Environment Setup
07:46 - Q-Learning: Hyperparameters Explained
11:09 - Q-Learning: Defining Rewards
13:18 - Q-Learning: e-greedy strategy
14:04 - Q-Learning: The Bellman Equation
17:18 - Q-Learning: Training Script
18:07 - Q-Learning: Visualization & Evaluation
20:33 - Deep Q Networks: Introduction
21:28 - Deep Q Networks: Environment Setup
24:22 - Deep Q Networks: Neural Network Setup
25:38 - Deep Q Networks: Agent Setup
26:01 - Deep Q Networks: Experience Replay
28:15 - Deep Q Networks: Q-Target Stabilization
31:15 - Deep Q Networks: Double DQN
32:58 - Deep Q Networks: Hyperparameters & Training
34:03 - Deep Q Networks: Visualization & Evaluation
36:41 - Value-Based vs Policy Based Reinforcement Learning
38:14 - Proximal Policy Optimization: Introduction
41:45 - Proximal Policy Optimization: Environment Setup
43:39 - Proximal Policy Optimization: Image Preprocessing & Stacking
46:55 - Proximal Policy Optimization: Neural Network Architecture
37:53 - Proximal Policy Optimization: Surrogate Objective Function
50:47 - Proximal Policy Optimization: Value Function Loss & Entropy Bonus
51:46 - Proximal Policy Optimization: Creating the Model
53:39 - Proximal Policy Optimization: Training
54:46 - Proximal Policy Optimization: Evaluation & Visualization
57:27 - Tie Back to LLM RLHF
#ai #machinelearning #programming
Tags and Topics
Browse our collection to discover more content in these categories.
Video Information
Views
13.7K
Likes
450
Duration
01:00:16
Published
Oct 17, 2024
User Reviews
4.6
(2) Related Trending Topics
LIVE TRENDSRelated trending topics. Click any trend to explore more videos.
No specific trending topics match this video yet.
Explore All Trends