Master Reinforcement Learning With These 3 Projects
Too locked in to realize my hair was sticking up most the time Resources: https://github.com/ALucek/three-RL-projects https://gymnasium.farama.org/ https://...

Adam Lucek
13.7K views โข Oct 17, 2024

About this video
Too locked in to realize my hair was sticking up most the time
Resources:
https://github.com/ALucek/three-RL-projects
https://gymnasium.farama.org/
https://huggingface.co/learn/deep-rl-course/
Chapters:
00:00 - Intro
00:52 - What Is Reinforcement Learning?
03:48 - Q-Learning: Introduction
05:33 - Q-Learning: Environment Setup
07:46 - Q-Learning: Hyperparameters Explained
11:09 - Q-Learning: Defining Rewards
13:18 - Q-Learning: e-greedy strategy
14:04 - Q-Learning: The Bellman Equation
17:18 - Q-Learning: Training Script
18:07 - Q-Learning: Visualization & Evaluation
20:33 - Deep Q Networks: Introduction
21:28 - Deep Q Networks: Environment Setup
24:22 - Deep Q Networks: Neural Network Setup
25:38 - Deep Q Networks: Agent Setup
26:01 - Deep Q Networks: Experience Replay
28:15 - Deep Q Networks: Q-Target Stabilization
31:15 - Deep Q Networks: Double DQN
32:58 - Deep Q Networks: Hyperparameters & Training
34:03 - Deep Q Networks: Visualization & Evaluation
36:41 - Value-Based vs Policy Based Reinforcement Learning
38:14 - Proximal Policy Optimization: Introduction
41:45 - Proximal Policy Optimization: Environment Setup
43:39 - Proximal Policy Optimization: Image Preprocessing & Stacking
46:55 - Proximal Policy Optimization: Neural Network Architecture
37:53 - Proximal Policy Optimization: Surrogate Objective Function
50:47 - Proximal Policy Optimization: Value Function Loss & Entropy Bonus
51:46 - Proximal Policy Optimization: Creating the Model
53:39 - Proximal Policy Optimization: Training
54:46 - Proximal Policy Optimization: Evaluation & Visualization
57:27 - Tie Back to LLM RLHF
#ai #machinelearning #programming
Resources:
https://github.com/ALucek/three-RL-projects
https://gymnasium.farama.org/
https://huggingface.co/learn/deep-rl-course/
Chapters:
00:00 - Intro
00:52 - What Is Reinforcement Learning?
03:48 - Q-Learning: Introduction
05:33 - Q-Learning: Environment Setup
07:46 - Q-Learning: Hyperparameters Explained
11:09 - Q-Learning: Defining Rewards
13:18 - Q-Learning: e-greedy strategy
14:04 - Q-Learning: The Bellman Equation
17:18 - Q-Learning: Training Script
18:07 - Q-Learning: Visualization & Evaluation
20:33 - Deep Q Networks: Introduction
21:28 - Deep Q Networks: Environment Setup
24:22 - Deep Q Networks: Neural Network Setup
25:38 - Deep Q Networks: Agent Setup
26:01 - Deep Q Networks: Experience Replay
28:15 - Deep Q Networks: Q-Target Stabilization
31:15 - Deep Q Networks: Double DQN
32:58 - Deep Q Networks: Hyperparameters & Training
34:03 - Deep Q Networks: Visualization & Evaluation
36:41 - Value-Based vs Policy Based Reinforcement Learning
38:14 - Proximal Policy Optimization: Introduction
41:45 - Proximal Policy Optimization: Environment Setup
43:39 - Proximal Policy Optimization: Image Preprocessing & Stacking
46:55 - Proximal Policy Optimization: Neural Network Architecture
37:53 - Proximal Policy Optimization: Surrogate Objective Function
50:47 - Proximal Policy Optimization: Value Function Loss & Entropy Bonus
51:46 - Proximal Policy Optimization: Creating the Model
53:39 - Proximal Policy Optimization: Training
54:46 - Proximal Policy Optimization: Evaluation & Visualization
57:27 - Tie Back to LLM RLHF
#ai #machinelearning #programming
Tags and Topics
Browse our collection to discover more content in these categories.
Video Information
Views
13.7K
Likes
450
Duration
01:00:16
Published
Oct 17, 2024
User Reviews
4.6
(2) Related Trending Topics
LIVE TRENDSRelated trending topics. Click any trend to explore more videos.