Stanford CS224R: RL Techniques for LLMs (Spring 2025) π
Lecture 9 explores reinforcement learning methods for large language models, focusing on preferences and applications. April 30, 2025.

Stanford Online
3.5K views β’ Dec 8, 2025

About this video
View course details: https://online.stanford.edu/courses/xcs224r-deep-reinforcement-learning
April 30, 2025
This guest lecture covers RL for LLMs: preference optimization.
To learn more about enrolling in the graduate course, visit: https://online.stanford.edu/courses/cs224r-deep-reinforcement-learning
To follow along with the course schedule and syllabus, visit:
https://cs224r.stanford.edu/
Archit Sharma
Researcher on Gemini team, Lead author of DPO
April 30, 2025
This guest lecture covers RL for LLMs: preference optimization.
To learn more about enrolling in the graduate course, visit: https://online.stanford.edu/courses/cs224r-deep-reinforcement-learning
To follow along with the course schedule and syllabus, visit:
https://cs224r.stanford.edu/
Archit Sharma
Researcher on Gemini team, Lead author of DPO
Tags and Topics
Browse our collection to discover more content in these categories.
Video Information
Views
3.5K
Likes
62
Duration
01:02:51
Published
Dec 8, 2025
User Reviews
4.6
(3) Related Trending Topics
LIVE TRENDSRelated trending topics. Click any trend to explore more videos.
Trending Now