Stanford CS224R: RL Techniques for LLMs (Spring 2025) πŸ“š

Lecture 9 explores reinforcement learning methods for large language models, focusing on preferences and applications. April 30, 2025.

Stanford CS224R: RL Techniques for LLMs (Spring 2025) πŸ“š
Stanford Online
3.5K views β€’ Dec 8, 2025
Stanford CS224R: RL Techniques for LLMs (Spring 2025) πŸ“š

About this video

View course details: https://online.stanford.edu/courses/xcs224r-deep-reinforcement-learning

April 30, 2025
This guest lecture covers RL for LLMs: preference optimization.

To learn more about enrolling in the graduate course, visit: https://online.stanford.edu/courses/cs224r-deep-reinforcement-learning

To follow along with the course schedule and syllabus, visit:
https://cs224r.stanford.edu/

Archit Sharma
Researcher on Gemini team, Lead author of DPO

Tags and Topics

Browse our collection to discover more content in these categories.

Video Information

Views

3.5K

Likes

62

Duration

01:02:51

Published

Dec 8, 2025

User Reviews

4.6
(3)
Rate:

Related Trending Topics

LIVE TRENDS

Related trending topics. Click any trend to explore more videos.