Lex Fridman's MIT lecture introducing deep reinforcement learning: how agents learn to act through trial, error, and reward.

Lex Fridman — AI researcher and MIT lecturer teaching the 6.S091 deep learning course; the lecture is solo, so the host is also the speaker.
This is a solo MIT 6.S091 lecture in which Lex Fridman overviews the field of deep reinforcement learning, marrying neural network representations with the ability to act on them. He frames RL as learning by experience through trial and error, contrasts it with supervised learning, and emphasizes that the hardest, most consequential design choices are the environment and the reward structure. He walks through core algorithm families (model-based, value-based, policy-based) and landmark systems like DQN, A2C/A3C, DDPG, PPO/TRPO, and AlphaGo/AlphaZero. He closes with the sim-to-real gap as the field's central unsolved challenge and practical advice for getting into RL research.
Books, products and media the guest or host genuinely endorsed here — with the buy link.
Affiliate link — we may earn a commission at no extra cost to you.
Friedrich Nietzsche
“there's been a few books a couple written throughout the last few centuries from Socrates to Nietzsche I recommend the latter especially” — Lex Fridman 00:04:11Find it on Amazon