FILENAME | SIZE |
 | 1. Introduction and Outline/1. Introduction and outline.mp4 | 10.1 MB |
 | 1. Introduction and Outline/1. Introduction and outline.vtt | 12 KB |
 | 1. Introduction and Outline/2. What is Reinforcement Learning.mp4 | 22 MB |
 | 1. Introduction and Outline/2. What is Reinforcement Learning.vtt | 24 KB |
 | 1. Introduction and Outline/3. Where to get the Code.mp4 | 4.5 MB |
 | 1. Introduction and Outline/3. Where to get the Code.vtt | 4.9 KB |
 | 1. Introduction and Outline/4. Strategy for Passing the Course.mp4 | 9.5 MB |
 | 1. Introduction and Outline/4. Strategy for Passing the Course.vtt | 10.7 KB |
 | 2. Return of the Multi-Armed Bandit/1. Problem Setup and The Explore-Exploit Dilemma.mp4 | 6.5 MB |
 | 2. Return of the Multi-Armed Bandit/1. Problem Setup and The Explore-Exploit Dilemma.vtt | 7.1 KB |
 | 2. Return of the Multi-Armed Bandit/2. Epsilon-Greedy.mp4 | 2.8 MB |
 | 2. Return of the Multi-Armed Bandit/2. Epsilon-Greedy.vtt | 2.9 KB |
 | 2. Return of the Multi-Armed Bandit/3. Updating a Sample Mean.mp4 | 2.2 MB |
 | 2. Return of the Multi-Armed Bandit/3. Updating a Sample Mean.vtt | 2 KB |
 | 2. Return of the Multi-Armed Bandit/4. Comparing Different Epsilons.mp4 | 8 MB |
 | 2. Return of the Multi-Armed Bandit/4. Comparing Different Epsilons.vtt | 4.9 KB |
 | 2. Return of the Multi-Armed Bandit/5. Optimistic Initial Values.mp4 | 5.1 MB |
 | 2. Return of the Multi-Armed Bandit/5. Optimistic Initial Values.vtt | 3 KB |
 | 2. Return of the Multi-Armed Bandit/6. UCB1.mp4 | 8.2 MB |
 | 2. Return of the Multi-Armed Bandit/6. UCB1.vtt | 7.4 KB |
 | 2. Return of the Multi-Armed Bandit/7. Bayesian Thompson Sampling.mp4 | 51.8 MB |
 | 2. Return of the Multi-Armed Bandit/7. Bayesian Thompson Sampling.vtt | 11 KB |
 | 2. Return of the Multi-Armed Bandit/8. Thompson Sampling vs. Epsilon-Greedy vs. Optimistic Initial Values vs. UCB1.mp4 | 10.6 MB |
 | 2. Return of the Multi-Armed Bandit/8. Thompson Sampling vs. Epsilon-Greedy vs. Optimistic Initial Values vs. UCB1.vtt | 5.5 KB |
 | 2. Return of the Multi-Armed Bandit/9. Nonstationary Bandits.mp4 | 7.5 MB |
 | 2. Return of the Multi-Armed Bandit/9. Nonstationary Bandits.vtt | 7.1 KB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/1. Naive Solution to Tic-Tac-Toe.mp4 | 6.1 MB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/1. Naive Solution to Tic-Tac-Toe.vtt | 6.6 KB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/10. Tic Tac Toe Code Main Loop and Demo.mp4 | 9.4 MB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/10. Tic Tac Toe Code Main Loop and Demo.vtt | 8.4 KB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/11. Tic Tac Toe Summary.mp4 | 8.3 MB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/11. Tic Tac Toe Summary.vtt | 9.3 KB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/2. Components of a Reinforcement Learning System.mp4 | 12.7 MB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/2. Components of a Reinforcement Learning System.vtt | 13.4 KB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/3. Notes on Assigning Rewards.mp4 | 4.2 MB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/3. Notes on Assigning Rewards.vtt | 4.5 KB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/4. The Value Function and Your First Reinforcement Learning Algorithm.mp4 | 103.7 MB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/4. The Value Function and Your First Reinforcement Learning Algorithm.vtt | 21.7 KB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/5. Tic Tac Toe Code Outline.mp4 | 5 MB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/5. Tic Tac Toe Code Outline.vtt | 5.9 KB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/6. Tic Tac Toe Code Representing States.mp4 | 4.4 MB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/6. Tic Tac Toe Code Representing States.vtt | 4.5 KB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/7. Tic Tac Toe Code Enumerating States Recursively.mp4 | 9.8 MB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/7. Tic Tac Toe Code Enumerating States Recursively.vtt | 10.3 KB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/8. Tic Tac Toe Code The Environment.mp4 | 10 MB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/8. Tic Tac Toe Code The Environment.vtt | 10.9 KB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/9. Tic Tac Toe Code The Agent.mp4 | 9 MB |
 | 3. Build an Intelligent Tic-Tac-Toe Agent/9. Tic Tac Toe Code The Agent.vtt | 10 KB |
 | 4. Markov Decision Proccesses/1. Gridworld.mp4 | 3.4 MB |
 | 4. Markov Decision Proccesses/1. Gridworld.vtt | 3.7 KB |
 | 4. Markov Decision Proccesses/2. The Markov Property.mp4 | 7.2 MB |
 | 4. Markov Decision Proccesses/2. The Markov Property.vtt | 7.7 KB |
 | 4. Markov Decision Proccesses/3. Defining and Formalizing the MDP.mp4 | 6.6 MB |
 | 4. Markov Decision Proccesses/3. Defining and Formalizing the MDP.vtt | 7.2 KB |
 | 4. Markov Decision Proccesses/4. Future Rewards.mp4 | 5.2 MB |
 | 4. Markov Decision Proccesses/4. Future Rewards.vtt | 5.5 KB |
 | 4. Markov Decision Proccesses/5. Value Function Introduction.mp4 | 19.7 MB |
 | 4. Markov Decision Proccesses/5. Value Function Introduction.vtt | 14.5 KB |
 | 4. Markov Decision Proccesses/6. Value Functions.mp4 | 8.3 MB |
 | 4. Markov Decision Proccesses/6. Value Functions.vtt | 11 KB |
 | 4. Markov Decision Proccesses/7. Bellman Examples.mp4 | 87.1 MB |
 | 4. Markov Decision Proccesses/7. Bellman Examples.vtt | 25.8 KB |
 | 4. Markov Decision Proccesses/8. Optimal Policy and Optimal Value Function.mp4 | 3.2 MB |
 | 4. Markov Decision Proccesses/8. Optimal Policy and Optimal Value Function.vtt | 4.7 KB |
 | 4. Markov Decision Proccesses/9. MDP Summary.mp4 | 2.4 MB |
 | 4. Markov Decision Proccesses/9. MDP Summary.vtt | 2.4 KB |
 | 5. Dynamic Programming/1. Intro to Dynamic Programming and Iterative Policy Evaluation.mp4 | 4.8 MB |
 | 5. Dynamic Programming/1. Intro to Dynamic Programming and Iterative Policy Evaluation.vtt | 4.9 KB |
 | 5. Dynamic Programming/10. Dynamic Programming Summary.mp4 | 8.3 MB |
 | 5. Dynamic Programming/10. Dynamic Programming Summary.vtt | 8.6 KB |
 | 5. Dynamic Programming/2. Gridworld in Code.mp4 | 11.5 MB |
 | 5. Dynamic Programming/2. Gridworld in Code.vtt | 10 KB |
 | 5. Dynamic Programming/3. Iterative Policy Evaluation in Code.mp4 | 12.1 MB |
 | 5. Dynamic Programming/3. Iterative Policy Evaluation in Code.vtt | 9.3 KB |
 | 5. Dynamic Programming/4. Policy Improvement.mp4 | 4.5 MB |
 | 5. Dynamic Programming/4. Policy Improvement.vtt | 4.7 KB |
 | 5. Dynamic Programming/5. Policy Iteration.mp4 | 3.1 MB |
 | 5. Dynamic Programming/5. Policy Iteration.vtt | 3.2 KB |
 | 5. Dynamic Programming/6. Policy Iteration in Code.mp4 | 7.6 MB |
 | 5. Dynamic Programming/6. Policy Iteration in Code.vtt | 5.6 KB |
 | 5. Dynamic Programming/7. Policy Iteration in Windy Gridworld.mp4 | 9.1 MB |
 | 5. Dynamic Programming/7. Policy Iteration in Windy Gridworld.vtt | 7.5 KB |
 | 5. Dynamic Programming/8. Value Iteration.mp4 | 6.2 MB |
 | 5. Dynamic Programming/8. Value Iteration.vtt | 6.4 KB |
 | 5. Dynamic Programming/9. Value Iteration in Code.mp4 | 4.9 MB |
 | 5. Dynamic Programming/9. Value Iteration in Code.vtt | 3 KB |
 | 6. Monte Carlo/1. Monte Carlo Intro.mp4 | 5 MB |
 | 6. Monte Carlo/1. Monte Carlo Intro.vtt | 5.4 KB |
 | 6. Monte Carlo/2. Monte Carlo Policy Evaluation.mp4 | 8.8 MB |
 | 6. Monte Carlo/2. Monte Carlo Policy Evaluation.vtt | 9.8 KB |
 | 6. Monte Carlo/3. Monte Carlo Policy Evaluation in Code.mp4 | 7.9 MB |
 | 6. Monte Carlo/3. Monte Carlo Policy Evaluation in Code.vtt | 5.6 KB |
 | 6. Monte Carlo/4. Policy Evaluation in Windy Gridworld.mp4 | 7.8 MB |
 | 6. Monte Carlo/4. Policy Evaluation in Windy Gridworld.vtt | 4.9 KB |
 | 6. Monte Carlo/5. Monte Carlo Control.mp4 | 9.3 MB |
 | 6. Monte Carlo/5. Monte Carlo Control.vtt | 9.3 KB |
 | 6. Monte Carlo/6. Monte Carlo Control in Code.mp4 | 10.2 MB |
 | 6. Monte Carlo/6. Monte Carlo Control in Code.vtt | 5.3 KB |
 | 6. Monte Carlo/7. Monte Carlo Control without Exploring Starts.mp4 | 4.6 MB |