Tentative Class Schedule

All the lecture scribbles can be found here.

Lec Date Topic Lecture Notes Reference Readings
1 29 Aug 2022 Introduction to RL, Immediate RL Logistics Slides,
Chapter 01, Chapter 02
SB Chapter 1 and 2 Computing Machinery and Intelligence by Alan Turing,
Probability Review,
Linear Algebra Review
2 12 Sep 2022 Immediate RL, MDP Chapter 02, Chapter-3 SB Chapter 2 and 3 Reward is enough
3 19 Sep 2022 DP - value iteration, policy iteration, Monte-Carlo Methods Chapter-04, Chapter-05 SB Chapter 4 and 5  
4 26 Sep 2022 Off-policy Monte-Carlo, TD Learning, n-step Bootstrapping Chapter-05
Chapter-6
SB Chapter 5, 6, and 7  
5 04 Oct 2022 n-step methods, Function Approximation with on-policy methods Chapter-6
Chapter-7
SB Chapter 7, 9, and 10  
6 17 Oct 2022 Average reward setting, Function approximation with off-policy methods, Gradient Bandits Chapter-7, Chapter-8
Chapter-2
SB Chapter 10, 11, and 2 DQN, Double DQN, Prioritized Experience Replay
7 24 Oct 2022 Policy Gradient Methods, REINFORCE, Actor-Critic, A3C Check scribbles SB Book chapter-13 and A3C Paper Policy Gradient with Function Approximation
8 31 Oct 2022 Deterministic Policy Gradient, DDPG, Model-based RL, Dyna Check scribbles DPG, DDPG, SB Chapter 08  
9 07 Nov 2022 MCTS, Random Shooting, CEM, back to policy gradients: TRPO, PPO Check scribbles SB Chapter 08, TRPO, PPO TRPO, PPO
10 14 Nov 2022 Eligibility Traces, Partially Observable MDPs (POMDPs) Check scribbles SB Chapter 12, POMDP  
11 21 Nov 2022 Hierarchical RL Check scribbles Hierarchical Approaches, HRL Survey Option-Critic
12 28 Nov 2022 Imitation Learning, Offline RL, Multi-agent RL Check scribbles Chapter-18 in Algorithms for Decision Making Offline RL - Tutorial
13 05 Dec 2022 Course Summary, Frontiers in RL, What is Next? Check scribbles   Multi-objective RL - Review

Tutorials

Lec Date Time Topic Lecture Videos Lecture Materials
1 02 Sep 2022 9 am to 10:30 am Probability Video Slides
2 06 Sep 2022 9 am to 10:30 am Python, Numpy, Plotting Video Notebook
3 09 Sep 2022 9 am to 10:30 am Linear Algebra Video Notebook
4 19 Oct 2022 9 am to 10:30 am PyTorch Video Notebook
5 09 Nov 2022 10 am to 11:30 am RLHive Video