Schedule

Tentative Class Schedule

All the lecture scribbles can be found here.

Lec	Date	Instructor	Topic	Mandatory Readings	Optional Readings
1	26 Aug 2024	Sarath Chandar	Introduction to Reinforcement Learning, Sequential Decision Problems	Logistics and Intro Slides, Sutton and Barto Chapter 01, Reward is Enough, Scalar Reward is Not Enough, Faulty Reward Functions	Computing Machinery and Intelligence by Alan Turing, Probability Review, Linear Algebra Review
2	09 Sep 2024	Sarath Chandar	Immediate Reinforcement Learning and Multi-armed Bandits	Sutton and Barto Chapter 02, Proof of UCB1’s regret bound by Ann He and Jeremy Kun	3 lectures on UCB1 by Prof. Balaraman Ravindran: UCB1, Concentration Bounds, UCB Regret Bound
3	16 Sep 2024	Nishanth Anand	Markov Decision Process	Sutton and Barto Chapter 03
4	23 Sep 2024	Nishanth Anand	Dynamic Programming, Monte-Carlo Methods	Sutton and Barto Chapter 04 and Chapter 05
5	01 Oct 2024	Nishanth Anand	Monte-Carlo Methods, Temporal Difference (TD) Learning - I	Sutton and Barto Chapter 05 and Chapter 06
6	07 Oct 2024	Nishanth Anand	Temporal Difference (TD) Learning - II	Sutton and Barto Chapter 06 and Chapter 07
7	21 Oct 2024	Sarath Chandar	Function Approximation - I	Sutton and Barto Chapter 09
	25 Oct 2024		Mid-Term Exam [12:45 pm to 2:15 pm]	Last year Mid-Term Questions
8	28 Oct 2024	Sarath Chandar	Function Approximation - II	Sutton and Barto Chapter 10, and Chapter 11, DQN, DQN Nature, Double DQN, Rainbow, Prioritized Experience Replay	Dueling DQN, Distributional RL
9	04 Nov 2024	Sarath Chandar	Policy Gradients	Sutton and Barto Chapter 13, Off-Policy Actor-Critic, A3C paper	REINFORCE paper, Soft Q-Learning
10	11 Nov 2024	Sarath Chandar	Determistic Policy Gradients, Natural Policy Gradient	DPG, DDPG, TD3, NPG Explained	cleanrl, Deep RL that Matters
11	18 Nov 2024	Sarath Chandar	Natural Policy Gradients Continued, Eligibility Traces, Multi-Agent RL	NPG Explained, TRPO, PPO, Sutton and Barto Chapter 12, Multi-Agent RL
12	25 Nov 2024	Sarath Chandar	Model-based RL	SB Chapter 8, AlphaGo, Dreamer	Dreamer v2, Dreamer v3, MuZero
13	02 Dec 2024	Sarath Chandar	Offline RL, Hierarchical RL, Frontiers in RL	CQL, HRL	Continual RL
	05 Dec 2024		Final Exam [1:30 pm to 4 pm]	Last year final exam questions	The exam will be based on the first 12 weeks of lectures.

Tutorials

Lec	Date	Time	Topic	Lecture Videos	Lecture Materials
1	30 Aug 2024		No lab. Use the lab time for reading.
2	06 Sep 2024		No lab. Use the lab time for reading.
3	13 Sep 2024	11:30 pm - 02:45 pm	Online Office Hours by Ali Rahimi-Kalahroudi (link in Piazza)
4	20 Sep 2024	11:30 pm - 02:45 pm	Online Office Hours by Ali Rahimi-Kalahroudi (link in Piazza)
5	27 Sep 2024	11:30 pm - 02:45 pm	Online Office Hours by Esther Derman (link in Piazza)
6	04 Oct 2024		No lab. Use the lab time for doing the assignment.
	07 Oct 2024	4:00 pm - 7:15 pm	Online Office Hours by Esther Derman (link in Piazza)
7	11 Oct 2024	1 pm to 2:30 pm	Jax Tutorial by Artem Zholus (in person)	Recording	Colab Notebook
	11 Oct 2024	2:30 pm to 4:30 pm	Online Office Hours by Esther Derman (link in Piazza)
	16 Oct 2024	10:00 am to 12:00 pm	Online Office Hours by Esther Derman (link in Piazza)
	16 Oct 2024	10:00 am to 11:00 am	Online Office Hours by Sarath Chandar and Nishanth Anand (link in Piazza)
	21 Oct 2024	04:00 pm to 06:00 pm	Online Office Hours by Esther Derman (link in Piazza)
8	25 Oct 2024	12:45 pm to 2:15 pm	Mid-term Exam
9	01 Nov 2024		No lab. Use the lab time for doing the assignment.
	04 Nov 2024	9 am to 11 am	Online Office Hours by Artem Zholus (link in Piazza)
10	08 Nov 2024	11:30 pm to 2:30 pm	Online Office Hours by Artem Zholus (link in Piazza)
11	15 Nov 2024	11:30 pm - 02:45 pm	Online Office Hours by Antoine Clavaud (link in Piazza)
12	22 Nov 2024	11:30 pm - 02:45 pm	Online Office Hours by Antoine Clavaud (link in Piazza)
13	29 Nov 2024	11:30 pm - 02:45 pm	Online Office Hours by Antoine Clavaud (link in Piazza)