Basic knowledge of probability theory and statistics (MTH2302 or equivalent), calculus, and linear algebra (MTH1007 or equivalent) is required.

You should already be familiar with the following sections of the book Mathematics for Machine Learning:

  • Section 2: Subsections 2.1 to 2.6 (inclusive)
  • Section 3: All subsections
  • Section 4: Subsections 4.1 to 4.5.1 (inclusive)
  • Section 5: Subsections 5.1, 5.2, 5.3, 5.4, 5.5, 5.7
  • Section 6: Subsections 6.1 to 6.5 (inclusive)

The course is intended for hard-working, technically skilled, highly motivated students. Participants will be expected to display initiative, creativity, scientific rigour, critical thinking, and good communication skills.

If you do not have the necessary prerequisites, expect to spend considerably more time on this course than a 4-credit course normally requires.

Useful Online Courses Covering the Prerequisites

While I do not expect you to know everything from the following courses, I recommend that you take these video courses at some point if you are serious about reinforcement learning.

Video Recordings

The lectures and tutorials might be recorded and released to the public. By registering for the course, you agree to be recorded and to the release of these videos.

Programming Language

We will use Python 3 in all the assignments.

Evaluation Criteria

The class grade will be based on the following components:

  • 4 Theory/Programming assignments (individual) - 30%
  • Mid-term examination - 15%
  • End-term examination - 30%
  • Course project (team) - 25%
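
As a rough illustration of how these weights combine, here is a minimal Python sketch. The names WEIGHTS and final_grade are hypothetical, and the sketch assumes each component has already been reduced to a single percentage between 0 and 100; how the four assignments are combined into the assignment component is not specified here.

```python
# Component weights from the list above (fractions of the final grade).
WEIGHTS = {
    "assignments": 0.30,  # 4 theory/programming assignments (individual)
    "midterm": 0.15,
    "endterm": 0.30,
    "project": 0.25,      # course project (team)
}


def final_grade(scores):
    """Weighted sum of component percentages (each assumed to be in [0, 100])."""
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


if __name__ == "__main__":
    # Made-up scores, for illustration only.
    example = {"assignments": 80, "midterm": 70, "endterm": 90, "project": 85}
    print(final_grade(example))
```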

We will use Gradescope for all the assignments. More detailed instructions on how to use Gradescope will be released at the beginning of the course.

Late Submissions

If you submit an assignment or project report after the deadline, the following penalty scheme applies:

  • Up to 24 hours (1 day) after the deadline: 5% penalty.
  • More than 24 hours and up to 48 hours (2 days) after the deadline: 10% penalty.
  • More than 48 hours and up to 72 hours (3 days) after the deadline: 20% penalty.
  • More than 72 hours after the deadline: submissions are not accepted.
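
The penalty tiers above can be sketched in Python as follows. This is only an illustration: the names PENALTY_TIERS, late_penalty, and penalized_score are hypothetical, and the sketch assumes the penalty is deducted as a fraction of the score you earned, which is an interpretation rather than something stated above.

```python
from typing import Optional

# (upper limit in hours after the deadline, penalty fraction), from the list above.
PENALTY_TIERS = [(24, 0.05), (48, 0.10), (72, 0.20)]


def late_penalty(hours_late: float) -> Optional[float]:
    """Return the penalty fraction, or None if the submission is not accepted."""
    if hours_late <= 0:
        return 0.0  # submitted on time
    for limit, penalty in PENALTY_TIERS:
        if hours_late <= limit:
            return penalty
    return None  # more than 72 hours late


def penalized_score(raw_score: float, hours_late: float) -> Optional[float]:
    """Apply the late penalty to a raw score; None means not accepted."""
    penalty = late_penalty(hours_late)
    if penalty is None:
        return None
    # Assumption: the penalty is taken as a fraction of the raw score.
    return raw_score * (1.0 - penalty)
```

For example, under this reading a report submitted 30 hours late falls into the second tier and loses 10% of its score.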