CMPT 729: Reinforcement Learning
|
Reinforcement learning is the branch of machine learning that studies learning to act. Agents observe, predict, and act to change their environment. Reinforcement learning has notable success in learning to play games and control robots. In this course, we will cover fundamental concepts and algorithms, and introduce techniques that underlie many of the successes from reinforcement learning.
Instructor: Jason Peng (Office Hour: Wed 4-5pm TASC 9213)
TA: Sha Hu (Office Hour: Fri 3-4pm Zoom)
Lectures:
   
Wed 11:30am-12:20pm (SWH10051)
   
Fri 10:30am-12:20pm (AQ5016)
Grading
3 programming assignments (30%)
- A1 (10%) - Due Feb 4
- A2 (10%) - Due Feb 25
- A3 (10%) - Due Mar 10
Paper presentation (20%)
Course project (50%)
- Info
- Proposal (10%) - Due Feb 18
- Presentation (20%) - Apr 10, 12
- Report (20%) - Due Apr 16
Late days: You have 3 late days that you can use for any assignment. You can distribute the late days however you like, but they can only be applied to programming assignments. Once you run out of late days, any late assignments will no longer be accepted.
Syllabus
Jan 10: Introduction
             
|
Jan 12: MDP
             
|
Jan 17: Snow Day (No Class)
             
|
Jan 19: Policy Evaluation
             
|
Jan 24: Behavioral Cloning
             
|
Jan 26: Behavioral Cloning, Policy Search
             
|
|
Jan 31: Policy Gradient
             
|
Feb 2: Policy Gradient
             
|
Feb 7: Q-Learning
             
|
Feb 9: Q-Learning
             
|
Feb 14: Actor-Critic Algorithms
             
|
Feb 16: Actor-Critic Algorithms, Model-Based RL
             
|
|
Feb 21: Reading Break (No Class)
             
|
Feb 23: Reading Break (No Class)
             
|
Feb 28: Model-Based RL
             
|
Mar 1: Model-Based RL, On-Policy vs Off-Policy Algorithms
             
|
|
Mar 6: Advance Policy Gradient
             
|
Mar 8: Advance Policy Gradient, Paper Presentations
             
|
|
Mar 13: Advance Q-Learning
             
|
Mar 15: Advance Q-Learning, Paper Presentations
             
|
|
Mar 20: Exploration
             
|
Mar 22: Exploration, Paper Presentations
             
|
|
Mar 27: Paper Presentations
             
|
Mar 29: Good Friday (No Class)
             
|
Apr 3: Domain Transfer, Paper Presentations
             
|
|
Apr 5: Domain Transfer, Project Presentations
             
|
|
Apr 10: Project Presentations
             
|
Apr 12: Project Presentations
             
|