ECE 586: Markov Decision Processes and Reinforcement Learning (Spring 2019)

Scribing Template

Use this .tex file for scribing. Output should look like this.

Additional Reading

Title Author File Final Exam Inclusion
Overview Dimitrios Katselis .pdf included
Stochastic Approximation Dimitrios Katselis .pdf not included
ODE Appendix Dimitrios Katselis .pdf not included
HMMs, POMDPs and Linear Quadratic Regulation Dimitrios Katselis .pdf not included

Lecture Notes

Note: Files with status "not checked" correspond to course notes documents, exactly as they were submitted by the scribing team. These files will be gradually corrected (if necessary) by me and Joseph. The status will then change to "checked". Any posterior corrections on your scribed files will not affect the grade that you received for scribing.

Final Exam Material: The material for the final exam is up to and including the Law of Large Numbers. Some of the remaining files may be "checked" after the end of the semester.

Title File Scribing Status
Markov Chains 1 lec1 Zeyu Zhou, Lucas Buccafusca checked
Markov Chains 2 lec2 Andrew Chen, Zih-Siou Hung checked
Steepest and Gradient Descent 1 lec3 Duc Phan, Alireza Moradzadeh checked
Steepest and Gradient Descent 2 lec4 William Wei, Aditya Deshmukh checked
Gradient Projection and Stochastic Gradient Descent 1 lec5 Xingyu Bai, Tiancheng Zhao checked
Stochastic Gradient Descent 2 lec6 Kunhao Li, Zhikai Guo checked
Neural Networks lec7 Alireza Moradzadeh, Cathy Shih not checked
Multi-armed bandits lec8 Brando Miranda, Hanwen Hu checked
Discounted Cost MDPs, Value and Policy Iteration, Monotone Policies lec9 Tianhao Wu, Xiaoyang Bai, Junchi Yang, Shen Yan checked
Q-Learning, Function Approximation, Temporal Difference Learning lec10 Zeyu Zhou, Lucas Buccafusca, William Wei, Cathy Shih, Kunhao Li, Zhikai Guo, Duc Phan checked
Law of Large Numbers lec11 Andrew Chen, Zih-Siou Hung not checked
Average Cost MDPs lec12 Xingyu Bai, Tiancheng Zhao not checked
Average Cost Q-Learning, Linear Programming and Constrained MDPs -- not scribed
Policy Gradient Part I lec14 Xiaoyang Bai, Junchi Yang not checked

Useful Resources

The following links provide useful material (papers, books, implementations of algorithms) for Reinforcement Learning.

Awesome Reinforcement Learning

Spinning Up in Deep RL