Schedule
| Date | Lecture | Readings | ||
|---|---|---|---|---|
| 08/21 |
Lecture #1
(Bo Dai):
Course Overview [ slides ] |
|||
| Module I: Background Knowledge | ||||
| 08/23 |
Lecture #2
(Bo Dai):
Optimization: convex preliminary [ notes ] |
|
||
| 08/28 |
Lecture #3
(Bo Dai):
Optimization: convex set and function [ slides | notes ] |
|
||
| 08/30 |
Lecture #4
(Bo Dai):
Optimization: gradient descent [ notes ] |
|
||
| 09/04 | No Class (Labor Day) | |||
| 09/06 |
Lecture #5
(Bo Dai):
Sampling: basic sampling method [ notes ] |
|
||
| 09/11 |
Lecture #6
(Bo Dai):
Sampling: Acceptance-Rejection & Importance Sampling [ notes ] |
|
||
| 09/13 |
Lecture #7
(Bo Dai):
Sampling: MCMC (MH, Gibbs & Hamiltonian) [ notes ] |
|
||
| 09/18 |
Lecture #8
(Bo Dai):
Density Parametrization [ notes ] |
|
||
| 09/20 | No Class | |||
| 09/25 |
Lecture #9
(Bo Dai):
Neural Network Revisit [ notes ] |
|||
| Module II: Deep Generative Models | ||||
| 09/27 |
Lecture #10
(Bo Dai):
EBM (CD, Score Matching) [ notes ] |
|
||
| 10/02 |
Lecture #11
(Bo Dai):
Autoregressive Model [ notes ] |
|
||
| 10/04 |
Lecture #12
(Bo Dai):
VAE and Diffusion [ notes ] |
|
||
| 10/09 | No Class (Fall Break) | |||
| 10/11 |
Lecture #13
(Ruiqi Gao (Virtual) - Google DeepMind):
Diffusion Process [ slides | notes | recording (Canvas) ] |
|
||
| 10/16 |
Lecture #14
(Lingkai Kong (in-person)):
Decision-Focused Learning [ slides | notes ] |
|||
| 10/18 |
Lecture #15
(Sherry Yang (Virtual) - Google DeepMind):
Foundation Models for Decision Making: Problems, Methods, and Applications [ slides | notes | recording (Canvas) ] |
|||
| 10/23 |
Lecture #16
(Bo Dai):
Generative Adversarial Nets (GAN) [ notes ] |
|
||
| 10/25 |
Lecture #17
(Bo Dai):
Normalizing Flow Models [ notes ] |
|
||
| Module III: Differentiable Programming | ||||
| 10/30 |
Lecture #18
(Bo Dai):
Differentiable Algorithm I: differentiable optimizer/dynamic programming [ notes ] |
|
||
| 11/01 |
Lecture #19
(Bo Dai):
Differentiable Algorithm II: top-K/sorting layer [ notes ] |
|
||
| Module IV: Reinforcement Learning | ||||
| 11/06 |
Lecture #20
(Bo Dai):
MDP: Bellman Recursion [ notes ] |
|
||
| 11/08 |
Lecture #21
(Bo Dai):
DP: Value and Policy Iteration [ notes ] |
|
||
| 11/13 |
Lecture #22
(Bo Dai):
Learning with MDPs [ notes ] |
|
||
| 11/15 |
Lecture #23
(Bo Dai):
Policy Gradient and Actor-Critic [ notes ] |
|||
| 11/20 |
Lecture #24
(Bo Dai):
Imitation Learning and RLHF [ slides | notes ] |
|
||
| 11/22 | No Class (Student Recess) | |||
| 11/27 |
Lecture #25
(Bo Dai):
Review [ notes ] |
|||
| 11/29 |
Lecture #26
:
Project Presentations (online session: 4:00 - 6:15 PM) |
|||
| 12/11 | Project Report Due | |||