Schedule
Date | Lecture | Readings | ||
---|---|---|---|---|
08/21 |
Lecture #1
(Bo Dai):
Course Overview [ slides ] |
|||
Module I: Background Knowledge | ||||
08/23 |
Lecture #2
(Bo Dai):
Optimization: convex preliminary [ notes ] |
|
||
08/28 |
Lecture #3
(Bo Dai):
Optimization: convex set and function [ slides | notes ] |
|
||
08/30 |
Lecture #4
(Bo Dai):
Optimization: gradient descent [ notes ] |
|
||
09/04 | No Class (Labor Day) | |||
09/06 |
Lecture #5
(Bo Dai):
Sampling: basic sampling method [ notes ] |
|
||
09/11 |
Lecture #6
(Bo Dai):
Sampling: Acceptance-Rejection & Importance Sampling [ notes ] |
|
||
09/13 |
Lecture #7
(Bo Dai):
Sampling: MCMC (MH, Gibbs & Hamiltonian) [ notes ] |
|
||
09/18 |
Lecture #8
(Bo Dai):
Density Parametrization [ notes ] |
|
||
09/20 | No Class | |||
09/25 |
Lecture #9
(Bo Dai):
Neural Network Revisit [ notes ] |
|||
Module II: Deep Generative Models | ||||
09/27 |
Lecture #10
(Bo Dai):
EBM (CD, Score Matching) [ notes ] |
|
||
10/02 |
Lecture #11
(Bo Dai):
Autoregressive Model [ notes ] |
|
||
10/04 |
Lecture #12
(Bo Dai):
VAE and Diffusion [ notes ] |
|
||
10/09 | No Class (Fall Break) | |||
10/11 |
Lecture #13
(Ruiqi Gao (Virtual) - Google DeepMind):
Diffusion Process [ slides | notes | recording (Canvas) ] |
|
||
10/16 |
Lecture #14
(Lingkai Kong (in-person)):
Decision-Focused Learning [ slides | notes ] |
|||
10/18 |
Lecture #15
(Sherry Yang (Virtual) - Google DeepMind):
Foundation Models for Decision Making: Problems, Methods, and Applications [ slides | notes | recording (Canvas) ] |
|||
10/23 |
Lecture #16
(Bo Dai):
Generative Adversarial Nets (GAN) [ notes ] |
|
||
10/25 |
Lecture #17
(Bo Dai):
Normalizing Flow Models [ notes ] |
|
||
Module III: Differentiable Programming | ||||
10/30 |
Lecture #18
(Bo Dai):
Differentiable Algorithm I: differentiable optimizer/dynamic programming [ notes ] |
|
||
11/01 |
Lecture #19
(Bo Dai):
Differentiable Algorithm II: top-K/sorting layer [ notes ] |
|
||
Module IV: Reinforcement Learning | ||||
11/06 |
Lecture #20
(Bo Dai):
MDP: Bellman Recursion [ notes ] |
|
||
11/08 |
Lecture #21
(Bo Dai):
DP: Value and Policy Iteration [ notes ] |
|
||
11/13 |
Lecture #22
(Bo Dai):
Learning with MDPs [ notes ] |
|
||
11/15 |
Lecture #23
(Bo Dai):
Policy Gradient and Actor-Critic [ notes ] |
|||
11/20 |
Lecture #24
(Bo Dai):
Imitation Learning and RLHF [ slides | notes ] |
|
||
11/22 | No Class (Student Recess) | |||
11/27 |
Lecture #25
(Bo Dai):
Review [ notes ] |
|||
11/29 |
Lecture #26
:
Project Presentations (online session: 4:00 - 6:15 PM) |
|||
12/11 | Project Report Due |