Center for Data Science, Zhejiang University - Statistical theory of deep learning

Statistical theory of deep learning

Author: admin

Date: 2026-04-13

Recently, a lot of progress has been made in the theoretical understanding of deep artificial neural networks. One very promising direction is the statistical approach, which interprets deep learning as a statistical method and builds on existing techniques in mathematical statistics to derive theoretical error bounds and to understand novel phenomena such as benign overfitting and the regularising effect of dropout. The lecture surveys this field and describes future challenges.

Preliminary outline:

Lecture 1 (from approximation to generalisation bounds): Universal approximation theorem, approximation rates for shallow neural networks, Barron spaces, advantages of additional hidden layers, deep ReLU networks.

Lecture 2 (theory of gradient descent in machine learning): Optimization in machine learning, weight balancing phenomenon, analysis of dropout, benign overfitting, grokking.

Course slides:

https://jschmidthieber.personalweb.utwente.nl/hangz.pdf

Resources:

https://mjt.cs.illinois.edu/dlt/

https://www.cs.princeton.edu/courses/archive/fall19/cos597B/lecnotes/bookdraft.pdf

https://www.di.ens.fr/%7Efbach/ltfp_book.pdf

For questions, please contact a.j.schmidt-hieber@utwente.nl