浙江大学数据科学研究中心- 【ZJU-CDS Short Courses (2023 Summer Ⅱ)】Reinforcement Learning

教育教学

【ZJU-CDS Short Courses (2023 Summer Ⅱ)】Reinforcement Learning

作者：admin

时间：2023-07-31

阅读量：1870次

This course is about reinforcement learning (RL), covering the fundamental concepts of reinforcement learning framework and solution methods. The focus is on the underlying methodology as well as theory. The course will cover the following topics, including foundations of RL, dynamic programming, Monte Carlo methods, temporal difference learning and Q-learning.

This short course is made based on materials in Sutton and Barto (2018), Puterman (1994), DeepMind & UCL RL lecture series and some research papers. It covers the foundations of reinforcement learning, planning and learning.

The course website is available at https://github.com/callmespring/RL-short-course.

主讲人：

Chengchun Shi，Associate Professor in data science at London School of Economics and Political Science

时间：8月17日至8月19日，9:00-12:00，共3次课

地点：浙江大学数据科学研究中心，行政楼1417

本次短期课程计划从以下方面讨论强化学习：

第一部分：Foundations of Reinforcement Learning

1. Introduction to Reinforcement Learning

1.1 Multi-armed Bandits

1.2 Contextual Bandits

2. Markov Decision Processes

2.1 Time-varying Markov Decision Processes

2.2 Partially Observable Markov Decision Processes

3. Existence of the Optimal Stationary Policy

第二部分：Planning and Learning

1. Preliminaries

2. Planning: Dynamic Programming

2.1 Policy Iteration

2.2 Value Iteration

3. Learning: Monte Carlo Methods

3.1 Monte Carlo Prediction (Policy Evaluation)

3.2 Monte Carlo Control (Policy Optimization)

4. Learning: Temporal Difference Methods

4.1 Temporal Difference Prediction

4.2 Temporal Difference Control: SARSA

第三部分：Q-Learning and Beyond

1. Tabular Q-learning

2. Fitted Q-iteration

3. Case Study 1: Deep Q-networks

4. Case Study 2: Reinforcement Learning for Ridesharing

首页

中心概况

新闻中心

学术交流

科学研究

教育教学

招聘信息

综合服务

联系我们

课程资源