Online Learning and Bandit Problems with Applications to Revenue Management

讲座简介

1. Multi-armed Bandit Framework

2. The Upper Confidence Bound (UCB) Method

3. Elementary Improvements for Both Regret Minimization and Pure Exploration Settings

4. The Law-of-Iterated-Logarithm UCB Algorithm

5. Linear Contextual Bandits

6. Generalized Linear Models and Generalized Linear Contextual Bandits

7. Dynamic Assortment Optimization and the Multilinear Logit Bandit Problem

8. (If Time Permits) Zeroth Order Convex Optimization

主讲人

Yuan Zhou, Indiana University at Bloomington

Dr. Yuan Zhou is currently an Assistant Professor at the Computer Science Department of Indiana University at Bloomington. He is also a Visiting Assistant Professor at Shanghai University of Finance and Economics and an Adjunct Assistant Professor at University of Illinois at Urbana-Champaign. Before joining Indiana University, Yuan was an Applied Mathematics Instructor at the Mathematics Department of Massachusetts Institute of Technology. Prior to MIT, Yuan was the recipient of the Simons Graduate Fellowship and obtained his Ph.D. in Computer Science at Carnegie Mellon University. He also ranked the 1st in the International Olympiad in Informatics and the 2nd in the World Finals of ACM International Collegiate Programming Contest.

Yuan’s research interests include stochastic and combinatorial optimizations and their applications to operations management and machine learning. He is also interested in and publishes on analysis of mathematical programming, approximation algorithms, and hardness of approximation.

时间

2018年7月11日，7月13日，7月22日，7月27日，7月29日，8月5日

上午9:00

地点

信息管理与工程学院104室
上海财经大学（第三教学楼西侧）
上海市杨浦区武东路100号

时间

2018年7月15日,8月3日

上午9:00

地点

信息管理与工程学院102室
上海财经大学（第三教学楼西侧）
上海市杨浦区武东路100号