RI: Small: Towards Optimal and Adaptive Reinforcement Learning with Offline Data and Limited Adaptivity |
Principal Investigator Yu-Xiang Wang, University of California at Santa Barbara |
Project Summary Products |
Funded by NSF RI 2007117.
This material is based upon work supported by the National
Science Foundation under Grant No. 2007117. Any opinions, findings, and
conclusions or recommendations expressed in this material are those of the
author(s) and do not necessarily reflect the views of the National Science
Foundation.
Reinforcement learning (RL) is one of the fastest-growing research areas in machine learning. RL-based techniques have led to several recent breakthroughs in artificial intelligence, such as beating human champions in the game of Go. The application of RL to real life problems, however, remains limited, even in areas where a large amount of data has already been collected. The crux of the problem is that most existing RL methods require an environment for the agent to interact with, but in real-life applications, it is rarely possible to have access to such an environment — deploying an algorithm that learns by trial-and-errors may have serious legal, ethical and safety issues. This project aims to address this conundrum by developing algorithms that learn from offline data. The outcome of the research could significantly reduce the overhead of using RL techniques in real-life sequential decision-making problems such as those in power transmission, personalized medicine, scientific discoveries, computer networking and public policy.
Talks
INFORMS'22.
AI For Ukraine.
Berkeley Simons Institute Workshop on RL from Batch Data and Simultation. [Slides]
RL Theory Seminar [Slides, Video]
Manuscript. [arxiv]
Manuscript. [arxiv]
Manuscript [arxiv]
UAI 2022. [arxiv]
ICML 2022. [arxiv]
ICLR 2022. [openreview]
NeurIPS 2021. [arxiv]
NeurIPS 2021. [arxiv]
NeurIPS 2021. [arxiv]
AISTATS 2021. (*Plenary oral presentation) [arxiv]
AISTATS 2020. [arxiv]
NeurIPS 2019. [arxiv]
NeurIPS 2019. [arxiv]
Instructor: Yu-Xiang Wang, 2021 Spring [ Course website ]
Nine teams of student presentations. [ Website ]
[ Project webpage]