CS292F (Spring 2021): Mini-Symposium on Statistical RL (June 2, 2021)

Presentation schedule

TimeTeamProject title
1:00 - 1:22Ari Polakof and Chang Yuan LiOffline Reinforcement Learning in Theory and in Practice
1:22 - 1:44Fuheng ZhaoOptimize Join Queries with Deep Reinforcement Learning
1:44 - 2:06Yichen Feng, Mengye Liu, Ming MinProvably Efficient Q-Learning with Low Switching Cost
2:06 - 2:28Avinash Nargund Reinforcement Learning for Radio Network Design
2:28 - 2:50Rohan BhatiaDualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
2:52 - 3:14Yi-lin Tuan, Wanrong ZhuReinforcement Learning for Text Generation
3:14 - 3:36Kaiqi ZhangMinimax OPE for multi-armed bandits
3:36 - 3:58Dheeraj Baby, Ming Yin, Xuandong ZhaoA unifying view of optimism in Episodic Reinforcement Learning
3:58 - 4:20Yuqing Zhu, Jianyu XuIs reinforcement learning more difficult than bandits?