UCSB CS Theory Colloquium Series

Fall 2021

Friday, October 22, 2pm, Phelps 2510 Speaker: Sanjoy Dasgupta (UC San Diego) Title: Some recent theoretical directions in clustering Abstract: I’ll talk about two recent lines of work on clustering that have seen quite a bit of algorithmic progress. 1. Cost functions for hierarchical clustering. The development of algorithms for hierarchical clustering has traditionally been hindered by a lack of precise objective functions. I’ll talk about how this situation has recently been remedied. This will mostly be about the following paper as well as more recent progress by Charikar, Chatziafratis, Cohen-Addad, Mathieu, Moseley, and Wang. A cost function for similarity-based hierarchical clustering. STOC 2016. 2. Explainable k-means clustering. Common clustering algorithms return clusters that are arbitrary subsets of the data, or at best convex regions of space. What is the price of asking for simpler (and therefore more “explainable”) clusters, e.g. hyperrectangular and based on just a few features? This will mostly be about the following paper as well as more recent work by Laber, Makarychev, Mirrokni, Svensson, and others. (With Cyrus Rashtchian, Michal Moshkovitz, and Nave Frost) Explainable k-means and k-medians clustering. ICML 2020. |