Geometric and Statistical Approaches to Shallow and Deep Clustering

Monday, April 12, 2021 - 2:00pm for 1 hour (actually 50 minutes)
James Murphy – Tufts University – JM.Murphy@tufts.edu
Wenjing Liao

We propose approaches to unsupervised clustering based on data-dependent distances and dictionary learning.  By considering metrics derived from data-driven graphs, robustness to noise and ambient dimensionality is achieved.  Connections to geometric analysis, stochastic processes, and deep learning are emphasized.  The proposed algorithms enjoy theoretical performance guarantees on flexible data models and in some cases guarantees ensuring quasilinear scaling in the number of data points.  Applications to image processing and bioinformatics will be shown, demonstrating state-of-the-art empirical performance.  Extensions to active learning, generative modeling, and computational geometry will be discussed.