Seminars and Colloquia by Series

Series

Pseudo-Maximum Likelihood Theory for High-Dimension Rank-One Inference

Series: Stochastics Seminar
Time: Thursday, September 19, 2024 - 15:30 for 1 hour (actually 50 minutes)
Location: Skiles 006
Speaker: Justin Ko – University of Waterloo

We consider the task of estimating a rank-one matrix from noisy observations. Models that fall in this framework include community detection and spiked Wigner models. In this talk, I will discuss pseudo-maximum likelihood theory for such inference problems. We provide a variational formula for the asymptotic maximum pseudo-likelihood and characterize the asymptotic performance of pseudo maximum likelihood estimators. We will also discuss the implications of these findings to least squares estimators. Our approach uses the recent connections between statistical inference and statistical physics, and in particular the connection between the maximum likelihood and the ground state of a modified spin glass.

Based on joint work with Curtis Grant and Aukosh Jagannath.

Higher-Order Graphon Theory: Fluctuations, Inference, and Degeneracies

Series: Stochastics Seminar
Time: Thursday, September 12, 2024 - 15:30 for 1 hour (actually 50 minutes)
Location: Skiles 006
Speaker: Bhaswar Bhattacharya – University of Pennsylvania

Motifs (patterns of subgraphs), such as edges and triangles, encode important structural information about the geometry of a network. Consequently, counting motifs in a large network is an important statistical and computational problem. In this talk we will consider the problem of estimating motif densities and fluctuations of subgraph counts in an inhomogeneous random graph sampled from a graphon. We will show that the limiting distributions of subgraph counts can be Gaussian or non-Gaussian, depending on a notion of regularity of subgraphs with respect to the graphon. Using these results and a novel multiplier bootstrap for graphons, we will construct joint confidence sets for the motif densities. Finally, we will discuss various structure theorems and open questions about degeneracies of the limiting distribution and connections to quasirandom graphs.

Joint work with Anirban Chatterjee, Soham Dan, and Svante Janson

Large deviations for triangles in random graphs in the critical regime

Series: Stochastics Seminar
Time: Thursday, September 5, 2024 - 15:30 for 1 hour (actually 50 minutes)
Location: Skiles 006
Speaker: Will Perkins – Georgia Tech

A classic problem in probability theory and combinatorics is to estimate the probability that the random graph G(n,p) contains no triangles. This problem can be viewed as a question in "non-linear large deviations". The asymptotics of the logarithm of this probability (and related lower tail probabilities) are known in two distinct regimes. When p>> 1/\sqrt{n}, at this level of accuracy the probability matches that of G(n,p) being bipartite; and when p<< 1/\sqrt{n}, Janson's Inequality gives the asymptotics of the log. I will discuss a new approach to estimating this probability in the "critical regime": when p = \Theta(1/\sqrt{n}). The approach uses ideas from statistical physics and algorithms and gives information about the typical structure of graphs drawn from the corresponding conditional distribution. Based on joint work with Matthew Jenssen, Aditya Potukuchi, and Michael Simkin.

Estimation of trace functionals of covariance operators

Series: Stochastics Seminar
Time: Thursday, August 29, 2024 - 15:30 for 1 hour (actually 50 minutes)
Location: Skiles 006
Speaker: Vladimir Koltchinskii – Georgia Tech

We will discuss a problem of estimation of functionals of the form $\tau_f(\Sigma):= {\rm tr} (f(\Sigma))$ of unknown covariance operator $\Sigma$ of a centered Gaussian random variable $X$ in a separable Hilbert space ${\mathbb H}$ based on i.i.d. observation $X_1,\dots, X_n$ of $X,$ where $f:{\mathbb R}\mapsto {\mathbb R}$ is a given function. A naive plug-in estimator $\tau_f(\hat \Sigma_n)$ based on the sample covariance operator $\hat \Sigma_n$ has a large bias and bias reduction methods are needed to construct estimators with better error rates. We develop estimators with reduced bias based on linear aggregation of several plug-in estimators with different sample sizes and obtain the error bounds for such estimators with explicit dependence on the sample size $n,$ the effective rank ${\bf r}(\Sigma)= \frac{tr(\Sigma)}{\|\Sigma\|}$ of covariance operator $\Sigma$ and the degree of smoothness of function $f.$

Asymptotic mutual information for quadratic estimation problems over compact groups

Series: Stochastics Seminar
Time: Thursday, August 22, 2024 - 15:30 for 1 hour (actually 50 minutes)
Location: Skiles 006
Speaker: Timothy Wee – Georgia Tech

Motivated by applications to group synchronization and quadratic assignment on random data, we study a general problem of Bayesian inference of an unknown “signal” belonging to a high-dimensional compact group, given noisy pairwise observations of a featurization of this signal.

We establish a quantitative comparison between the signal-observation mutual information in any such problem with that in a simpler model with linear observations, using interpolation methods. For group synchronization, our result proves a replica formula for the asymptotic mutual information and Bayes-optimal mean-squared error. Via analyses of this replica formula, we show that the conjectural phase transition threshold for computationally-efficient weak recovery of the signal is determined by a classification of the real-irreducible components of the observed group representation(s), and we fully characterize the information-theoretic limits of estimation in the example of angular/phase synchronization over SO(2)/U(1). For quadratic assignment, we study observations given by a kernel matrix of pairwise similarities and a randomly permuted and noisy counterpart, and we show in a bounded signal-to-noise regime that the asymptotic mutual information coincides with that in a Bayesian spiked model with i.i.d. signal prior.

This is based on joint work with Kaylee Yang and Zhou Fan.

Max-sliced Wasserstein distances

Series: Stochastics Seminar
Time: Thursday, April 25, 2024 - 15:30 for 1 hour (actually 50 minutes)
Location: Skiles 006
Speaker: March Boedihardjo – Michigan State University – boedihar@msu.edu

I will give essentially matching upper and lower bounds for the expected max-sliced 1-Wasserstein distance between a probability measure on a separable Hilbert space and its empirical distribution from n samples. A version of this result for Banach spaces will also be presented. From this, we will derive an upper bound for the expected max-sliced 2-Wasserstein distance between a symmetric probability measure on a Euclidean space and its symmetrized empirical distribution.

Branching Brownian motion and the road-field model

Series: Stochastics Seminar
Time: Thursday, April 18, 2024 - 15:30 for 1 hour (actually 50 minutes)
Location: Skiles 006
Speaker: Nick Cook – Duke University – nickcook@math.duke.edu

The Fisher-KPP equation was introduced in 1937 to model the spread of an advantageous gene through a spatially distributed population. Remarkably precise information on the traveling front has been obtained via a connection with branching Brownian motion, beginning with works of McKean and Bramson in the 70s. I will discuss an extension of this probabilistic approach to the Road-Field Model: a reaction-diffusion PDE system introduced by H. Berestycki et al. to describe enhancement of biological invasions by a line of fast diffusion, such as a river or a road. Based on joint work with Amir Dembo.

From Ehrhard to Generalized Bobkov inequality, and more

Series: Stochastics Seminar
Time: Thursday, April 11, 2024 - 15:30 for 1 hour (actually 50 minutes)
Location: Skiles 006
Speaker: Galyna Livshyts – Georgia Tech – glivshyts6@gatech.edu

We discuss a general scheme that allows to realize certain geometric functional inequalities as statements about convexity of some functionals, and, inspired by the work of Bobkov and Ledoux, we obtain various interesting inequalities as their realizations. For example, we draw a link between Ehrhard’s inequality and an interesting extension of Bobkov’s inequality, and several new and more general inequalities are discussed as well. In this talk we discuss a joint project with Barthe, Cordero-Erausquin and Ivanisvili, and also mention briefly some results from a joint project with Cordero-Erausquin and Rotem.

Local vs Non-Local Poincar\'e Inequalities and Quantitative Exponential Concentration

Series: Stochastics Seminar
Time: Thursday, April 4, 2024 - 15:30 for 1 hour (actually 50 minutes)
Location: Skiles 006
Speaker: Christian Houdré – Georgia Institute of Technology

Weighted Poincar\'e inequalities known for various laws such as the exponential or Cauchy ones are shown to follow from the "usual" Poincar\'e inequality involving the non-local gradient. A key ingredient in showing so is a covariance representation and Hardy's inequality.

The framework under study is quite general and comprises infinitely divisible laws as well as some log-concave ones. This same covariance representation is then used to obtain quantitative concentration inequalities of exponential type, recovering in particular the Gaussian results.

Joint Work with Benjamin Arras.

Improving Predictions by Combining Models

Series: Stochastics Seminar
Time: Thursday, March 28, 2024 - 15:30 for 1 hour (actually 50 minutes)
Location: Skiles 006
Speaker: Jason Klusowski – Princeton University

When performing regression analysis, researchers often face the challenge of selecting the best single model from a range of possibilities. Traditionally, this selection is based on criteria evaluating model goodness-of-fit and complexity, such as Akaike's AIC and Schwartz's BIC, or on the model's performance in predicting new data, assessed through cross-validation techniques. In this talk, I will show that a linear combination of a large number of these possible models can have better predictive accuracy than the best single model among them. Algorithms and theoretical guarantees will be discussed, which involve interesting connections to constrained optimization and shrinkage in statistics.

Georgia Institute of Technology College of Sciences

Search form