Seminars and Colloquia by Series

Generalization Bounds for Sparse Random Feature Expansions

Series
Applied and Computational Mathematics Seminar
Time
Monday, November 8, 2021 - 14:00 for 1 hour (actually 50 minutes)
Location
https://bluejeans.com/457724603/4379
Speaker
Giang Tran, University of Waterloo

Random feature methods have been successful in various machine learning tasks, are easy to compute, and come with theoretical accuracy bounds. They serve as an alternative approach to standard neural networks since they can represent similar function spaces without a costly training phase. However, for accuracy, random feature methods require more measurements than trainable parameters, limiting their use for data-scarce applications or problems in scientific machine learning. This paper introduces the sparse random feature expansion to obtain parsimonious random feature models. Specifically, we leverage ideas from compressive sensing to generate random feature expansions with theoretical guarantees even in the data-scarce setting. We provide generalization bounds for functions in a certain class (that is dense in a reproducing kernel Hilbert space) depending on the number of samples and the distribution of features. The generalization bounds improve with additional structural conditions, such as coordinate sparsity, compact clusters of the spectrum, or rapid spectral decay. We show that the sparse random feature expansions outperform shallow networks in several scientific machine learning tasks. Applications to signal decompositions for music data, astronomical data, and various complicated signals are also provided.
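
As a rough illustration of the approach described above, the hedged sketch below fits a sparse random feature model with off-the-shelf tools: random Fourier features combined with an l1-penalized least-squares solve, in the spirit of compressive sensing. The toy target function, feature counts, and the use of scikit-learn's Lasso are illustrative assumptions, not the authors' exact algorithm.

```python
# Minimal sketch of a sparse random feature fit in the data-scarce regime (m << n_feat).
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)

def target(x):                              # toy low-complexity target function (assumed)
    return np.sin(3 * x[:, 0]) + 0.5 * np.cos(2 * x[:, 1])

d, n_feat, m = 5, 500, 60                   # dimension, #random features, #samples
X = rng.uniform(-1, 1, size=(m, d))
y = target(X)

omega = rng.normal(size=(n_feat, d))        # random frequencies
b = rng.uniform(0, 2 * np.pi, size=n_feat)  # random phases
A = np.cos(X @ omega.T + b)                 # m x n_feat random feature matrix

model = Lasso(alpha=1e-3, max_iter=50000).fit(A, y)   # l1 solve -> sparse coefficients
print("nonzero features:", np.count_nonzero(model.coef_), "of", n_feat)

X_test = rng.uniform(-1, 1, size=(1000, d))
pred = model.predict(np.cos(X_test @ omega.T + b))
print("relative test error:",
      np.linalg.norm(pred - target(X_test)) / np.linalg.norm(target(X_test)))
```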

The Heavy-Tail Phenomenon in SGD

Series
Applied and Computational Mathematics Seminar
Time
Monday, October 18, 2021 - 14:00 for 1 hour (actually 50 minutes)
Location
Skiles 005 and https://bluejeans.com/457724603/4379
Speaker
Lingjiong Zhu, FSU

Please Note: The speaker will be in person, but there will also be a remote option at https://bluejeans.com/457724603/4379

In recent years, various notions of capacity and complexity have been proposed for characterizing the generalization properties of stochastic gradient descent (SGD) in deep learning. Some of the popular notions that correlate well with the performance on unseen data are (i) the flatness of the local minimum found by SGD, which is related to the eigenvalues of the Hessian, (ii) the ratio of the stepsize to the batch-size, which essentially controls the magnitude of the stochastic gradient noise, and (iii) the tail-index, which measures the heaviness of the tails of the network weights at convergence. In this paper, we argue that these three seemingly unrelated perspectives for generalization are deeply linked to each other. We claim that depending on the structure of the Hessian of the loss at the minimum, and the choices of the algorithm parameters, the distribution of the SGD iterates will converge to a heavy-tailed stationary distribution. We rigorously prove this claim in the setting of quadratic optimization: we show that even in a simple linear regression problem with independent and identically distributed data whose distribution has finite moments of all orders, the iterates can be heavy-tailed with infinite variance. We further characterize the behavior of the tails with respect to algorithm parameters, the dimension, and the curvature. We then translate our results into insights about the behavior of SGD in deep learning. We support our theory with experiments conducted on synthetic data, fully connected, and convolutional neural networks. This is based on joint work with Mert Gurbuzbalaban and Umut Simsekli.
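
To make the heavy-tail claim concrete, here is a hedged toy simulation (not the paper's experiments): SGD with batch size one on one-dimensional linear regression with i.i.d. Gaussian data. The iterates obey a multiplicative (Kesten-type) recursion, so for a large enough stepsize-to-batch-size ratio the stationary distribution develops heavy tails; a crude Hill estimator gauges the tail index. All constants are illustrative assumptions.

```python
# SGD iterates on 1-d linear regression become heavy-tailed for large stepsizes.
import numpy as np

rng = np.random.default_rng(0)
eta, n_iter, n_runs = 0.7, 2000, 20000      # stepsize, iterations, parallel SGD runs
theta_star = 1.0                            # true regression coefficient

# batch size 1: theta <- theta - eta*(theta*x - y)*x, a Kesten-type recursion in theta
theta = np.zeros(n_runs)
for _ in range(n_iter):
    x = rng.normal(size=n_runs)
    y = theta_star * x + rng.normal(size=n_runs)
    theta -= eta * (theta * x - y) * x

# crude Hill estimator of the tail index from the k largest deviations
dev = np.sort(np.abs(theta - theta_star))
k = 500
alpha_hat = 1.0 / np.mean(np.log(dev[-k:] / dev[-k - 1]))
print("estimated tail index:", alpha_hat)   # values below 2 indicate infinite variance
```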

High-Order Multirate Explicit Time-Stepping Schemes for the Baroclinic-Barotropic Split Dynamics in Primitive Equations

Series
Applied and Computational Mathematics Seminar
Time
Monday, October 4, 2021 - 14:00 for 1 hour (actually 50 minutes)
Location
online
Speaker
Lili Ju, University of South Carolina

To treat the multiple time scales of ocean dynamics in an efficient manner, the baroclinic-barotropic splitting technique has been widely used for solving the primitive equations for ocean modeling. In this paper, we propose second and third-order multirate explicit time-stepping schemes for such split systems based on the strong stability-preserving Runge-Kutta (SSPRK) framework. Our method allows for a large time step to be used for advancing the three-dimensional (slow) baroclinic mode and a small time step for the two-dimensional (fast) barotropic mode, so that each of the two mode solves only needs to satisfy its respective CFL condition to maintain numerical stability. It is well known that the SSPRK method achieves high-order temporal accuracy by utilizing a convex combination of forward-Euler steps. At each time step of our method, the baroclinic velocity is first computed by using the SSPRK scheme to advance the baroclinic-barotropic system with the large time step, then the barotropic velocity is specially corrected by using the same SSPRK scheme with the small time step to advance the barotropic subsystem with a barotropic forcing interpolated based on values from the preceding baroclinic solves. Finally, the fluid thickness and the sea surface height perturbation are updated by coupling the predicted baroclinic and barotropic velocities. Two benchmark tests drawn from the "MPAS-Ocean" platform are used to numerically demonstrate the accuracy and parallel performance of the proposed schemes.

 

The bluejeans link for the seminar is https://bluejeans.com/457724603/4379
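
As a highly simplified, hedged sketch of the multirate idea (not the paper's baroclinic-barotropic scheme), the code below advances a slow variable with one large SSPRK3 step while a fast variable takes many small SSPRK3 substeps with the slow forcing held fixed; the paper instead interpolates the barotropic forcing from the preceding baroclinic solves. The linear fast-slow test system and all constants are assumptions for illustration.

```python
# Two-rate SSPRK3 time stepping for a linear fast-slow toy system.
import numpy as np

def ssprk3(y, rhs, dt):
    """One step of the three-stage strong-stability-preserving Runge-Kutta scheme."""
    y1 = y + dt * rhs(y)
    y2 = 0.75 * y + 0.25 * (y1 + dt * rhs(y1))
    return y / 3.0 + 2.0 / 3.0 * (y2 + dt * rhs(y2))

lam_slow, lam_fast, c = -1.0, -200.0, 0.5
f_slow = lambda u, v: lam_slow * u + c * v            # slow ("baroclinic-like") tendency
f_fast = lambda v, u: lam_fast * v + c * u            # fast ("barotropic-like") tendency

u, v = 1.0, 0.5
dt, M, T = 0.01, 20, 1.0                              # large step, substeps per step, final time
for _ in range(int(T / dt)):
    u_new = ssprk3(u, lambda w: f_slow(w, v), dt)     # slow mode: one large SSPRK3 step
    for _ in range(M):                                # fast mode: M small SSPRK3 substeps,
        v = ssprk3(v, lambda w: f_fast(w, u), dt / M) # slow forcing held fixed over the step
    u = u_new
print("u(T), v(T) =", u, v)
```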

Nonlinear model reduction for slow-fast stochastic systems near unknown invariant manifolds

Series
Applied and Computational Mathematics Seminar
Time
Monday, September 27, 2021 - 14:00 for 1 hour (actually 50 minutes)
Location
https://bluejeans.com/457724603/4379
Speaker
Felix Ye, SUNY Albany

We introduce a nonlinear stochastic model reduction technique for high-dimensional stochastic dynamical systems that have a low-dimensional invariant effective manifold with slow dynamics, and high-dimensional, large fast modes. Given only access to a black box simulator from which short bursts of simulation can be obtained, we design an algorithm that outputs an estimate of the invariant manifold, a process of the effective stochastic dynamics on it, which has averaged out the fast modes, and a simulator thereof. This simulator is efficient in that it exploits the low dimension of the invariant manifold, and takes time steps of size dependent on the regularity of the effective process, and therefore typically much larger than that of the original simulator, which had to resolve the fast modes. The algorithm and the estimation can be performed on-the-fly, leading to efficient exploration of the effective state space, without losing consistency with the underlying dynamics. This construction enables fast and efficient simulation of paths of the effective dynamics, together with estimation of crucial features and observables of such dynamics, including the stationary distribution, identification of metastable states, and residence times and transition rates between them. 
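
A conceptual, hedged sketch of the burst-based idea (not the authors' full algorithm): from a base point we launch many short bursts of a black-box slow-fast SDE simulator, use PCA of the burst endpoints to identify the local slow direction, and read off an effective drift and diffusion along it. The toy double-well simulator and every constant below are assumptions.

```python
# Burst-based local estimation of the slow direction and effective dynamics.
import numpy as np

rng = np.random.default_rng(1)
eps, sigma_slow = 1e-3, 0.3

def simulate_burst(z0, t_burst, dt=1e-4):
    """Black-box Euler-Maruyama simulator: slow double-well x, fast y relaxing to x."""
    x, y = z0
    for _ in range(int(t_burst / dt)):
        x += -x * (x**2 - 1) * dt + sigma_slow * np.sqrt(dt) * rng.normal()
        y += -(y - x) / eps * dt + 0.05 * np.sqrt(dt) * rng.normal()
    return np.array([x, y])

z0, t_burst, n_bursts = np.array([0.5, 0.5]), 0.05, 400
ends = np.array([simulate_burst(z0, t_burst) for _ in range(n_bursts)])

# PCA of the burst endpoints: the dominant direction approximates the local slow coordinate
_, _, Vt = np.linalg.svd(ends - ends.mean(axis=0), full_matrices=False)
slow_dir = Vt[0]

proj = (ends - z0) @ slow_dir
print("estimated slow direction:", slow_dir)
print("effective drift:", proj.mean() / t_burst, " effective diffusion:", proj.var() / t_burst)
```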

Inference, Computation, and Games

Series
Applied and Computational Mathematics Seminar
Time
Monday, September 20, 2021 - 14:00 for 1 hour (actually 50 minutes)
Location
Skiles 005 and https://bluejeans.com/457724603/4379
Speaker
Florian Schaefer, GT CSE

Please Note: Note the hybrid mode. The speaker will be in person in Skiles 005.

In this talk, we develop algorithms for numerical computation, based on ideas from competitive games and statistical inference. 

 

In the first part, we propose competitive gradient descent (CGD) as a natural generalization of gradient descent to saddle point problems and general sum games. Whereas gradient descent minimizes a local linear approximation at each step, CGD uses the Nash equilibrium of a local bilinear approximation. Explicitly accounting for agent-interaction significantly improves the convergence properties, as demonstrated in applications to GANs, reinforcement learning, and computer graphics.
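
For concreteness, here is a hedged numerical sketch of the CGD update on a zero-sum bilinear game f(x, y) = x^T A y, where x minimizes and y maximizes: each player's step is the Nash equilibrium of the locally regularized bilinear game, available in closed form for this toy problem. The problem size, step size, and random matrix A are illustrative assumptions.

```python
# CGD on a zero-sum bilinear game; the iterates contract toward the equilibrium (0, 0).
import numpy as np

rng = np.random.default_rng(0)
n, eta, steps = 4, 0.2, 200
A = rng.normal(size=(n, n))
x, y = rng.normal(size=n), rng.normal(size=n)
I = np.eye(n)

for _ in range(steps):
    gx, gy = A @ y, A.T @ x                    # gradients of f with respect to x and y
    dx = -eta * np.linalg.solve(I + eta**2 * A @ A.T, gx + eta * A @ gy)
    dy = +eta * np.linalg.solve(I + eta**2 * A.T @ A, gy - eta * A.T @ gx)
    x, y = x + dx, y + dy                      # simultaneous update of both players

print("||x||, ||y|| after CGD:", np.linalg.norm(x), np.linalg.norm(y))
# plain simultaneous gradient descent-ascent spirals outward on the same game,
# whereas accounting for the interaction term keeps the iterates bounded.
```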

 

In the second part, we show that the conditional near-independence properties of smooth Gaussian processes imply the near-sparsity of Cholesky factors of their dense covariance matrices. We use this insight to derive simple, fast solvers with state-of-the-art complexity vs. accuracy guarantees for general elliptic differential and integral equations. Our methods come with rigorous error estimates, are easy to parallelize, and show good performance in practice.
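
The following hedged sketch (an illustration, not the authors' near-linear-complexity solver) orders sample points by a greedy maximin / farthest-point rule, forms an exponential (Matern-1/2) kernel matrix, and counts how many entries of its dense Cholesky factor exceed a small threshold; under such coarse-to-fine orderings the factor is observed to be close to sparse. The point set, kernel, and threshold are assumptions.

```python
# Near-sparsity of the Cholesky factor of a kernel matrix under a maximin ordering.
import numpy as np

rng = np.random.default_rng(0)
n, length_scale = 400, 0.5
pts = rng.uniform(0, 1, size=(n, 2))

# greedy maximin ordering: each new point maximizes its distance to those already chosen
order = [0]
dist = np.linalg.norm(pts - pts[0], axis=1)
for _ in range(n - 1):
    nxt = int(np.argmax(dist))
    order.append(nxt)
    dist = np.minimum(dist, np.linalg.norm(pts - pts[nxt], axis=1))
pts = pts[order]

D = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
K = np.exp(-D / length_scale) + 1e-10 * np.eye(n)      # Matern-1/2 (exponential) kernel
L = np.linalg.cholesky(K)

thresh = 1e-6 * np.abs(L).max()
frac = np.count_nonzero(np.abs(L) > thresh) / (n * (n + 1) / 2)
print("fraction of Cholesky entries above threshold:", frac)
```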

Incorporating Symmetry for Improved Deep Dynamics Learning

Series
Applied and Computational Mathematics Seminar
Time
Monday, September 13, 2021 - 14:00 for 1 hour (actually 50 minutes)
Location
https://bluejeans.com/457724603/4379
Speaker
Prof. Rose Yu, UCSD

While deep learning has been used for dynamics learning, limited physical accuracy and an inability to generalize under distributional shift limit its applicability to the real world. In this talk, I will demonstrate how to incorporate symmetries into deep neural networks and significantly improve the physical consistency, sample efficiency, and generalization in learning dynamics. I will showcase the applications of these models to challenging problems such as turbulence forecasting and trajectory prediction for autonomous vehicles.

Some problems in point-set registration

Series
Applied and Computational Mathematics Seminar
Time
Monday, April 26, 2021 - 14:00 for 1 hour (actually 50 minutes)
Location
https://bluejeans.com/884917410
Speaker
Yuehaw Khoo, University of Chicago

In this talk, we discuss variants of the rigid registration problem, i.e., aligning objects via rigid transformations. In the simplest scenario of point-set registration where the correspondence between points is known, we investigate the robustness of registration to outliers. We also study a convex programming formulation of point-set registration with exact recovery, in the situation where both the correspondence and alignment are unknown. This talk is based on joint works with Ankur Kapoor, Cindy Orozco, and Lexing Ying.
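
For the simplest scenario mentioned above, rigid registration with known correspondences, a hedged sketch of the closed-form orthogonal Procrustes / Kabsch solution is given below (SVD of the centered cross-covariance); outlier robustness and the convex formulation for unknown correspondences are beyond this snippet. The synthetic point sets and noise level are assumptions.

```python
# Rigid registration with known correspondences via the Kabsch / Procrustes construction.
import numpy as np

rng = np.random.default_rng(0)
n = 50
P = rng.normal(size=(n, 3))                               # source point set

# ground-truth rigid motion (rotation R_true, translation t_true) plus small noise
R_true, _ = np.linalg.qr(rng.normal(size=(3, 3)))
if np.linalg.det(R_true) < 0:
    R_true[:, 0] *= -1                                    # enforce a proper rotation
t_true = np.array([0.3, -1.0, 2.0])
Q = P @ R_true.T + t_true + 0.01 * rng.normal(size=(n, 3))

# Kabsch: center both sets, take the SVD of the cross-covariance, fix the determinant sign
Pc, Qc = P - P.mean(axis=0), Q - Q.mean(axis=0)
U, _, Vt = np.linalg.svd(Pc.T @ Qc)
S = np.diag([1.0, 1.0, np.sign(np.linalg.det(Vt.T @ U.T))])
R_hat = Vt.T @ S @ U.T
t_hat = Q.mean(axis=0) - R_hat @ P.mean(axis=0)

print("rotation error:", np.linalg.norm(R_hat - R_true))
print("translation error:", np.linalg.norm(t_hat - t_true))
```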
 

Symmetrically processed splitting integrators for enhanced Hamiltonian Monte Carlo sampling

Series
Applied and Computational Mathematics Seminar
Time
Monday, April 19, 2021 - 14:00 for 1 hour (actually 50 minutes)
Location
ONLINE https://bluejeans.com/884917410
Speaker
Prof. Sergio Blanes, Universidad Politécnica de Valencia

We construct integrators to be used in Hamiltonian (or Hybrid) Monte Carlo sampling. The new integrators are easily implementable and, for a given computational budget, may deliver five times as many accepted proposals as standard leapfrog/Verlet without impairing in any way the quality of the samples. They are based on a suitable modification of the processing technique first introduced by J.C. Butcher. The idea of modified processing may also be useful for other purposes, like the construction of high-order splitting integrators with positive coefficients.

Joint work with Mari Paz Calvo, Fernando Casas, and Jesús M. Sanz-Serna
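
As a point of reference for the baseline mentioned in the abstract, here is a hedged sketch of Hamiltonian Monte Carlo with the standard leapfrog/Verlet integrator; the talk's processed splitting integrators would replace the leapfrog routine below. The Gaussian target, step size, and trajectory length are illustrative assumptions.

```python
# Baseline HMC with a leapfrog/Verlet integrator on a standard Gaussian target.
import numpy as np

rng = np.random.default_rng(0)

def U(q):                                  # potential energy of the target
    return 0.5 * np.dot(q, q)

def grad_U(q):
    return q

def leapfrog(q, p, eps, n_steps):
    """Velocity-Verlet / leapfrog integration of Hamilton's equations."""
    p = p - 0.5 * eps * grad_U(q)          # initial half kick
    for _ in range(n_steps - 1):
        q = q + eps * p                    # drift
        p = p - eps * grad_U(q)            # full kick
    q = q + eps * p
    p = p - 0.5 * eps * grad_U(q)          # final half kick
    return q, p

def hmc_step(q, eps=0.2, n_steps=20):
    p = rng.normal(size=q.shape)
    H0 = U(q) + 0.5 * np.dot(p, p)
    q_new, p_new = leapfrog(q, p, eps, n_steps)
    H1 = U(q_new) + 0.5 * np.dot(p_new, p_new)
    return q_new if rng.random() < np.exp(H0 - H1) else q   # Metropolis accept/reject

q, samples = np.zeros(5), []
for _ in range(2000):
    q = hmc_step(q)
    samples.append(q)
samples = np.array(samples)
print("sample mean:", samples.mean(axis=0))
print("average sample variance (target is 1):", samples.var(axis=0).mean())
```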

Optimization in the space of probabilities with MCMC: Uncertainty quantification and sequential decision making

Series
Applied and Computational Mathematics Seminar
Time
Monday, April 5, 2021 - 14:00 for 1 hour (actually 50 minutes)
Location
ONLINE https://bluejeans.com/884917410
Speaker
Prof. Yian Ma, UCSD

I will present MCMC algorithms as optimization over the KL-divergence in the space of probabilities. By incorporating a momentum variable, I will discuss an algorithm which performs accelerated gradient descent over the KL-divergence. Using optimization-like ideas, a suitable Lyapunov function is constructed to prove that an accelerated convergence rate is obtained. I will then discuss how MCMC algorithms compare against variational inference methods in parameterizing the gradient flows in the space of probabilities and how this applies to sequential decision making problems.
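
A hedged sketch of the viewpoint described above: the overdamped Langevin (ULA) update discretizes the Wasserstein gradient flow of the KL-divergence to the target, and adding a momentum variable with friction yields an underdamped (kinetic) Langevin sampler, a rough stand-in for the accelerated scheme in the talk. The Gaussian target, step sizes, and friction coefficient are assumptions, and the discretizations below are the simplest Euler-type ones.

```python
# Overdamped vs. momentum-augmented Langevin sampling of a Gaussian target.
import numpy as np

rng = np.random.default_rng(0)
d, n_particles, n_steps, h = 2, 2000, 500, 0.05
Sigma_inv = np.diag([1.0, 4.0])                 # target pi ~ N(0, diag(1, 0.25))

def grad_log_pi(x):
    return -x @ Sigma_inv

# overdamped Langevin (ULA): x <- x + h*grad log pi(x) + sqrt(2h)*noise
x = rng.normal(size=(n_particles, d))
for _ in range(n_steps):
    x = x + h * grad_log_pi(x) + np.sqrt(2 * h) * rng.normal(size=x.shape)

# underdamped Langevin: add a momentum variable v with friction gamma
gamma = 2.0
y, v = rng.normal(size=(n_particles, d)), np.zeros((n_particles, d))
for _ in range(n_steps):
    v = v + h * (grad_log_pi(y) - gamma * v) + np.sqrt(2 * gamma * h) * rng.normal(size=y.shape)
    y = y + h * v

print("ULA sample variances:       ", np.var(x, axis=0))
print("underdamped sample variances:", np.var(y, axis=0))
print("target variances:           ", [1.0, 0.25])
```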

Incorporating Invariance to Reduce the Complexity of Parametric Models

Series
Applied and Computational Mathematics Seminar
Time
Monday, March 29, 2021 - 14:00 for 1 hour (actually 50 minutes)
Location
https://bluejeans.com/884917410
Speaker
Alex Cloninger, University of California, San Diego

Many scientific problems involve invariant structures, and learning functions that rely on a much lower dimensional set of features than the data itself. Incorporating these invariances into a parametric model can significantly reduce the model complexity, and lead to a vast reduction in the number of labeled examples required to estimate the parameters. We display this benefit in two settings. The first setting concerns ReLU networks, and the size of networks and number of points required to learn certain functions and classification regions. Here, we assume that the target function has built in invariances, namely that it only depends on the projection onto a very low-dimensional, function-defined manifold (with dimension possibly significantly smaller than even the intrinsic dimension of the data). We use this manifold variant of a single- or multi-index model to establish network complexity and ERM rates that beat even the intrinsic dimension of the data. We should note that a corollary of this result is developing intrinsic rates for a manifold-plus-noise data model without needing to assume the distribution of the noise decays exponentially, and we also discuss implications in two-sample testing and statistical distances. The second setting for building invariances concerns linearized optimal transport (LOT), and using it to build supervised classifiers on distributions. Here, we construct invariances to families of group actions (e.g., shifts and scalings of a fixed distribution), and show that LOT can learn a classifier on group orbits using a simple linear separator. We demonstrate the benefit of this on MNIST by constructing robust classifiers with only a small number of labeled examples. This talk covers joint work with Timo Klock, Xiuyuan Cheng, and Caroline Moosmueller.
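
A one-dimensional, hedged toy version of the LOT construction described above (assumptions throughout, not the authors' setup): with a uniform reference measure, the LOT embedding of a distribution is just its quantile function on a fixed grid; shifts and scalings act affinely on that embedding, so a plain linear classifier separates the two group orbits from only a handful of labeled examples.

```python
# 1-d LOT embeddings (empirical quantile functions) plus a linear classifier on group orbits.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
quantile_grid = np.linspace(0.01, 0.99, 50)

def lot_embedding(samples):
    """Empirical quantile function on a fixed grid = 1-d LOT coordinates."""
    return np.quantile(samples, quantile_grid)

def random_orbit_sample(template_sampler, n=300):
    shift, scale = rng.uniform(-2, 2), rng.uniform(0.5, 2.0)
    return scale * template_sampler(n) + shift            # random shift-and-scale action

templates = [lambda n: rng.normal(size=n),                # class 0: Gaussian template
             lambda n: rng.exponential(size=n) - 1.0]     # class 1: skewed template

X, y = [], []
for label, sampler in enumerate(templates):
    for _ in range(20):                                   # only a few labeled examples
        X.append(lot_embedding(random_orbit_sample(sampler)))
        y.append(label)

clf = LogisticRegression(max_iter=5000).fit(np.array(X), np.array(y))
X_test = [lot_embedding(random_orbit_sample(s)) for s in templates for _ in range(100)]
y_test = [0] * 100 + [1] * 100
print("linear classifier accuracy on new orbit samples:", clf.score(np.array(X_test), y_test))
```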
 
