Seminars and Colloquia by Series

Series

Interpretable machine learning with governing law discovery

Series: Applied and Computational Mathematics Seminar
Time: Monday, October 28, 2024 - 14:00 for 1 hour (actually 50 minutes)
Location: Skiles 005 and https://gatech.zoom.us/j/94954654170
Speaker: Mars Gao – University of Washington – marsgao@uw.edu

Spatio-temporal modeling of real-world data presents significant challenges due to high-dimensionality, noisy measurements, and limited data. In this talk, we introduce two frameworks that jointly solve the problems of sparse identification of governing equations and latent space reconstruction: the Bayesian SINDy autoencoder and SINDy-SHRED. The Bayesian SINDy autoencoder leverages a spike-and-slab prior to enable robust discovery of governing equations and latent coordinate systems, providing uncertainty estimates in low-data, high-noise settings. In our experiments, we applied the Bayesian SINDy autoencoder to real video data, marking the first example of learning governing equations directly from such data. This framework successfully identified underlying physical laws, such as accurately estimating constants like gravity from pendulum videos, even in the presence of noise and limited samples.

In parallel, SINDy-SHRED integrates Gated Recurrent Units (GRUs) with a shallow decoder network to model temporal sequences and reconstruct full spatio-temporal fields using only a few sensors. Our proposed algorithm introduces a SINDy-based regularization. Beginning with an arbitrary latent state space, the dynamics of the latent space progressively converges to a SINDy-class functional. We conduct a systematic experimental study including synthetic PDE data, real-world sensor measurements for sea surface temperature, and direct video data. With no explicit encoder, SINDy-SHRED allows for efficient training with minimal hyperparameter tuning and laptop-level computing. SINDy-SHRED demonstrates robust generalization in a variety of applications with minimal to no hyperparameter adjustments. Additionally, the interpretable SINDy model of latent state dynamics enables accurate long-term video predictions, achieving state-of-the-art performance and outperforming all baseline methods considered, including Convolutional LSTM, PredRNN, ResNet, and SimVP.

Approximation of differential operators on unknown manifolds and applications

Series: Applied and Computational Mathematics Seminar
Time: Wednesday, October 16, 2024 - 14:00 for 1 hour (actually 50 minutes)
Location: Skiles 006 and https://gatech.zoom.us/j/98355006347
Speaker: John Harlim – Pennsylvania State University – jharlim@psu.edu

I will discuss the numerical approximation of differential operators on unknown manifolds where the manifolds are identified by a finite sample of point cloud data. While our formulation is general, we will focus on Laplacian operators whose spectral properties are relevant to manifold learning. I will report the spectral convergence results of these formulations with Radial Basis Functions approximation and their strengths/weaknesses in practice. Supporting numerical examples, involving the spectral estimation of various vector Laplacians will be demonstrated. Applications to solve elliptic PDEs will be discussed. To address the practical issue with the RBF approximation, I will discuss a weak approximation with a higher-order local mesh method that not only promotes sparsity but also allows for an estimation of differential operators with nontrivial Cristoffel symbols such as Bochner and Hodge Laplacians.

Data-driven model discovery meets mechanistic modeling for biological systems

Series: Applied and Computational Mathematics Seminar
Time: Monday, October 7, 2024 - 14:00 for 1 hour (actually 50 minutes)
Location: Skiles 005 and https://gatech.zoom.us/j/98355006347
Speaker: Niall M Mangan – Northwestern University – niall.mangan@northwestern.edu

Abstract: Building models for biological, chemical, and physical systems has traditionally relied on domain-specific intuition about which interactions and features most strongly influence a system. Alternatively, machine-learning methods are adept at finding novel patterns in large data sets and building predictive models but can be challenging to interpret in terms of or integrate with existing knowledge. Our group balances traditional modeling with data-driven methods and optimization to get the best of both worlds. Recently developed for and applied to dynamical systems, sparse optimization strategies can select a subset of terms from a library that best describes data, automatically interfering potential model structures from a broad but well-defined class. I will discuss my group's application and development of data-driven methods for model selection to 1) recover chaotic systems models from data with hidden variables, 2) discover models for metabolic and temperature regulation in hibernating mammals, and 3) model selection for differential-algebraic-equations. I'll briefly discuss current preliminary work and roadblocks in developing new methods for model selection of biological metabolic and regulatory networks.

Short Bio: Niall M. Mangan received the Dual BS degrees in mathematics and physics, with a minor in chemistry, from Clarkson University, Potsdam, NY, USA, in 2008, and the PhD degree in systems biology from Harvard University, Cambridge, MA, USA, in 2013. Dr. Mangan worked as a postdoctoral associate in the Photovoltaics Lab at MIT from 2013-2015 and as an Acting Assistant Professor at the University of Washington, Seattle from 2016-2017. She is currently an Assistant Professor of engineering sciences and applied mathematics with Northwestern University, where she works at the interface of mechanistic modeling, machine learning, and statistical inference. Her group applies these methods to many applications including metabolic and regulatory networks to accelerate engineering.

Exploring Conditional Computation in Transformer models

Series: Applied and Computational Mathematics Seminar
Time: Monday, September 30, 2024 - 14:00 for 1 hour (actually 50 minutes)
Location: Skiles 005 and ONLINE
Speaker: Xin Wang – Google Research – xinwangmath@gmail.com

Transformer (Vaswani et al. 2017) architecture is a popular deep learning architecture that today comprises the foundation for most tasks in natural language processing and forms the backbone of all the current state-of-the-art language models. Central to its success is the attention mechanism, which allows the model to weigh the importance of different input tokens. However, Transformers can become computationally expensive, especially for large-scale tasks. To address this, researchers have explored techniques for conditional computation, which selectively activate parts of the model based on the input. In this talk, we present two case studies of conditional computation in Transformer models. In the first case, we examine the routing mechanism in the Mixture-of-Expert (MoE) Transformer models, and show theoretical and empirical evidence that the router’s ability to route intelligently confers a significant advantage to MoE models. In the second case, we introduce Alternating Updates (AltUp), a method to take advantage of increased residual stream width in the Transformer models without increasing the computation cost.

Speaker's brief introduction: Xin Wang is a research engineer in the Algorithms team at Google Research. Xin finished his PhD in Mathematics at Georgia Institute of Technology before coming to Google. Xin's research interests include efficient computing, memory mechanism for machine learning, and optimization.

The talk will be presented online at

https://gatech.zoom.us/j/93087689904

Finding Cheeger cuts via 1-Laplacian of graphs

Series: Applied and Computational Mathematics Seminar
Time: Monday, September 23, 2024 - 14:00 for 1 hour (actually 50 minutes)
Location: Skiles 005
Speaker: Wei Zhu – University of Alabama at Tuscaloosa

Finding Cheeger cuts of graphs is an NP-hard problem, and one often resorts to approximate solutions. In the literature, spectral graph theory provides the most popular approaches for obtaining such approximate solutions. Recently, K.C. Chang introduced a novel nonlinear spectral graph theory and proved that the seek of Cheeger cuts is equivalent to solving a constrained optimization problem. However, this resulting optimization problem is also very challenging as it involves a non-differentiable function over a non-convex set that is composed of simplex cells of different dimensions. In this talk, we will discuss an ADMM algorithm for solving this optimization problem and provide some convergence analysis. Experimental results will be presented for typical graphs, including Petersen's graph and Cockroach graphs, the well-known Zachary karate club graph, and some preliminary applications in material sciences.

Maximal volume matrix cross approximation for image compression and least squares solution

Series: Applied and Computational Mathematics Seminar
Time: Monday, September 16, 2024 - 14:00 for 1 hour (actually 50 minutes)
Location: Skiles 005
Speaker: Zhaiming Shen – Georgia Tech – zshen49@gatech.edu

We study the classic matrix cross approximation based on the maximal volume submatrices. Our main results consist of an improvement of the classic estimate for matrix cross approximation and a greedy approach for finding the maximal volume submatrices. More precisely, we present a new proof of the classic estimate of the inequality with an improved constant. Also, we present a family of greedy maximal volume algorithms to improve the computational efficiency of matrix cross approximation. The proposed algorithms are shown to have theoretical guarantees of convergence. Finally, we present two applications: image compression and the least squares approximation of continuous functions. Our numerical results demonstrate the effective performance of our approach.

Poisson Meets Poisson: Implicit boundary integral method for linearized Poisson Boltzmann equation

Series: Applied and Computational Mathematics Seminar
Time: Monday, August 26, 2024 - 14:00 for 1 hour (actually 50 minutes)
Location: Skiles 005
Speaker: Yimin Zhong – Auburn University

In this talk, I will give an introduction to the implicit boundary integral method based on the co-area formula and it provides a simple quadrature rule for boundary integral on general surfaces. Then, I will focus on the application of solving the linearized Poisson Boltzmann equation, which is used to model the electric potential of protein molecules in a solvent. Near the singularity, I will briefly discuss the choices of regularization/correction and illustrate the effect of both cases. In the end, I will show the numerical analysis for the error estimate.

Degeneracy of eigenvalues and singular values of parameter dependent matrices

Series: Applied and Computational Mathematics Seminar
Time: Monday, May 6, 2024 - 14:00 for 1 hour (actually 50 minutes)
Location: Skiles 005 and https://gatech.zoom.us/j/93530218689?pwd=SFkzMXZyZXhZOTdRazhyL1BoVXprdz09
Speaker: Alessandro Pugliese – Università degli Studi di Bari Aldo Moro – alessandro.pugliese@uniba.it

Speaker will present in person.

Hermitian matrices have real eigenvalues and an orthonormal set of eigenvectors. Do smooth Hermitian matrix valued functions have smooth eigenvalues and eigenvectors? Starting from such question, we will first review known results on the smooth eigenvalue and singular values decompositions of matrices that depend on one or several parameters, and then focus on our contribution, which has been that of devising topological tools to detect and approximate parameters' values where eigenvalues or singular values of a matrix valued function are degenerate (i.e. repeated or zero).

The talk will be based on joint work with Luca Dieci (Georgia Tech) and Alessandra Papini (Univ. of Florence).

Generative modeling through time reversal and reflection of diffusion processes

Series: Applied and Computational Mathematics Seminar
Time: Monday, April 29, 2024 - 14:00 for 1 hour (actually 50 minutes)
Location: Skiles 005 and https://gatech.zoom.us/j/98355006347
Speaker: Nicole Yang – Emory University

Please Note: Speaker will present in person.

In this talk, we discuss generative modeling algorithms motivated by the time reversal and reflection properties of diffusion processes. Score-based diffusion models (SBDM) have recently emerged as state-of-the-art approaches for image generation. We develop SBDMs in the infinite-dimensional setting, that is, we model the training data as functions supported on a rectangular domain. Besides the quest for generating images at ever higher resolution, our primary motivation is to create a well-posed infinite-dimensional learning problem so that we can discretize it consistently at multiple resolution levels. We demonstrate how to overcome two shortcomings of current SBDM approaches in the infinite-dimensional setting by ensuring the well-posedness of forward and reverse processes, and derive the convergence of the approximation of multilevel training. We illustrate that approximating the score function with an operator network is beneficial for multilevel training.

In the second part of this talk, we propose the Reflected Schrodinger Bridge algorithm: an entropy-regularized optimal transport approach tailored for generating data within diverse bounded domains. We derive reflected forward-backward stochastic differential equations with Neumann and Robin boundary conditions, extend divergence-based likelihood training to bounded domains, and demonstrate its scalability in constrained generative modeling.

Monotone generative modeling via a geometry-preserving mapping

Series: Applied and Computational Mathematics Seminar
Time: Monday, April 15, 2024 - 14:00 for 1 hour (actually 50 minutes)
Location: Skiles 005 and https://gatech.zoom.us/j/98355006347
Speaker: Wonjun Lee – University of Minnesota, Twin Cities – lee01273@umn.edu

Generative Adversarial Networks (GANs) are powerful tools for creating new content, but they face challenges such as sensitivity to starting conditions and mode collapse. To address these issues, we propose a deep generative model that utilizes the Gromov-Monge embedding (GME). It helps identify the low-dimensional structure of the underlying measure of the data and then map it, while preserving its geometry, into a measure in a low-dimensional latent space, which is then optimally transported to the reference measure. We guarantee the preservation of the underlying geometry by the GME and c-cyclical monotonicity of the generative map, where c is an intrinsic embedding cost employed by the GME. The latter property is a first step in guaranteeing better robustness to initialization of parameters and mode collapse. Numerical experiments demonstrate the effectiveness of our approach in generating high-quality images, avoiding mode collapse, and exhibiting robustness to different starting conditions.

Georgia Institute of Technology College of Sciences

Search form