Atmospheric Chemistry Surrogate Modeling With Sparse Identification of Nonlinear Dynamics
Xiaokai Yang, Lin Guo, Zhonghua Zheng, Nicole Riemer, Christopher W. TessumAbstract
Modeling atmospheric chemistry is computationally expensive and limits the widespread use of chemical transport models. This computational cost arises from solving high‐dimensional systems of stiff differential equations. Previous work has demonstrated the promise of machine learning (ML) to accelerate air quality model simulations but has suffered from numerical instability during long‐term simulations. This may be because previous ML‐based efforts have relied on explicit Euler time integration—which is known to be unstable for stiff systems—and have used neural networks which are prone to overfitting. We hypothesize that parsimonious models combined with modern numerical integration techniques can overcome this limitation. Using a small‐scale mechanism to explore the potential of these methods, we have created a machine‐learned surrogate by (a) reducing dimensionality using singular value decomposition to create an interpretably‐compressed low‐dimensional latent space and (b) using Sparse Identification of Nonlinear Dynamics (SINDy) to create a differential‐equation‐based representation of the underlying dynamics in the compressed latent space with reduced stiffness. The root mean square error of ML model prediction for ozone concentration over 9 days is 37.8% of the root mean square concentration across all simulations in our testing data set. The surrogate model is 10× faster with 12× fewer integration timesteps compared to reference model and is numerically stable in all tested simulations. Overall, we find that SINDy can be used to create fast, stable, and accurate surrogates of a simple photochemical mechanism. In future work, we will explore the application of this method to more detailed mechanisms and their use in large‐scale simulations.