Inverse Dirichlet weighting enables reliable training of physics informed neural networks

Suryanarayana Maddu; Dominik Sturm; Christian L. Mueller; Ivo F. Sbalzarini

doi:10.1088/2632-2153/ac3712

Inverse Dirichlet weighting enables reliable training of physics informed neural networks

Research output: Contribution to journal › Research article › Contributed › peer-review

Contributors

Suryanarayana Maddu - , Chair of Scientific Computing for Systems Biology, Chair of Databases, Center for Systems Biology Dresden (CSBD), Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI) Dresden/Leipzig (Author)
Dominik Sturm - , Center for Advanced Systems Understanding (CASUS), Helmholtz-Zentrum Dresden-Rossendorf (Author)
Christian L. Mueller - , Flatiron Institute, Ludwig Maximilian University of Munich, Helmholtz Zentrum München - German Research Center for Environmental Health (Author)
Ivo F. Sbalzarini - , Chair of Scientific Computing for Systems Biology, Clusters of Excellence PoL: Physics of Life, Max Planck Institute of Molecular Cell Biology and Genetics, Center for Systems Biology Dresden (CSBD), Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI) Dresden/Leipzig (Author)

Abstract

We characterize and remedy a failure mode that may arise from multi-scale dynamics with scale imbalances during training of deep neural networks, such as physics informed neural networks (PINNs). PINNs are popular machine-learning templates that allow for seamless integration of physical equation models with data. Their training amounts to solving an optimization problem over a weighted sum of data-fidelity and equation-fidelity objectives. Conflicts between objectives can arise from scale imbalances, heteroscedasticity in the data, stiffness of the physical equation, or from catastrophic interference during sequential training. We explain the training pathology arising from this and propose a simple yet effective inverse Dirichlet weighting strategy to alleviate the issue. We compare with Sobolev training of neural networks, providing the baseline of analytically epsilon-optimal training. We demonstrate the effectiveness of inverse Dirichlet weighting in various applications, including a multi-scale model of active turbulence, where we show orders of magnitude improvement in accuracy and convergence over conventional PINN training. For inverse modeling using sequential training, we find that inverse Dirichlet weighting protects a PINN against catastrophic forgetting.

Details

Original language	English
Article number	015026
Number of pages	22
Journal	Machine learning: science and technology
Volume	3
Issue number	1
Publication status	Published - 15 Feb 2022
Peer-reviewed	Yes

External IDs

unpaywall	10.1088/2632-2153/ac3712
Scopus	85126707714
ORCID	/0000-0003-4414-4340/work/142252132

Keywords

Research priority areas of TU Dresden

DFG Classification of Subject Areas according to Review Boards

Subject groups, research areas, subject areas according to Destatis

Sustainable Development Goals

ASJC Scopus subject areas

Keywords

physics-informed neural networks, multi-scale modeling, active turbulence, catastrophic forgetting, multi-objective training, gradient flow regularization, ALGORITHM

Library keywords

004 Computer science

Research Portal of the TU Dresden

Contributors

Abstract

Details

External IDs

Keywords

Research priority areas of TU Dresden

DFG Classification of Subject Areas according to Review Boards

Subject groups, research areas, subject areas according to Destatis

Sustainable Development Goals

ASJC Scopus subject areas

Keywords

Library keywords