Inverse Dirichlet weighting enables reliable training of physics informed neural networks
Publikation: Beitrag in Fachzeitschrift › Forschungsartikel › Beigetragen › Begutachtung
Beitragende
Abstract
We characterize and remedy a failure mode that may arise from multi-scale dynamics with scale imbalances during training of deep neural networks, such as physics informed neural networks (PINNs). PINNs are popular machine-learning templates that allow for seamless integration of physical equation models with data. Their training amounts to solving an optimization problem over a weighted sum of data-fidelity and equation-fidelity objectives. Conflicts between objectives can arise from scale imbalances, heteroscedasticity in the data, stiffness of the physical equation, or from catastrophic interference during sequential training. We explain the training pathology arising from this and propose a simple yet effective inverse Dirichlet weighting strategy to alleviate the issue. We compare with Sobolev training of neural networks, providing the baseline of analytically epsilon-optimal training. We demonstrate the effectiveness of inverse Dirichlet weighting in various applications, including a multi-scale model of active turbulence, where we show orders of magnitude improvement in accuracy and convergence over conventional PINN training. For inverse modeling using sequential training, we find that inverse Dirichlet weighting protects a PINN against catastrophic forgetting.
Details
Originalsprache | Englisch |
---|---|
Aufsatznummer | 015026 |
Seitenumfang | 22 |
Fachzeitschrift | Machine learning: science and technology |
Jahrgang | 3 |
Ausgabenummer | 1 |
Publikationsstatus | Veröffentlicht - 15 Feb. 2022 |
Peer-Review-Status | Ja |
Externe IDs
unpaywall | 10.1088/2632-2153/ac3712 |
---|---|
Scopus | 85126707714 |
ORCID | /0000-0003-4414-4340/work/142252132 |
Schlagworte
Forschungsprofillinien der TU Dresden
DFG-Fachsystematik nach Fachkollegium
- Interaktive und intelligente Systeme, Bild- und Sprachverarbeitung, Computergraphik und Visualisierung
- Massiv parallele und datenintensive Systeme
- Bioinformatik und Theoretische Biologie
- Statistische Physik, Weiche Materie, Biologische Physik, Nichtlineare Dynamik
- Entwicklungsbiologie
- Softwaretechnik und Programmiersprachen
- Zellbiologie
- Biophysik
- Mathematik
Fächergruppen, Lehr- und Forschungsbereiche, Fachgebiete nach Destatis
Ziele für nachhaltige Entwicklung
ASJC Scopus Sachgebiete
Schlagwörter
- physics-informed neural networks, multi-scale modeling, active turbulence, catastrophic forgetting, multi-objective training, gradient flow regularization, ALGORITHM