Analysis of B-Scan Ultrasonography Using Neural Networks to Predict Risk of Fibrosis in Patients With Metabolic Dysfunction-Associated Steatotic Liver Disease (MASLD)

Research output: Contribution to journalResearch articleContributedpeer-review

Contributors

  • Josefine Stansch - , University Hospital Leipzig (Author)
  • Kien Vu Trung - , University Hospital Leipzig (Author)
  • Valentin Blank - , Martin Luther University Halle-Wittenberg (Author)
  • Jakob Nikolas Kather - , Else Kröner Fresenius Center for Digital Health (Author)
  • Moritz Herzog - , Else Kröner Fresenius Center for Digital Health (Author)
  • Tobias Seibel - , RWTH Aachen University (Author)
  • Paul Henry Koop - , RWTH Aachen University (Author)
  • Robert Haase - , Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI) Dresden/Leipzig (Author)
  • Thomas Berg - , University Hospital Leipzig (Author)
  • Johannes Wiegand - , University Hospital Leipzig (Author)
  • Thomas Karlas - , University Hospital Leipzig (Author)

Abstract

Objective The prevalence of metabolic dysfunction–associated steatotic liver disease (MASLD) continues to rise, underscoring the need for tools to stratify individual risk of disease progression. We evaluated whether logistic regression models augmented by deep learning–based predictions (DLPs) can improve the B-mode ultrasound-based identification of at-risk MASLD, defined as patients with increased fibrosis risk. Methods We retrospectively analyzed 205 patients with a total of 636 ultrasound images. We developed a model that reproduces the LSM-based dichotomous fibrosis risk classification using clinical parameters and ultrasound image–derived deep learning pipelines. Patients were classified by same-day liver stiffness measurement (LSM) ('8 kPa: low fibrosis risk; ≥8 kPa: increased fibrosis risk). We assessed the incremental value of DLPs when added to the parameters sex, age, BMI, diabetes mellitus type 2 status and the fibrosis-4 score (FIB-4) based on accuracy, AUROC, and related statistics. Results The logistic regression model combining the clinical parameters and the DLPs achieved acceptable performance with an AUROC of 0.73 and a test accuracy of 68%. The same model without DLPs showed an AUROC of 0.72 and a test accuracy of 61%. Including FIB-4 improved performance further (AUROC 0.92, accuracy 88%). Models based solely on image data demonstrated limited diagnostic performance. Conclusion B-mode ultrasound provides a weak fibrosis-related signal, yielding limited diagnostic performance. Meaningful discrimination required the incorporation of clinical parameters, with FIB-4 offering the greatest improvement among the parameters assessed. Deep learning predictions added only modest incremental value. Prospective validation is needed to clarify clinical utility.

Details

Original languageEnglish
Pages (from-to)1346-1354
Number of pages9
JournalUltrasound in Medicine and Biology
Volume52
Issue number7
Early online date13 Apr 2026
Publication statusE-pub ahead of print - 13 Apr 2026
Peer-reviewedYes

External IDs

ORCID /0000-0002-0676-6926/work/215165137
ORCID /0000-0002-3730-5348/work/215165913

Keywords

Sustainable Development Goals

Keywords

  • Deep learning (DL), Liver fibrosis, Liver stiffness measurement (LSM), Metabolic dysfunction associated steatotic liver disease (MASLD)