Machine learning for identifying relevant publications in updates of systematic reviews of diagnostic test studies

Toni Lange; Guido Schwarzer; Thomas Datzmann; Harald Binder

doi:10.1002/jrsm.1486

Machine learning for identifying relevant publications in updates of systematic reviews of diagnostic test studies

Publikation: Beitrag in Fachzeitschrift › Forschungsartikel › Beigetragen › Begutachtung

Beitragende

Toni Lange - , Zentrum für evidenzbasierte Gesundheitsversorgung (Erstautor:in)
Guido Schwarzer - , Albert-Ludwigs-Universität Freiburg (Autor:in)
Thomas Datzmann - , Zentrum für evidenzbasierte Gesundheitsversorgung (Autor:in)
Harald Binder - , Albert-Ludwigs-Universität Freiburg (Letztautor:in)

Abstract

Updating systematic reviews is often a time-consuming process that involves a lot of human effort and is therefore not conducted as often as it should be. The aim of our research project was to explore the potential of machine learning methods to reduce human workload. Furthermore, we evaluated the performance of deep learning methods in comparison to more established machine learning methods. We used three available reviews of diagnostic test studies as the data set. In order to identify relevant publications, we used typical text pre-processing methods. The reference standard for the evaluation was the human-consensus based on binary classification (inclusion, exclusion). For the evaluation of the models, various scenarios were generated using a grid of combinations of data preprocessing steps. Moreover, we evaluated each machine learning approach with an approach-specific predefined grid of tuning parameters using the Brier score metric. The best performance was obtained with an ensemble method for two of the reviews, and by a deep learning approach for the other review. Yet, the final performance of approaches strongly depends on data preparation. Overall, machine learning methods provided reasonable classification. It seems possible to reduce human workload in updating systematic reviews by using machine learning methods. Yet, as the influence of data preprocessing on the final performance seems to be at least as important as choosing the specific machine learning approach, users should not blindly expect a good performance by solely using approaches from a popular class, such as deep learning.

Details

Originalsprache	Englisch
Seiten (von - bis)	506-515
Seitenumfang	10
Fachzeitschrift	Research Synthesis Methods
Jahrgang	12
Ausgabenummer	4
Frühes Online-Datum	15 März 2021
Publikationsstatus	Veröffentlicht - Juli 2021
Peer-Review-Status	Ja

Externe IDs

PubMed	33720520
ORCID	/0000-0002-8671-7496/work/152545136

Schlagworte

ASJC Scopus Sachgebiete

Ausbildung bzw. Denomination

Schlagwörter

deep learning, machine learning, preprocessing, systematic reviews, text mining

Bibliotheksschlagworte

610 Medizin und Gesundheit

Forschungsportal der TU Dresden