NCBench: providing an open, reproducible, transparent, adaptable, and continuous benchmark approach for DNA-sequencing-based variant calling

Friederike Hanssen; Gisela Gabernet; Famke Bäuerle; Bianca Stöcker; Felix Wiegand; Nicholas H. Smith; Christian Mertes; Avirup Guha Neogi; Leon Brandhoff; Anna Ossowski; Janine Altmueller; Kerstin Becker; Andreas Petzold; Marc Sturm; Tyll Stöcker; Sugirthan Sivalingam; Fabian Brand; Axel Schmidt; Andreas Buness; Alexander J. Probst; Susanne Motameny; Johannes Köster

doi:10.12688/f1000research.140344.2

NCBench: providing an open, reproducible, transparent, adaptable, and continuous benchmark approach for DNA-sequencing-based variant calling

Publikation: Beitrag in Fachzeitschrift › Forschungsartikel › Beigetragen › Begutachtung

Beitragende

Friederike Hanssen - , Eberhard Karls Universität Tübingen (Autor:in)
Gisela Gabernet - , Eberhard Karls Universität Tübingen (Autor:in)
Famke Bäuerle - , Eberhard Karls Universität Tübingen, Universitätsklinikum Tübingen (Autor:in)
Bianca Stöcker - , Universität Duisburg-Essen (Autor:in)
Felix Wiegand - , Universität Duisburg-Essen (Autor:in)
Nicholas H. Smith - , Technische Universität München (Autor:in)
Christian Mertes - , Technische Universität München (Autor:in)
Avirup Guha Neogi - , Universität zu Köln (Autor:in)
Leon Brandhoff - , Universität zu Köln (Autor:in)
Anna Ossowski - , Universität zu Köln (Autor:in)
Janine Altmueller - , Universität zu Köln, Berliner Institut für Gesundheitsforschung in der Charité, Max-Delbrück-Centrum für Molekulare Medizin (MDC) (Autor:in)
Kerstin Becker - , Universität zu Köln (Autor:in)
Andreas Petzold - , DRESDEN-concept Genome Center (CMCB Core Facility), Technische Universität Dresden (Autor:in)
Marc Sturm - , Universitätsklinikum Tübingen (Autor:in)
Tyll Stöcker - , Universität Bonn (Autor:in)
Sugirthan Sivalingam - , Universitätsklinikum Düsseldorf (Autor:in)
Fabian Brand - , Universität Bonn (Autor:in)
Axel Schmidt - , Universität Bonn (Autor:in)
Andreas Buness - , Universität Bonn (Autor:in)
Alexander J. Probst - , Universität Duisburg-Essen (Autor:in)
Susanne Motameny - , Universität zu Köln (Autor:in)
Johannes Köster - , Universität Duisburg-Essen, Deutsches Krebsforschungszentrum (DKFZ) (Autor:in)

Abstract

We present the results of the human genomic small variant calling benchmarking initiative of the German Research Foundation (DFG) funded Next Generation Sequencing Competence Network (NGS-CN) and the German Human Genome-Phenome Archive (GHGA). In this effort, we developed NCBench, a continuous benchmarking platform for the evaluation of small genomic variant callsets in terms of recall, precision, and false positive/negative error patterns. NCBench is implemented as a continuously re-evaluated open-source repository. We show that it is possible to entirely rely on public free infrastructure (Github, Github Actions, Zenodo) in combination with established open-source tools. NCBench is agnostic of the used dataset and can evaluate an arbitrary number of given callsets, while reporting the results in a visual and interactive way. We used NCBench to evaluate over 40 callsets generated by various variant calling pipelines available in the participating groups that were run on three exome datasets from different enrichment kits and at different coverages. While all pipelines achieve high overall quality, subtle systematic differences between callers and datasets exist and are made apparent by NCBench.These insights are useful to improve existing pipelines and develop new workflows. NCBench is meant to be open for the contribution of any given callset. Most importantly, for authors, it will enable the omission of repeated re-implementation of paper-specific variant calling benchmarks for the publication of new tools or pipelines, while readers will benefit from being able to (continuously) observe the performance of tools and pipelines at the time of reading instead of at the time of writing.

Details

Originalsprache	Englisch
Aufsatznummer	1125
Fachzeitschrift	F1000Research
Jahrgang	12
Publikationsstatus	Veröffentlicht - 2024
Peer-Review-Status	Ja

Externe IDs

PubMed	39345270
ORCID	/0000-0001-9599-8632/work/174428924

Schlagworte

ASJC Scopus Sachgebiete

Schlagwörter

benchmarking, continuous, NGS, variant calling

Forschungsportal der TU Dresden