MeDIP-HMM: Genome-wide identification of distinct DNA methylation states from high-density tiling arrays

Publikation: Beitrag in FachzeitschriftForschungsartikelBeigetragenBegutachtung

Beitragende

  • Michael Seifert - , Zelluläre Netzwerke und Systembiologie (FoG), Leibniz-Institut für Pflanzengenetik und Kulturpflanzenforschung, Centre national de la recherche scientifique (CNRS) (Autor:in)
  • Sandra Cortijo - , Centre national de la recherche scientifique (CNRS) (Autor:in)
  • Maria Colomé-Tatché - , University of Groningen (Autor:in)
  • Frank Johannes - , University of Groningen (Autor:in)
  • François Roudier - , Centre national de la recherche scientifique (CNRS) (Autor:in)
  • Vincent Colot - , Centre national de la recherche scientifique (CNRS) (Autor:in)

Abstract

MOTIVATION: Methylation of cytosines in DNA is an important epigenetic mechanism involved in transcriptional regulation and preservation of genome integrity in a wide range of eukaryotes. Immunoprecipitation of methylated DNA followed by hybridization to genomic tiling arrays (MeDIP-chip) is a cost-effective and sensitive method for methylome analyses. However, existing bioinformatics methods only enable a binary classification into unmethylated and methylated genomic regions, which limit biological interpretations. Indeed, DNA methylation levels can vary substantially within a given DNA fragment depending on the number and degree of methylated cytosines. Therefore, a method for the identification of more than two methylation states is highly desirable.

RESULTS: Here, we present a three-state hidden Markov model (MeDIP-HMM) for analyzing MeDIP-chip data. MeDIP-HMM uses a higher-order state-transition process improving modeling of spatial dependencies between chromosomal regions, allows a simultaneous analysis of replicates and enables a differentiation between unmethylated, methylated and highly methylated genomic regions. We train MeDIP-HMM using a Bayesian Baum-Welch algorithm, integrating prior knowledge on methylation levels. We apply MeDIP-HMM to the analysis of the Arabidopsis root methylome and systematically investigate the benefit of using higher-order HMMs. Moreover, we also perform an in-depth comparison study with existing methods and demonstrate the value of using MeDIP-HMM by comparisons to current knowledge on the Arabidopsis methylome. We find that MeDIP-HMM is a fast and precise method for the analysis of methylome data, enabling the identification of distinct DNA methylation levels. Finally, we provide evidence for the general applicability of MeDIP-HMM by analyzing promoter DNA methylation data obtained for chicken.

AVAILABILITY: MeDIP-HMM is available as part of the open-source Java library Jstacs (www.jstacs.de/index.php/MeDIP-HMM). Data files are available from the Jstacs website.

CONTACT: seifert@ipk-gatersleben.de.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Details

OriginalspracheEnglisch
Seiten (von - bis)2930-2939
Seitenumfang10
FachzeitschriftBioinformatics
Jahrgang28
Ausgabenummer22
PublikationsstatusVeröffentlicht - 15 Nov. 2012
Peer-Review-StatusJa

Externe IDs

Scopus 84869397620
ORCID /0000-0002-2844-053X/work/153655336

Schlagworte

Schlagwörter

  • Algorithms, Arabidopsis/genetics, Bayes Theorem, DNA Methylation, Epigenesis, Genetic, Genome-Wide Association Study, Genomics/methods, Immunoprecipitation/methods, Oligonucleotide Array Sequence Analysis/methods