Next-generation sequencing reveals differentially amplified tandem repeats as a major genome component of Northern Europe's oldest Camellia japonica

Research output: Contribution to journalResearch articleContributedpeer-review

Contributors

Abstract

Northern Europe's oldest and largest Camellia japonica growing at the Pillnitz Castle (Germany) for over 200 years is of botanical and cultural importance and is a reference for C. japonica molecular scale analysis. In order to provide a fundament for genome analysis of the genus Camellia, we characterize the C. japonica tandem repeat fraction, constituting 12.5 % of the Pillnitz camellia's genome. A genomic library of the Pillnitz C. japonica was produced and Illumina sequenced to generate 36 Gb of paired-end reads. We performed graph-based read clustering implemented in the RepeatExplorer pipeline to estimate the C. japonica repeat fraction of 73 %. This enabled us to identify and characterize the most prominent satellite DNAs, Camellia japonica satellite 1-4 (CajaSat1-CajaSat4), and the 5S ribosomal DNA (rDNA) by bioinformatics, fluorescent in situ and Southern hybridization. Within the Camellia genus, satellite spreading, array expansion and formation of higher-order structures highlight different modes of repeat evolution. The CajaSat satellites localize at prominent chromosomal sites, including (peri)centromeres and subtelomeres of all chromosomes, thus serving as chromosomal landmarks for their identification. This work provides an insight into the C. japonica chromosome organization and significantly expands the Camellia genomic knowledge, also with respect to the tea plant Camellia sinensis.

Details

Original languageEnglish
Pages (from-to)791-806
Number of pages16
JournalChromosome Research
Volume23
Issue number4
Publication statusPublished - Dec 2015
Peer-reviewedYes

External IDs

ORCID /0000-0003-0168-8428/work/53671082
Scopus 84949110239

Keywords

Keywords

  • Base Sequence, Camellia/genetics, Chromosomes, Plant, Consensus Sequence, DNA Methylation, DNA, Satellite, Genome Components, Genome, Plant, High-Throughput Nucleotide Sequencing, Molecular Sequence Data, Sequence Alignment, Tandem Repeat Sequences