Expansion and re-classification of the extracytoplasmic function (ECF) σ factor family

Research output: Contribution to journalResearch articleContributedpeer-review

Contributors

  • Delia Casas-Pastor - , LOEWE-Center for Synthetic Microbiology (SYNMIKRO) (Author)
  • Raphael R Müller - , Justus Liebig University Giessen (Author)
  • Sebastian Jaenicke - , Justus Liebig University Giessen (Author)
  • Karina Brinkrolf - , Justus Liebig University Giessen (Author)
  • Anke Becker - , LOEWE-Center for Synthetic Microbiology (SYNMIKRO) (Author)
  • Mark J Buttner - , John Innes Centre (Author)
  • Carol A Gross - , University of California at Berkeley (Author)
  • Thorsten Mascher - , Chair of General Microbiology, Institute of Microbiology (Author)
  • Alexander Goesmann - , Justus Liebig University Giessen (Author)
  • Georg Fritz - , University of Western Australia (Author)

Abstract

Extracytoplasmic function σ factors (ECFs) represent one of the major bacterial signal transduction mechanisms in terms of abundance, diversity and importance, particularly in mediating stress responses. Here, we performed a comprehensive phylogenetic analysis of this protein family by scrutinizing all proteins in the NCBI database. As a result, we identified an average of ∼10 ECFs per bacterial genome and 157 phylogenetic ECF groups that feature a conserved genetic neighborhood and a similar regulation mechanism. Our analysis expands previous classification efforts ∼50-fold, enriches many original ECF groups with previously unclassified proteins and identifies 22 entirely new ECF groups. The ECF groups are hierarchically related to each other and are further composed of subgroups with closely related sequences. This two-tiered classification allows for the accurate prediction of common promoter motifs and the inference of putative regulatory mechanisms across subgroups composing an ECF group. This comprehensive, high-resolution description of the phylogenetic distribution of the ECF family, together with the massive expansion of classified ECF sequences and an openly accessible data repository called 'ECF Hub' (https://www.computational.bio.uni-giessen.de/ecfhub), will serve as a powerful hypothesis-generator to guide future research in the field.

Details

Original languageEnglish
Pages (from-to)986-1005
Number of pages20
JournalNucleic acids research
Volume49
Issue number2
Publication statusPublished - 25 Jan 2021
Peer-reviewedYes

External IDs

PubMedCentral PMC7826278
Scopus 85100358296

Keywords

Keywords

  • Amino Acid Sequence, Bacterial Proteins/chemistry, Consensus Sequence, DNA-Directed RNA Polymerases/chemistry, Gene Expression Regulation, Bacterial, Multigene Family, Phylogeny, Sequence Alignment, Sigma Factor/classification, Signal Transduction, Substrate Specificity, Terminology as Topic

Library keywords