SHARK-capture identifies functional motifs in intrinsically disordered protein regions

Research output: Contribution to journalResearch articleContributedpeer-review

Contributors

  • Chi Fung Willis Chow - , Clusters of Excellence PoL: Physics of Life, Max Planck Institute of Molecular Cell Biology and Genetics, Center for Systems Biology Dresden (CSBD) (Author)
  • Swantje Lenz - , Max Planck Institute of Molecular Cell Biology and Genetics, Center for Systems Biology Dresden (CSBD) (Author)
  • Maxim Scheremetjew - , Max Planck Institute of Molecular Cell Biology and Genetics, Center for Systems Biology Dresden (CSBD) (Author)
  • Soumyadeep Ghosh - , Max Planck Institute of Molecular Cell Biology and Genetics, Center for Systems Biology Dresden (CSBD) (Author)
  • Doris Richter - , Max Planck Institute of Molecular Cell Biology and Genetics (Author)
  • Ceciel Jegers - , Clusters of Excellence PoL: Physics of Life, Chair of Cellular Biochemistry (Author)
  • Alexander von Appen - , Max Planck Institute of Molecular Cell Biology and Genetics, Center for Systems Biology Dresden (CSBD) (Author)
  • Simon Alberti - , Clusters of Excellence PoL: Physics of Life, Chair of Cellular Biochemistry (Author)
  • Agnes Toth-Petroczy - , Max Planck Institute of Molecular Cell Biology and Genetics, Center for Systems Biology Dresden (CSBD), Clusters of Excellence PoL: Physics of Life (Author)

Abstract

Increasing insights into how sequence motifs in intrinsically disordered regions (IDRs) provide functions underscore the need for systematic motif detection. Contrary to structured regions where motifs can be readily identified from sequence alignments, the rapid evolution of IDRs limits the usage of alignment-based tools in reliably detecting motifs within. Here, we developed SHARK-capture, an alignment-free motif detection tool designed for difficult-to-align regions. SHARK-capture innovates on word-based methods by flexibly incorporating amino acid physicochemistry to assess motif similarity without requiring rigid definitions of equivalency groups. SHARK-capture offers consistently strong performance in a systematic benchmark, with superior residue-level performance. SHARK-capture identified known functional motifs across orthologs of the microtubule-associated zinc finger protein BuGZ. We also identified a short motif in the IDR of S. cerevisiae RNA helicase Ded1p, which we experimentally verified to be capable of promoting ATPase activity. Our improved performance allows us to systematically calculate 10,889 motifs for 2695 yeast IDRs and provide it as a resource. SHARK-capture offers the most precise tool yet for the systematic identification of conserved regions in IDRs and is freely available as a Python package (https://pypi.org/project/bio-shark/) and on https://git.mpi-cbg.de/tothpetroczylab/shark.

Details

Original languageEnglish
Article numbere70091
JournalProtein science
Volume34
Issue number4
Publication statusPublished - Apr 2025
Peer-reviewedYes

External IDs

PubMed 40100159
ORCID /0000-0003-4017-6505/work/181860566

Keywords

ASJC Scopus subject areas

Keywords

  • alignment-free, IDRs, motif detection, sequence-to-function