Studies on the inference of protein binding regions across fold space based on structural similarities

Research output: Contribution to journalResearch articleContributedpeer-review

Contributors

Abstract

The emerging picture of a continuous protein fold space highlights the existence of non obvious structural similarities between proteins with apparent different topologies. The identification of structure resemblances across fold space and the analysis of similar recognition regions may be a valuable source of information towards protein structure-based functional characterization. In this work, we use non-sequential structural alignment methods (ns-SAs) to identify structural similarities between protein pairs independently of their SCOP hierarchy, and we calculate the significance of binding region conservation using the interacting residues overlap in the ns-SA. We cluster the binding inferences for each family to distinguish already known family binding regions from putative new ones. Our methodology exploits the enormous amount of data available in the PDB to identify binding region similarities within protein families and to propose putative binding regions. Our results indicate that there is a plethora of structurally common binding regions among proteins, independently of current fold classifications. We obtain a 6- to 8-fold enrichment of novel binding regions, and identify binding inferences for 728 protein families that so far lack binding information in the PDB. We explore binding mode analogies between ligands from commonly clustered binding regions to investigate the utility of our methodology. A comprehensive analysis of the obtained binding inferences may help in the functional characterization of protein recognition and assist rational engineering. The data obtained in this work is available in the download link at www.scowlp.org.

Details

Original languageEnglish
Pages (from-to)499-508
Number of pages10
JournalProteins
Volume79
Issue number2
Publication statusPublished - 7 Oct 2010
Peer-reviewedYes

External IDs

Scopus 78650761785

Keywords

Keywords

  • Algorithms, Computer Simulation, Databases, Protein, Models, Molecular, Protein Binding, Protein Folding, Protein Interaction Domains and Motifs, Protein Multimerization, Protein Structure, Quaternary, Proteins/chemistry, Structural Homology, Protein