Reduced false positives in PDZ binding prediction using sequence and structural descriptors

John C Hawkins; Hongbo Zhu; Joan Teyra; M Teresa Pisabarro

doi:10.1109/TCBB.2012.54

Research output: Contribution to journal › Research article › Contributed › peer-review

Contributors

John C Hawkins, Biotechnology Center (BIOTEC) (Author)
Hongbo Zhu, Biotechnology Center (BIOTEC) (Author)
Joan Teyra, Biotechnology Center (BIOTEC) (Author)
M Teresa Pisabarro, Structural Bioinformatics (Research Group), Biotechnology Center (Author)

Abstract

Identifying the binding partners of proteins is a problem of fundamental importance in computational biology. The PDZ is one of the most common and well-studied protein binding domains, hence it is a perfect model system for designing protein binding predictors. The standard approach to identifying the binding partners of PDZ domains uses multiple sequence alignments to infer the set of contact residues that are used in a predictive model. We expand on the sequence alignment approach by incorporating structural information to generate descriptors of the binding site geometry. Furthermore, we generate a real-value score for binary predictions by applying a filter based on models that predict the probability distributions of contact residues at each of the canonical PDZ ligand binding positions. Under training cross validation, our model produced an order of magnitude more predictions at a false positive proportion (FPP) of 10 percent than our benchmark model chosen from the literature. Evaluated using an independent cross validation, with computationally predicted structures, our model was able to make five times as many predictions as the benchmark model, with a Matthews' correlation coefficient (MCC) of 0.33. In addition, our model achieved a false positive proportion of 0.14, while the benchmark model had a 0.25 false positive proportion.
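
The two evaluation measures named in the abstract can be illustrated with a minimal Python sketch. The MCC formula is the standard one over a binary confusion matrix. The record does not define FPP, so the sketch assumes it means the fraction of positive predictions that are wrong, FP / (TP + FP); the paper's own definition may differ. The function names and the example counts are hypothetical and are not taken from the paper.

```python
import math


def matthews_corrcoef(tp: int, fp: int, tn: int, fn: int) -> float:
    """Standard Matthews correlation coefficient for a binary confusion matrix."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Common convention: report 0 when any marginal total is zero.
        return 0.0
    return (tp * tn - fp * fn) / denom


def false_positive_proportion(tp: int, fp: int) -> float:
    """ASSUMED definition: share of positive predictions that are false.

    The abstract does not spell out FPP; FP / (TP + FP) is an assumption.
    """
    predicted_positive = tp + fp
    if predicted_positive == 0:
        return 0.0
    return fp / predicted_positive


if __name__ == "__main__":
    # Hypothetical counts, chosen only to exercise the functions.
    tp, fp, tn, fn = 86, 14, 300, 100
    print("MCC:", matthews_corrcoef(tp, fp, tn, fn))
    print("FPP:", false_positive_proportion(tp, fp))
```

Under this assumed definition, an FPP of 0.14 would mean that 14 of every 100 predicted binders are not true binders, which is the sense in which the abstract compares 0.14 against the benchmark's 0.25.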

Details

Original language	English
Pages (from-to)	1492-1503
Number of pages	12
Journal	IEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume	9
Issue number	5
Publication status	Published - 18 Apr 2012
Peer-reviewed	Yes

External IDs

Scopus	84864576889
ORCID	/0000-0002-5175-9311/work/175747480

Keywords

Binding Sites, Databases, Protein, PDZ Domains, Protein Conformation, Proteins/chemistry, Sequence Alignment

Library keywords

570 Biology

Research Portal of the TU Dresden
