The complexity of phylogeny constraint satisfaction problems

Research output: Contribution to journalResearch articleContributedpeer-review

Contributors

Abstract

We systematically study the computational complexity of a broad class of computational problems in phylogenetic reconstruction. The class contains, for example, the rooted triple consistency problem, forbidden subtree problems, the quartet consistency problem, and many other problems studied in the bioinformatics literature. The studied problems can be described as constraint satisfaction problems, where the constraints have a first-order definition over the rooted triple relation. We show that every such phylogeny problem can be solved in polynomial time or is NP-complete. On the algorithmic side, we generalize a well-known polynomial-time algorithm of Aho, Sagiv, Szymanski, and Ullman for the rooted triple consistency problem. Our algorithm repeatedly solves linear equation systems to construct a solution in polynomial time.We then showthat every phylogeny problem that cannot be solved by our algorithm is NP-complete. Our classification establishes a dichotomy for a large class of infinite structures that we believe is of independent interest in universal algebra, model theory, and topology. The proof of our main result combines results and techniques from various research areas: a recent classification of the model-complete cores of the reducts of the homogeneous binary branching C-relation, Leeb's Ramsey theorem for rooted trees, and universal algebra.

Details

Original languageEnglish
Article number23
JournalACM transactions on computational logic
Volume18
Issue number3
Publication statusPublished - Jul 2017
Peer-reviewedYes

External IDs

ORCID /0000-0001-8228-3611/work/142241092

Keywords

Keywords

  • Computational complexity, Constraint satisfaction problems, Model theory, Phylogenetic reconstruction, Ramsey theory