Agreement forests of caterpillar trees: Complexity, kernelization and branching

S. Kelk*, R. Meuwese

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Given a set X of species, a phylogenetic tree is an unrooted binary tree whose leaves are bijectively labelled by X. Such trees can be used to show the way species evolve over time. One way of understanding how topologically different two phylogenetic trees are, is to construct a minimum-size agreement forest: a partition of X into the smallest number of blocks, such that the blocks induce homeomorphic, non-overlapping subtrees in both trees. This is called a maximum agreement forest. This comparison yields insight into commonalities and differences in the evolution of X across the two trees. Computing a maximum agreement forest is NP-hard [8]. In this work we study the problem on caterpillars, which are path-like phylogenetic trees. We will demonstrate that, even if we restrict the input to this highly restricted subclass, the problem remains NP-hard and is in fact APX-hard. Furthermore we show that for caterpillars two standard reduction rules well known in the literature yield a tight kernel of size at most 7k, compared to 15k for general trees [11]. Finally we demonstrate that we can determine if two caterpillars have an agreement forest with at most k blocks in O?(2.49k) time, compared to O?(3k) for general trees [4], where O?(.) suppresses polynomial factors.
Original languageEnglish
Article number114608
JournalTheoretical Computer Science
Volume1003
DOIs
Publication statusPublished - 1 Jul 2024

Keywords

  • Agreement forests
  • Caterpillars
  • Combinatorics
  • Exponential-time algorithms
  • Phylogenetics

Fingerprint

Dive into the research topics of 'Agreement forests of caterpillar trees: Complexity, kernelization and branching'. Together they form a unique fingerprint.

Cite this