The Complexity of the Single Individual SNP Haplotyping Problem

Rudi Cilibrasi*, Leo Van Iersel, Steven Kelk, John Tromp

*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

We present several new results pertaining to haplotyping. These results concern the combinatorial problem of reconstructing haplotypes from incomplete and/or imperfectly sequenced haplotype fragments. We consider the complexity of the problems Minimum Error Correction (MEC) and Longest Haplotype Reconstruction (LHR) for different restrictions on the input data. Specifically, we look at the gapless case, where every row of the input corresponds to a gapless haplotype-fragment, and the 1-gap case, where at most one gap per fragment is allowed. We prove that MEC is APX-hard in the 1-gap case and still NP-hard in the gapless case. In addition, we question earlier claims that MEC is NP-hard even when the input matrix is restricted to being completely binary. Concerning LHR, we show that this problem is NP-hard and APX-hard in the 1-gap case (and thus also in the general case), but is polynomial-time solvable in the gapless case.
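The input restrictions named in the abstract can be illustrated with a minimal sketch. It assumes, and the abstract itself does not spell this out, that a fragment is written as a string over `0`, `1` and `-`, where `-` marks an unread position (a hole), and that a gap is a maximal run of holes with read positions on both sides, so leading and trailing holes do not count. The function names `count_gaps` and `classify` are hypothetical and chosen only for this illustration.

```python
def count_gaps(fragment: str, hole: str = "-") -> int:
    """Count maximal runs of holes flanked by read positions on both sides.

    Assumption: leading and trailing holes are not gaps, so they are stripped.
    """
    core = fragment.strip(hole)
    gaps = 0
    in_gap = False
    for symbol in core:
        if symbol == hole:
            if not in_gap:  # a new run of holes starts here
                gaps += 1
            in_gap = True
        else:
            in_gap = False
    return gaps


def classify(matrix: list[str]) -> str:
    """Label a fragment matrix by the largest number of gaps in any row."""
    worst = max((count_gaps(row) for row in matrix), default=0)
    if worst == 0:
        return "gapless"
    if worst == 1:
        return "1-gap"
    return "general"
```

Under these assumptions, a matrix such as `["--01-", "110--"]` falls into the gapless case, while adding a row like `"0--1"` moves it into the 1-gap case.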
Original language: English
Pages (from-to): 13-36
Journal: Algorithmica
Volume: 49
Issue number: 1
Publication status: Published - 1 Sept 2007
Externally published: Yes
