On a Fixed Haplotype Variant of the Minimum Error Correction Problem

Axel Goblet, Steven Kelk, Matúš Mihalák, Georgios Stamoulis*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingChapterAcademic

Abstract

Haplotype assembly is the problem of reconstructing the two parental chromosomes of an individual from a set of sampled dna-sequences. A combinatorial optimization problem that models haplotype assembly is the minimum error correction problem (mec). This problem has been intensively studied in the computational biology literature and is also known in the clustering literature: essentially we are required to find two cluster centres such that the sum of distances to the nearest centre, is minimized. We introduce here the problem fixed haplotype-minimum error correction (fh-mec), a new variant of mec which corresponds to instances where one of the haplotypes/centres is already given. We provide hardness results for the problem on various restricted instances. We also propose a new and very simple 2-approximation algorithm for mec on binary input matrices.
Original languageEnglish
Title of host publicationComputing and Combinatorics. COCOON 2018
EditorsL. Wang, D. Zhu
PublisherSpringer Verlag
Pages554-566
DOIs
Publication statusPublished - 29 Jun 2018

Publication series

SeriesLecture Notes in Computer Science
Volume10976

Cite this