TY - CHAP

T1 - Parameterized Complexity Analysis for the Closest String with Wildcards Problem

AU - Hermelin, Dan

AU - Rozenberg, Liat

PY - 2014/1

Y1 - 2014/1

N2 - The Closest String problem asks to find a string s which is not too far from each string in a set of m input strings, where the distance is taken as the Hamming distance. This well-studied problem has various applications in computational biology and drug design. In this paper, we introduce a new variant of Closest String where the input strings can contain wildcards that can match any letter in the alphabet, and the goal is to find a solution string without wildcards. We call this problem the Closest String with Wildcards problem, and we analyze it in the framework of parameterized complexity. Our study determines for each natural parameterization whether this parameterization yields a fixed-parameter algorithm, or whether such an algorithm is highly unlikely to exist. More specifically, let m denote the number of input strings, each of length n, and let d be the given distance bound for the solution string. Furthermore, let k denote the minimum number of wildcards in any input string. We present fixed-parameter algorithms for the parameters m, n, and k+d, respectively. On the other hand, we then show that such results are unlikely to exist when k and d are taken as single parameters. This is done by showing that the problem is NP-hard already for k=0 and d ≥2. Finally, to complement the latter result, we present a polynomial-time algorithm for the case of d=1. Apart from this last result, all other results hold even when the strings are over a general alphabet.

AB - The Closest String problem asks to find a string s which is not too far from each string in a set of m input strings, where the distance is taken as the Hamming distance. This well-studied problem has various applications in computational biology and drug design. In this paper, we introduce a new variant of Closest String where the input strings can contain wildcards that can match any letter in the alphabet, and the goal is to find a solution string without wildcards. We call this problem the Closest String with Wildcards problem, and we analyze it in the framework of parameterized complexity. Our study determines for each natural parameterization whether this parameterization yields a fixed-parameter algorithm, or whether such an algorithm is highly unlikely to exist. More specifically, let m denote the number of input strings, each of length n, and let d be the given distance bound for the solution string. Furthermore, let k denote the minimum number of wildcards in any input string. We present fixed-parameter algorithms for the parameters m, n, and k+d, respectively. On the other hand, we then show that such results are unlikely to exist when k and d are taken as single parameters. This is done by showing that the problem is NP-hard already for k=0 and d ≥2. Finally, to complement the latter result, we present a polynomial-time algorithm for the case of d=1. Apart from this last result, all other results hold even when the strings are over a general alphabet.

UR - http://www.scopus.com/inward/record.url?scp=84958542233&partnerID=8YFLogxK

U2 - https://doi.org/10.1007/978-3-319-07566-2_15

DO - https://doi.org/10.1007/978-3-319-07566-2_15

M3 - Entry for encyclopedia/dictionary

SN - 978-3-319-07566-2

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 140

EP - 149

BT - Combinatorial Pattern Matching

A2 - Kulikov, Alexander S.

A2 - Kuznetsov, Sergei O.

A2 - Pevzner, Pavel

PB - Springer

T2 - 25th Annual Symposium on Combinatorial Pattern Matching, CPM 2014

Y2 - 16 June 2014 through 18 June 2014

ER -