TY - GEN
T1 - Weighted LCS
AU - Amir, Amihood
AU - Gotthilf, Zvi
AU - Shalom, B. Riva
PY - 2009
Y1 - 2009
N2 - The Longest Common Subsequence (LCS) of two strings A and B is a well studied problem having awide range of applications. When each symbol of the input strings is assigned a positive weight the problem becomes the Heaviest Common Subsequence (HCS) problem. In this paper we consider a different version of weighted LCS on Position Weight Matrices (PWM). The Position Weight Matrix was introduced as a tool to handle a set of sequences that are not identical, yet, have many local similarities. Such a weighted sequence is a 'statistical image' of this set where we are given the probability of every symbol's occurrence at every text location. We consider two possible definitions of LCS on PWM. For the first, we solve the weighted LCS problem of z sequences in time O(znz+1). For the second, we prove NP-hardness and provide an approximation algorithm.
AB - The Longest Common Subsequence (LCS) of two strings A and B is a well studied problem having awide range of applications. When each symbol of the input strings is assigned a positive weight the problem becomes the Heaviest Common Subsequence (HCS) problem. In this paper we consider a different version of weighted LCS on Position Weight Matrices (PWM). The Position Weight Matrix was introduced as a tool to handle a set of sequences that are not identical, yet, have many local similarities. Such a weighted sequence is a 'statistical image' of this set where we are given the probability of every symbol's occurrence at every text location. We consider two possible definitions of LCS on PWM. For the first, we solve the weighted LCS problem of z sequences in time O(znz+1). For the second, we prove NP-hardness and provide an approximation algorithm.
UR - http://www.scopus.com/inward/record.url?scp=77950178534&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-10217-2_7
DO - 10.1007/978-3-642-10217-2_7
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:77950178534
SN - 3642102166
SN - 9783642102165
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 36
EP - 47
BT - Combinatorial Algorithms - 20th International Workshop, IWOCA 2009, Revised Selected Papers
T2 - 20th International Workshop on Combinatorial Algorithms, IWOCA 2009
Y2 - 28 June 2009 through 2 July 2009
ER -