TY - GEN
T1 - Practical algorithms for pattern based linear regression
AU - Bannai, Hideo
AU - Hatano, Kohei
AU - Inenaga, Shunsuke
AU - Takeda, Masayuki
PY - 2005
Y1 - 2005
N2 - We consider the problem of discovering the optimal pattern from a set of strings and associated numeric attribute values. The goodness of a pattern is measured by the correlation between the number of occurrences of the pattern in each string, and the numeric attribute value assigned to the string. We present two algorithms based on suffix trees, that can find the optimal substring pattern in O(Nn) and O(N^2) time, respectively, where n is the number of strings and N is their total length. We further present a general branch and bound strategy that can be used when considering more complex pattern classes. We also show that combining the O(N^2) algorithm and the branch and bound heuristic increases the efficiency of the algorithm considerably.
AB - We consider the problem of discovering the optimal pattern from a set of strings and associated numeric attribute values. The goodness of a pattern is measured by the correlation between the number of occurrences of the pattern in each string, and the numeric attribute value assigned to the string. We present two algorithms based on suffix trees, that can find the optimal substring pattern in O(Nn) and O(N^2) time, respectively, where n is the number of strings and N is their total length. We further present a general branch and bound strategy that can be used when considering more complex pattern classes. We also show that combining the O(N^2) algorithm and the branch and bound heuristic increases the efficiency of the algorithm considerably.
UR - http://www.scopus.com/inward/record.url?scp=33745322333&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33745322333&partnerID=8YFLogxK
U2 - 10.1007/11563983_6
DO - 10.1007/11563983_6
M3 - Conference contribution
AN - SCOPUS:33745322333
SN - 3540292306
SN - 9783540292302
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 44
EP - 56
BT - Discovery Science - 8th International Conference, DS 2005, Proceedings
T2 - 8th International Conference on Discovery Science, DS 2005
Y2 - 8 October 2005 through 11 October 2005
ER -