TY - GEN
T1 - Subclass-oriented Dimension Reduction with constraint transformation and manifold regularization
AU - Tong, Bin
AU - Suzuki, Einoshin
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2010
Y1 - 2010
N2 - We propose a new method, called Subclass-oriented Dimension Reduction with Pairwise Constraints (SODRPaC), for dimension reduction on high dimensional data. Current linear semi-supervised dimension reduction methods using pairwise constraints, e.g., must-link constraints and cannot-link constraints, can not handle appropriately the data of multiple subclasses where the points of a class are separately distributed in different groups. To illustrate this problem, wparticularly classify the must-link constraint into two categories, which are theinter-subclass must-link constraint and the intra-subclass must-link constraint, respectively. We argue that handling the inter-subclass must-link constraint is challenging for current discriminant criteria. Inspired by the above observation and the cluster assumption that nearby points are possible in the same class, we carefully transform must-link constraints into cannot-link constraints, and then propose a new discriminant criterion by employing the cannot-link constraints and the compactness of shared nearest neighbors. For the reason that the local data structure is one of the most significant features for the data of multiple subclasses, manifold regularization is also incorporated in our dimension reduction framework. Extensive experiments on both synthetic and practical data sets illustrate the effectiveness of our method.
AB - We propose a new method, called Subclass-oriented Dimension Reduction with Pairwise Constraints (SODRPaC), for dimension reduction on high dimensional data. Current linear semi-supervised dimension reduction methods using pairwise constraints, e.g., must-link constraints and cannot-link constraints, can not handle appropriately the data of multiple subclasses where the points of a class are separately distributed in different groups. To illustrate this problem, wparticularly classify the must-link constraint into two categories, which are theinter-subclass must-link constraint and the intra-subclass must-link constraint, respectively. We argue that handling the inter-subclass must-link constraint is challenging for current discriminant criteria. Inspired by the above observation and the cluster assumption that nearby points are possible in the same class, we carefully transform must-link constraints into cannot-link constraints, and then propose a new discriminant criterion by employing the cannot-link constraints and the compactness of shared nearest neighbors. For the reason that the local data structure is one of the most significant features for the data of multiple subclasses, manifold regularization is also incorporated in our dimension reduction framework. Extensive experiments on both synthetic and practical data sets illustrate the effectiveness of our method.
UR - http://www.scopus.com/inward/record.url?scp=79956312031&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79956312031&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-13672-6_1
DO - 10.1007/978-3-642-13672-6_1
M3 - Conference contribution
AN - SCOPUS:79956312031
SN - 3642136710
SN - 9783642136719
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 1
EP - 13
BT - Advances in Knowledge Discovery and Data Mining - 14th Pacific-Asia Conference, PAKDD 2010, Proceedings
T2 - 14th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2010
Y2 - 21 June 2010 through 24 June 2010
ER -