TY - JOUR
T1 - Multiclass Laplacian support vector machine with functional analysis of variance decomposition
AU - Park, Beomjin
AU - Park, Changyi
N1 - Publisher Copyright:
© 2023 Elsevier B.V.
PY - 2023/11
Y1 - 2023/11
N2 - In classification problems, acquiring a sufficient amount of labeled samples sometimes proves expensive and time-consuming, while unlabeled samples are relatively easier to obtain. The Laplacian Support Vector Machine (LapSVM) is one of the successful methods that learn better classification functions by incorporating unlabeled samples. However, since LapSVM was originally designed for binary classification, it can not be applied directly to multiclass classification problems commonly encountered in practice. Thus we derive an extension of LapSVM to multiclass classification problems using an appropriate multiclass formulation. Another problem with LapSVM is that irrelevant variables easily degrade classification performance. The irrelevant variables can increase the variance of predicted values and make the model difficult to interpret. Therefore, this paper also proposes the multiclass LapSVM with functional analysis of variance decomposition to identify relevant variables. Through comprehensive simulations and real-world datasets, we demonstrate the efficiency and improved classification performance of the proposed methods.
AB - In classification problems, acquiring a sufficient amount of labeled samples sometimes proves expensive and time-consuming, while unlabeled samples are relatively easier to obtain. The Laplacian Support Vector Machine (LapSVM) is one of the successful methods that learn better classification functions by incorporating unlabeled samples. However, since LapSVM was originally designed for binary classification, it can not be applied directly to multiclass classification problems commonly encountered in practice. Thus we derive an extension of LapSVM to multiclass classification problems using an appropriate multiclass formulation. Another problem with LapSVM is that irrelevant variables easily degrade classification performance. The irrelevant variables can increase the variance of predicted values and make the model difficult to interpret. Therefore, this paper also proposes the multiclass LapSVM with functional analysis of variance decomposition to identify relevant variables. Through comprehensive simulations and real-world datasets, we demonstrate the efficiency and improved classification performance of the proposed methods.
KW - Laplacian support vector machine
KW - Multiclass classification
KW - Semi-supervised learning
KW - Variable selection
UR - http://www.scopus.com/inward/record.url?scp=85164303482&partnerID=8YFLogxK
U2 - 10.1016/j.csda.2023.107814
DO - 10.1016/j.csda.2023.107814
M3 - Article
AN - SCOPUS:85164303482
SN - 0167-9473
VL - 187
JO - Computational Statistics and Data Analysis
JF - Computational Statistics and Data Analysis
M1 - 107814
ER -