|
Is there any benefit to running a dimensionality reduction (e.g. Kernel PCA) before using the SVM algorithm? Naively, it seems like feeding Kernel-PCA-preprocessed data to a linear SVM is a similar process to feeding the raw data to kernel-SVM. I'm trying to develop some intuition about whether and how the two strategies differ. |
There might be a benefit if you have unlabeled data, as that will implicitly perform some kind of semi supervised learning.