1
1

I have a few doubts in understanding SVMs intuitively. Assume we have trained a SVM model for classification using some standard tool like SVMLight or LibSVM.

  1. When we use this model for prediction on test data, the model generates a file having "alpha" values for each test point. If alpha value is positive the test point belongs to Class 1, else it belongs to Class 2. Now, can we say that a test point with greater "alpha" value belongs to corresponding class with "higher" probability?

  2. Similar to first question, when we have a SVM trained. The SV's lie very near to the hyper-plane. So does that mean that SV's belong to that class with high probability? Can we relate the probability of a point belonging to a class with it's distance from the "hyperplane"? Does "alpha" value represent distance from "hyperplane"?

Thanks for your input.

This question is marked "community wiki".

asked Sep 01 '11 at 09:47

Amit%20Chandak's gravatar image

Amit Chandak
1122


2 Answers:
  1. SVMs do not estimate probabilities, so technically you can't make a statement about the probabilities of points belonging to classes given the output of an SVM. However, it is common to use the alpha output and the true labels of the training data to train a logistic regression classifier from which you can get well-calibrated probability estimates out of the SVM. Also, libsvm can compute these probabilities itself, look it up in its manual. It requires retraining the SVM.

  2. The SVs essentially are the hyperplane in a way (the hyperplane is a linear combination of the SVs). This means that these are precisely the points where the classifier is in most uncertainty about the classification (so if you change the features values of the SVs by epsilon the classifier changes; this generally does not happen for every point). In you want to think about the alpha as a probability you need to think the opposite, and the "probability" is higher the further a point is from the hyperplane.

If you ask for my opinion you should really use the libSVM probability estimation.

This answer is marked "community wiki".

answered Sep 01 '11 at 14:06

Alexandre%20Passos's gravatar image

Alexandre Passos ♦
1896744214334

Quote from "Sparse Probabilistic Classifiers" by Romain Hérault, Yves Grandvalet

"The scores returned by support vector machines are often used as a confidence measures in the classification of new examples. However, there is no theoretical argument sustaining this practice. Thus, when classification uncertainty has to be assessed, it is safer to resort to classifiers estimating conditional probabilities of class labels."

More details in the paper http://www.machinelearning.org/proceedings/icml2007/papers/233.pdf

This answer is marked "community wiki".

answered Sep 01 '11 at 10:21

come's gravatar image

come
712

Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.