I want to calculate the probability of success in a task.

I have thousands of instances of input attributes and (success/failure) results. The tools I am using (Weka classifiers) predict success/failure outcome for new instances but not probabilities of success.

My goal is distribute resources among new tasks and for this I would like to estimate probabilities of success. Is there a straightforward to way to do this using the binary classifier?

asked Jan 23 '11 at 05:03

peter_c_william's gravatar image

peter_c_william
14114


3 Answers:

Many classifiers give you a numeric value (score), which is converted to a yes/no decision by comparing it with a threshold. There are a few methods which convert the score to probability, e.g. Platt scaling. See "Predicting Good Probabilities With Supervised Learning".

Weka's classifier API has a distributionForInstance method which is supposed to return probabilities. I think, at least for SMO classifier Weka has an option of implementing it by fitting a logistic regression to SVM output which is analogous to Platt's scaling method.

answered Jan 23 '11 at 07:44

ivank's gravatar image

ivank
20646

If the results are just YES or NO you only have the probability of 1 for YES and 0 for NO if your data point has been classified as YES, and 1 for NO and 0 for YES if you data point has been classified as NO. There is no NES or YO ;) and hence you won't have any fuzzy confidence for these values.

The answer of ivank is very good and is the answer to your question. But the confidence for a decsion also depends heavily on the type of model you use for your classification problem.

answered Nov 21 '11 at 06:50

Tom's gravatar image

Tom
564510

-1

How Can i get the probabilities from a binary classification if the results of the models ares yes or no? Thank You!

answered Nov 17 '11 at 07:13

Chuchi%20Come's gravatar image

Chuchi Come
0

Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.