I have a training data set for a binary classification problem. There exist two possible scenarios, one is that all of the training data set are labeled as positive; another one is that the training data set includes labeled positive ones and labeled negative ones.

Assume that I use this training data set to train a decision tree. How do these two different scenarios affect the trained tree?

Moreover, if the ratio of positive and negative ones changes, how does this change affect the built tree model?

asked Jan 02 at 17:46

surfreta's gravatar image

surfreta
5236

Be the first one to answer this question!
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.