Revision history[back]
click to hide/show revision 1
Revision n. 1

Jul 06 '10 at 02:18

Aditya%20Mukherji's gravatar image

Aditya Mukherji
2251611

The Entropy of a feature might be a good way to assess its relevance to your classification task.
A feature which occurs uniformly across classes has high entropy and does not really provide much information to help you in your classification.
OTOH one which occurs only in one or just a few classes has a lower entropy, i.e. is more opinionated, so might be a better training feature.
Eventually you could start adding additional features by figuring out which ones can help you discriminate between the non-diagonal pairs of your confusion matrix which show a high misclassification rate.

click to hide/show revision 2
Revision n. 2

Jul 06 '10 at 02:19

Aditya%20Mukherji's gravatar image

Aditya Mukherji
2251611

The Entropy Entropy of a feature might be a good way to assess its relevance to your classification task.
A feature which occurs uniformly across classes has high entropy and does not really provide much information to help you in your classification.
OTOH one which occurs only in one or just a few classes has a lower entropy, i.e. is more opinionated, so might be a better training feature.
Eventually you could start adding additional features by figuring out which ones can help you discriminate between the non-diagonal pairs of your confusion matrix which show a high misclassification rate.

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.