During my presentation Machine Learning Empowered by Python, I created a real-time digit recognition demo. The training samples were black and white. I was looking for a way to train my system independently of the pixel colors so it could work on opposite contrast or any color (i.e.: color independent). In order to do so, I have forced a preprocessing to black-and-white, but I am pretty sure there is a better way that requires less manual intervention.

There might be a way to add constraints to the training so it doesn't train from pixel color.

asked May 19 '10 at 22:18

Francis%20Pi%C3%A9raut's gravatar image

Francis Piéraut
136457

edited May 26 '10 at 16:53

Joseph%20Turian's gravatar image

Joseph Turian ♦♦
579051125146


One Answer:

I think you should maybe look at the infinite MNIST dataset and how it was generated. There is a paper by Loosli et al that describes it.

As far as I know, they just applied randomly to the training set the transformations they wanted the classifier to be invariant to. In your case, as well as the ones thye apply, it seems that color changes could be a good idea.

answered Jun 30 '10 at 13:02

Alexandre%20Passos's gravatar image

Alexandre Passos ♦
2554154278421

edited Jun 30 '10 at 13:03

Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.