|
I am interested in ways if treating an imbalanced regression problem. By imbalanced regression I mean that I have few samples for some regions of my input and target space and lots of features for other regions. I don't know a priori about the exact distribution and how they contribute to the overall error. All I know that the distribution I get as training data is not the distribution I expect during the runtime of the system. I'd be interested in ideas on how to treat this problem robustly, theoretical work as well as quick heuristics/hacks that you found work well. So far I have been thinking about:
However I have no practical experience. Anyone encountered such a problem and successfully solved it? |