I have collected a number of Amazon ratings and comments. Each comment is about a product and is accompanied by a numerical rating (from 1-5). I am wondering if there are articles about inferring the numerical rating from the text? Also I am wondering what would be the best way to properly featurize the text (bag-of-words, bigrams, POS, ...) Thank you

asked Jan 03 '12 at 20:25

Mark%20Alen's gravatar image

Mark Alen
1323234146


One Answer:

This sounds like a sentiment analysis problem, which can probably be extended to a regression analysis without too much trouble (in the same way that classification can be in some circumstances).

This paper seems to look at your problem.

answered Jan 03 '12 at 20:34

Robert%20Layton's gravatar image

Robert Layton
1625122637

Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.