What is common practice for dealing with the ambiguous part-of-speech tags (things like "NNS|NN", "VBD|VBN", etc.) found in corpora such as the Penn Treebank? I'm specifically interested in how people typically handle them when reporting accuracy, precision, recall, and other performance metrics in publications, but I would also like to hear how they come into play in other parts of the process, such as during learning.
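To make the question concrete, here is a minimal sketch of two scoring conventions one could imagine. Both are assumptions for illustration, not established practice: a "lenient" score counts a prediction as correct if it matches any alternative in the gold tag, and a "strict" score keeps only the first alternative. The function names and the toy data are hypothetical.

```python
def split_tag(tag):
    """Split a gold tag such as 'VBD|VBN' into its list of alternatives."""
    return tag.split("|")


def accuracy(gold, predicted, lenient=True):
    """Token-level accuracy against gold tags that may be ambiguous.

    lenient=True:  a prediction is correct if it equals ANY gold alternative.
    lenient=False: only the FIRST gold alternative counts (an arbitrary choice).
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        raise ValueError("cannot score an empty sequence")
    correct = 0
    for g, p in zip(gold, predicted):
        alternatives = split_tag(g)
        if lenient:
            correct += p in alternatives
        else:
            correct += p == alternatives[0]
    return correct / len(gold)


# Toy data, invented for illustration only.
gold = ["DT", "NNS|NN", "VBD|VBN", "IN"]
pred = ["DT", "NN", "VBN", "IN"]

lenient_score = accuracy(gold, pred, lenient=True)
strict_score = accuracy(gold, pred, lenient=False)
print(lenient_score, strict_score)
```

The same tagger output scores very differently under the two conventions, which is why I'd like to know what is actually reported in papers.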

asked Aug 09 '13 at 16:06


alto


User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.