|
I'm interested in using MALLET for part-of-speech tagging or sentiment analysis. After reviewing the MALLET API documentation, it's pretty clear that the tagger accepts binary values when describing features. However, all of the examples I've been able to dig up use only binary labels along with binary features, so I wanted to confirm whether MALLET supports multinomial (multi-class) labels before mucking around with it. References I've looked at: the MALLET SimpleTagger API documentation and the MALLET quick start on the SimpleTagger tool. NLTK also has a wrapper built on top of MALLET, but a quick review of its source code didn't make it clear whether MALLET supports multinomial labels (see the NLTK-MALLET API documentation). Thanks in advance! ct |
|
SimpleTagger handles multiple classes. Here's a little test input and output you can use.

For training:
For testing:
With these two files, you can then train and tag your text:
And get the output:
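
As a rough illustration of the input format, here is a minimal Python sketch that builds a multi-class training file and an unlabeled test file, then assembles the SimpleTagger command lines. The tokens, feature names, tags, file names, and classpath are all made up for illustration. The layout (one token per line, space-separated features, label last, blank line between sequences) and the `--train` / `--model-file` flags follow the MALLET SimpleTagger quick start, so check them against your own installation.

```python
from pathlib import Path

# Hypothetical toy data: each token is (features, label).
# The labels are multi-class (DET, NOUN, VERB, ADJ), not binary.
TRAIN_SENTENCES = [
    [(["the", "LOWER"], "DET"),
     (["dog", "LOWER"], "NOUN"),
     (["barks", "LOWER", "SUFFIX_s"], "VERB")],
    [(["A", "CAPITALIZED"], "DET"),
     (["small", "LOWER"], "ADJ"),
     (["cat", "LOWER"], "NOUN"),
     (["sleeps", "LOWER", "SUFFIX_s"], "VERB")],
]


def to_simple_tagger_lines(sentences, with_labels=True):
    """Render sentences as SimpleTagger-style lines.

    Each token becomes one line of space-separated features, with the
    label as the last field when with_labels is True. A blank line
    closes every sequence.
    """
    lines = []
    for sentence in sentences:
        for features, label in sentence:
            fields = list(features)
            if with_labels:
                fields.append(label)
            lines.append(" ".join(fields))
        lines.append("")  # blank line marks the end of the sequence
    return lines


def write_data_file(path, sentences, with_labels=True):
    """Write the rendered lines to path, one per line."""
    text = "\n".join(to_simple_tagger_lines(sentences, with_labels)) + "\n"
    Path(path).write_text(text, encoding="utf-8")


def simple_tagger_command(classpath, model_file, data_file, train):
    """Build (but do not run) a SimpleTagger command line.

    classpath is an assumption: it must point at your MALLET classes
    and dependency jar.
    """
    cmd = ["java", "-cp", classpath, "cc.mallet.fst.SimpleTagger"]
    if train:
        cmd += ["--train", "true"]
    cmd += ["--model-file", model_file, data_file]
    return cmd


if __name__ == "__main__":
    # Hypothetical classpath; adjust to where MALLET lives on your machine.
    classpath = "mallet/class:mallet/lib/mallet-deps.jar"
    print("\n".join(to_simple_tagger_lines(TRAIN_SENTENCES)))
    print(" ".join(simple_tagger_command(classpath, "model.crf", "train.txt", True)))
    print(" ".join(simple_tagger_command(classpath, "model.crf", "test.txt", False)))
```

To try it, write the training data with `write_data_file("train.txt", TRAIN_SENTENCES)`, write a test file with `with_labels=False`, and run the two printed commands from a shell. The point of the sketch is only that the label column can hold any number of distinct tags.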
|