CMU ARK Twitter Part-of-Speech Tagger
Year: 2,013
Journal: Conference of the North American Chapter of the Association for Computational Linguistics
Languages: English
Programming languages: Java, Python, Shell
Input data:
Tweets
Output data:
tags
Project website: http://www.cs.cmu.edu/~ark/TweetNLP/
We consider the problem of part-of-speech tagging for informal, online conversational text. We systematically evaluate the use of large-scale unsupervised word clustering and new lexical features to improve tagging accuracy.