Journal: Conference on Empirical Methods in Natural Language Processing
Programming languages: C, Shell
parse tree with CoNLL format
Project website: http://www.cs.cmu.edu/~ark/TweetNLP/
We describe a new dependency parser for English tweets, TWEEBOPARSER. The parser builds on several contributions: new syntactic annotations for a corpus of tweets (TWEEBANK), with conventions informed by the domain; adaptations to a statistical parsing algorithm; and a new approach to exploiting out-of-domain Penn Treebank data.