We have annotated this corpus as part of our participation to the NEEL-IT task at EVALITA 2016 (http://www.evalita.it/2016/tasks/neel-it) to use it as an additional training set. We therefore followed the guidelines of the NEEL-IT task (NEEL-it guidelines). The distributed corpus is composed of 1614 annotated tweets, for a total of 3127 annotated entities.
NE-annotated-tweets-AL is licensed under a Creative Commons Attribution 4.0 International License.
Publications or presentations containing research results obtained through the use of NE-annotated-tweets-AL should cite the following reference:
Anne-Lyse Minard, Mohammed R. H. Qwaider, and Bernardo Magnini. 2016. FBK-NLP at NEEL-IT: Active Learning for Domain Adaptation. In Proceedings of the 5th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian (EVALITA 2016).
To obtain the annotated tweets, please fill the request form with your data (they will be maintained in a database at FBK): Fill form