WITAC - NewsReader Wikinews italian Corpus

WItaC is the Italian section of the NewsReader MEANTIME corpus. It consists of the Italian translation of 120 English Wikinews (http://en.wikinews.org/) articles on four topics (i.e. Airbus and Boeing, Apple Inc., Stock market, and General Motors, Chrysler and Ford) and has been annotated manually at multiple levels, including entities, events, event factuality, temporal information, semantic roles, and intra-document and cross-document event and entity coreference.

For the annotation guidelines and other information, please refer to the NewsReader website: NewsReader MEANTIME corpus.

WItaC has been used as test data for the Evalita FactA task (Event Factuality Annotation) at EVALITA 2016.

For the annotation guidelines and other information, please refer to the Evalita website: FactA@EVALITA2016.

Distribution license

As part of the NewsReader MEANTIME corpus, WItaC is licensed under a Creative Commons Attribution 4.0 International License.

If you use WItaC, please cite one of the following papers:

If you use WItaC for FactA@EVALITA2016, please cite the following paper:

  • Anne-Lyse Minard, Manuela Speranza, and Tommaso Caselli. The EVALITA 2016 Event Factuality Annotation Task (FactA). In Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian, Accademia University Press, Napoli, Italy, December 5-7, 2016.

Downloads

Obtain WItaC: Download

Obtain WItaC for FactA@EVALITA2016: Fill form

MEANTIME website

FACTA @ EVALITA 2016 website