Stance and engagement in English and Spanish journalistic texts : towards a reliable annotation scheme for linguistic and computational purposes
Loading...
Official URL
Full text at PDC
Publication date
2018
Advisors (or tutors)
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
Jagiellonian University Press
Citation
Abstract
The paper describes the process of validating a reliable annotation scheme for the categories of Stance and Engagement in English and Spanish using a bilingual sample of English-Spanish journalistic texts extracted from the MULTINOT corpus (Lavid et al. 2015). The bilingual sample includes three different newspaper genres: news reports, editorials and letters to the editor. Following the generic annotation pipeline proposed by Hovy and Lavid (2010), the paper describes the different steps to validate an annotation scheme to capture the main features of Stance and Engagement and their realizations in English and Spanish. This includes the instantiation of the theoretical categories, the development of annotation guidelines and the performance of an interannotation agreement study on a small training corpus to measure the reliability of the proposed tags. The paper also describes the results of the annotation of a larger corpus using the validated scheme (Moratón 2015). This reveals interesting patterns of variation in the distribution of Stance and Engagement in the three newspaper genres, which can be fruitfully used for contrastive linguistic and computational purposes.