Simaki, Vasiliki and Paradis, Carita and Skeppstedt, Maria and Sahlgren, Magnus and Kucher, Kostiantyn and Kerren, Andreas (2020) Annotating speaker stance in discourse: the Brexit Blog Corpus. Corpus Linguistics and Linguistic Theory. ISSN 1613-7027
Published Version, available under a Creative Commons Attribution-NonCommercial-NoDerivs license.
Abstract
The aim of this study is to explore the possibility of identifying speaker stance in discourse, to provide an analytical resource for it, and to evaluate the level of agreement across speakers. We also explore to what extent language users agree about what kinds of stances are expressed in natural language use, or whether their interpretations diverge. To perform this task, a comprehensive cognitive-functional framework of ten stance categories was developed based on previous work on speaker stance in the literature. A corpus of opinionated texts, the Brexit Blog Corpus (BBC), was compiled. An analytical protocol and interface (Active Learning and Visual Analytics) for the annotations were set up, and the data were independently annotated by two annotators. The annotation procedure, the annotation agreements and the co-occurrence of more than one stance in the utterances are described and discussed. The careful, analytical annotation process has returned satisfactory inter- and intra-annotation agreement scores, resulting in a gold standard corpus, the final version of the BBC.