Supporting the corpus-based study of Shakespeare’s language: Enhancing a corpus of the First Folio

Culpeper, Jonathan and Hardie, Andrew and Demmen, Jane and Hughes, Jennifer and Timperley, Matt (2021) Supporting the corpus-based study of Shakespeare’s language: Enhancing a corpus of the First Folio. ICAME Journal, 45 (1). pp. 37-86. ISSN 1502-5462

Full text not available from this repository.

Abstract

This article explores challenges in the corpus linguistic analysis of Shakespeare’s language, and Early Modern English more generally, with particular focus on elaborating possible solutions and the benefits they bring. An account of work that took place within the Encyclopedia of Shakespeare’s Language Project (2016–2019) is given, which discusses the development of the project’s data resources, specifically, the Enhanced Shakespearean Corpus. Topics covered include the composition of the corpus and its subcomponents; the structure of the XML markup; the design of the extensive character metadata; and the word-level corpus annotation, including spelling regularisation, part-of-speech tagging, lemmatisation and semantic tagging. The challenges that arise from each of these undertakings are not exclusive to a corpus-based treatment of Shakespeare’s plays but it is in the context of Shakespeare’s language that they are so severe as to seem almost insurmountable. The solutions developed for the Enhanced Shakespearean Corpus – often combining automated manipulation with manual interventions, and always principled – offer a way through.

Item Type:
Journal Article
Journal or Publication Title:
ICAME Journal
ID Code:
165436
Deposited On:
02 Feb 2022 10:41
Refereed?:
Yes
Published?:
Published
Last Modified:
04 May 2022 02:38