Lancaster EPrints

Automatic error tagging of spelling mistakes in learner corpora

Rayson, Paul and Baron, Alistair (2011) Automatic error tagging of spelling mistakes in learner corpora. In: A Taste for Corpora. Studies in Corpus Linguistics, 45 . John Benjamins, Amsterdam, pp. 109-126. ISBN 978 90 272 0350 2

Full text not available from this repository.

Abstract

Manual error tagging of learner corpus data is time consuming and creates a bottleneck in the analysis of learner corpora. This had led researchers to apply techniques from the area of natural language processing to assist in the automatic analysis of such data. This chapter presents the novel application of a hybrid approach to the detection of spelling errors in learner data. The Variant Detector (VARD) software was developed to match historical spelling variants to modern equivalents with the intention of improving the accuracy and robustness of corpus linguistics techniques when applied to historical corpora. Here, we describe its application to detect spelling errors in written learner corpora consisting of 50,000 words from each of three learner backgrounds (French, German and Spanish).

Item Type: Contribution in Book/Report/Proceedings
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments: Faculty of Science and Technology > School of Computing & Communications
ID Code: 60200
Deposited By: ep_importer_pure
Deposited On: 22 Nov 2012 11:54
Refereed?: No
Published?: Published
Last Modified: 25 Mar 2014 23:50
Identification Number:
URI: http://eprints.lancs.ac.uk/id/eprint/60200

Actions (login required)

View Item