Mair, C. and Hundt, M. and Leech, Geoffrey and Smith, N. (2003) Short term diachronic shifts in part-of-speech frequencies: a comparison of the tagged LOB and F-LOB corpora. International Journal of Corpus Linguistics, 7 (2). pp. 245-264. ISSN 1569-9811
Full text not available from this repository.Abstract
The paper presents a comparison of tag frequencies in two matching one-million word reference corpora of British standard English, the 1961 LOB-corpus and its 1991 “clone” produced at Freiburg. Both corpora were tagged using a version of the CLAWS part-of-speech-tagger developed at Lancaster, and part of the material was post-edited manually in Freiburg to assess the accuracy of the automatic procedure. The comparison of tag frequencies is an essential complement to work on recent linguistic change carried out on the untagged material, because this work has been based on the – so far unverified – assumption that tag frequencies have remained constant over the thirty-year period in question. In addition, the paper discusses some common and partly contradictory claims about the prevalence of a “nominal” style in present-day written English. It is shown that while part-of-speech frequencies have not remained constant over the period investigated, the shifts are usually not big enough to invalidate the results obtained in analyses of the untagged material. With regard to style, the material shows a significant rise in the frequency of nouns, which, however, is not paralleled by a corresponding decrease in verbs.