The UCREL semantic analysis system.

Rayson, P. and Archer, Dawn and Piao, S. and McEnery, A. M. (2004) The UCREL semantic analysis system. In: Proceedings of the beyond named entity recognition semantic labelling for NLP tasks workshop, Lisbon, Portugal, 2004. UNSPECIFIED, PRT, pp. 7-12.

[img]
Preview
PDF (usas_lrec04ws)
usas_lrec04ws.pdf - Accepted Version

Download (93kB)

Abstract

The UCREL semantic analysis system (USAS) is a software tool for undertaking the automatic semantic analysis of English spoken and written data. This paper describes the software system, and the hierarchical semantic tag set containing 21 major discourse fields and 232 fine-grained semantic field tags. We discuss the manually constructed lexical resources on which the system relies, and the seven disambiguation methods including part-of-speech tagging, general likelihood ranking, multi-word-expression extraction, domain of discourse identification, and contextual rules. We report an evaluation of the accuracy of the system compared to a manually tagged test corpus on which the USAS software obtained a precision value of 91%. Finally, we make reference to the applications of the system in corpus linguistics, content analysis, software engineering, and electronic dictionaries

Item Type: Contribution in Book/Report/Proceedings
Uncontrolled Keywords: /dk/atira/pure/researchoutput/libraryofcongress/p1
Subjects:
Departments: Faculty of Science and Technology > School of Computing & Communications
Faculty of Arts & Social Sciences > Linguistics & English Language
ID Code: 1783
Deposited By: Professor Tony McEnery
Deposited On: 25 Feb 2008 16:21
Refereed?: Yes
Published?: Published
Last Modified: 17 Aug 2019 23:22
URI: https://eprints.lancs.ac.uk/id/eprint/1783

Actions (login required)

View Item View Item