Multilingual resources for European languages: Contributions of the CRATER project

McEnery, T. and Wilson, A. and Sánchez-León, F. and Nieto-Serrano, A. (1997) Multilingual resources for European languages: Contributions of the CRATER project. Literary and Linguistic Computing, 12 (4). pp. 219-226. ISSN 0268-1145

Full text not available from this repository.

Abstract

Here we describe the contributions of the CRATER project to the development of multilingual resources for European languages. The project has developed a trilingual parallel aligned corpus of one million tokens each of Spanish, French, and English. The corpus has been part-of-speech tagged and lemmatized. Tools for the alignment of multilingual corpora at the sentence and word levels have been developed, which are of general significance to multilingual corpus linguistics. The Xerox part-of-speech tagger has also been retrained for Spanish, with important findings for part-of-speech tagging generally.

Item Type: Journal Article
Journal or Publication Title: Literary and Linguistic Computing
Departments: Faculty of Arts & Social Sciences > Linguistics & English Language
ID Code: 134441
Deposited By: ep_importer_pure
Deposited On: 22 Jun 2019 08:53
Refereed?: Yes
Published?: Published
Last Modified: 30 Sep 2019 21:32
URI: https://eprints.lancs.ac.uk/id/eprint/134441
