Multilingual resources for European languages:Contributions of the CRATER project

McEnery, T. and Wilson, A. and SáNchez-LeóN, F. and Nieto-Serrano, A. (1997) Multilingual resources for European languages:Contributions of the CRATER project. Literary and Linguistic Computing, 12 (4). pp. 219-226. ISSN 0268-1145

Full text not available from this repository.

Abstract

Here we describe the contributions of the CRATER project to the development of multilingual resources for European languages. The project has developed a trilingual parallel aligned corpus of one million tokens each of Spanish, French, and English. The corpus has been part-of-speech tagged and lemmatized. Tools for the alignment of multi-lingual corpora at the sentence and word levels hae been developed, which are of general significance to multilingual corpus linguistics. The Xerox part-of-speech tagger has also been retrained for Spanish, with important findings for part-of-speech tagging generally.

Item Type:
Journal Article
Journal or Publication Title:
Literary and Linguistic Computing
Uncontrolled Keywords:
/dk/atira/pure/subjectarea/asjc/3300/3310
Subjects:
ID Code:
134441
Deposited By:
Deposited On:
22 Jun 2019 08:53
Refereed?:
Yes
Published?:
Published
Last Modified:
01 Jan 2020 11:44