McEnery, T. and Wilson, A. and Sánchez-León, F. and Nieto-Serrano, A. (1997) Multilingual resources for European languages: Contributions of the CRATER project. Literary and Linguistic Computing, 12 (4). pp. 219-226. ISSN 0268-1145
Full text not available from this repository.

Abstract
Here we describe the contributions of the CRATER project to the development of multilingual resources for European languages. The project has developed a trilingual parallel aligned corpus of one million tokens each of Spanish, French, and English. The corpus has been part-of-speech tagged and lemmatized. Tools for the alignment of multilingual corpora at the sentence and word levels have been developed, which are of general significance to multilingual corpus linguistics. The Xerox part-of-speech tagger has also been retrained for Spanish, with important findings for part-of-speech tagging generally.