Building a corpus of spoken sylheti.

Baker, J. P. and Lie, M. and McEnery, A. M. and Sebba, Mark (2000) Building a corpus of spoken sylheti. Literary and Linguistic Computing, 15 (4). pp. 421-432. ISSN 1477-4615

Full text not available from this repository.


This paper describes the construction of a corpus of spoken Sylheti. The corpus was created to examine difficulties in the creation of spoken language corpora in which features such as code switching (simply described here as the process of switching from one language to another during the course of an interaction; however, this description disguises a host of situations, which will be examined in the paper) are common. The paper also presents a transliteration scheme for Sylheti based around the Roman alphabet.

Item Type: Journal Article
Journal or Publication Title: Literary and Linguistic Computing
Uncontrolled Keywords: /dk/atira/pure/researchoutput/libraryofcongress/p1
Departments: Faculty of Arts & Social Sciences > Linguistics & English Language
ID Code: 1813
Deposited By: Professor Tony McEnery
Deposited On: 25 Feb 2008 16:12
Refereed?: Yes
Published?: Published
Last Modified: 01 Jan 2020 06:10

Actions (login required)

View Item View Item