Survey on Thai NLP Language Resources and Tools

Arreerard, Ratchakrit and Mander, Stephen and Piao, Scott (2022) Survey on Thai NLP Language Resources and Tools. In: Language Resources and Evaluation Conference LREC 2022 Proceedings. European Language Resources Association (ELRA), FRA, pp. 6495-6505. ISBN 9791095546726

[img]
Text (lrec2022 thai nlp survey paper)
lrec2022_thai_nlp_survey.pdf - Published Version
Available under License Creative Commons Attribution-NonCommercial.

Download (538kB)

Abstract

Over the past decades, Natural Language Processing (NLP) research has been expanding to cover more languages. Recently particularly, NLP community has paid increasing attention to under-resourced languages. However, there are still many languages for which NLP research is limited in terms of both language resources and software tools. Thai language is one of the under-resourced languages in the NLP domain, although it is spoken by nearly 70 million people globally. In this paper, we report on our survey on the past development of Thai NLP research to help understand its current state and future research directions. Our survey shows that, although Thai NLP community has achieved a significant achievement over the past three decades, particularly on NLP upstream tasks such as tokenisation, research on downstream tasks such as syntactic parsing and semantic analysis is still limited. But we foresee that Thai NLP research will advance rapidly as richer Thai language resources and more robust NLP techniques become available.

Item Type:
Contribution in Book/Report/Proceedings
Uncontrolled Keywords:
/dk/atira/pure/subjectarea/asjc/1700/1702
Subjects:
ID Code:
168535
Deposited By:
Deposited On:
21 Jun 2022 16:10
Refereed?:
Yes
Published?:
Published
Last Modified:
03 Feb 2023 01:55