Text and speech corpora for natural language processing and corpus linguistics

UNSPECIFIED (2025) Text and speech corpora for natural language processing and corpus linguistics. Scientific Data, Specia. ISSN 2052-4463

Full text not available from this repository.

Abstract

Corpus Linguistics (CL) and Natural Language Processing (NLP) are two of the transformative forces in research across the sciences and humanities, reshaping how insights are gleaned from vast text and speech datasets. Their applications span the natural, medical, social and applied sciences, leading the cutting edge in fields such as healthcare diagnostics, biomedicine, environmental science, and computer vision. This Collection presents a series of annotated text and speech corpora alongside linguistic models tailored for CL and NLP applications. These resources aim to enrich the arsenals of CL and NLP users and facilitate interdisciplinary research.

Item Type:
Journal Article
Journal or Publication Title:
Scientific Data
Uncontrolled Keywords:
Research Output Funding/no_not_funded
Subjects:
?? natural language processingcorpus linguisticscorporaartificial intelligencemachine learningbioinformaticsno - not fundedgeneral ??
ID Code:
231701
Deposited By:
Deposited On:
02 Sep 2025 06:33
Refereed?:
Yes
Published?:
Published
Last Modified:
02 Sep 2025 06:33