Creating and analysing a multimodal corpus of news texts with Google Cloud Vision's automatic image tagger

Baker, Paul and Collins, Luke (2023) Creating and analysing a multimodal corpus of news texts with Google Cloud Vision's automatic image tagger. Applied Corpus Linguistics, 3 (1): 100043. ISSN 2666-7991

Full text not available from this repository.

Abstract

This study describes the creation and analysis of a small multimodal corpus of British news articles about obesity, where tags were assigned to images in the articles using the automatic tagger Google Cloud Vision. In order to illustrate the potential for analysis of image tags, the corpus analysis tool WordSmith was used to identify differences between newspapers in the ways that obesity was framed. Three forms of analysis were carried out – the first simply compared keywords across the newspapers, the second examined key visual tags and their collocates associated with each newspaper, while the third incorporated a combined analysis of words and image tags. The three analyses produced complementary findings, indicating the value in using Google Cloud Vision in creating and analysing multimodal corpora. The paper ends by reflecting on the method undertaken, while considering how additional research could improve our understanding of image tagging.
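The comparison of key visual tags across newspapers can be illustrated with a minimal sketch. It assumes the image tags have already been exported as plain lists of strings, one list per newspaper, and it uses the log-likelihood statistic as the keyness measure. Both are assumptions: the abstract does not say which keyness settings were used in WordSmith, and the function names, threshold, and tag values below are hypothetical and are not data from the study.

```python
import math
from collections import Counter


def log_likelihood(freq_a, total_a, freq_b, total_b):
    """Log-likelihood (G2) for one item observed in two corpora."""
    combined = freq_a + freq_b
    grand_total = total_a + total_b
    expected_a = total_a * combined / grand_total
    expected_b = total_b * combined / grand_total
    g2 = 0.0
    for observed, expected in ((freq_a, expected_a), (freq_b, expected_b)):
        if observed > 0:  # a zero count contributes nothing to the sum
            g2 += observed * math.log(observed / expected)
    return 2 * g2


def key_tags(target_tags, reference_tags, threshold=3.84):
    """Return (tag, score) pairs that are key in the target corpus.

    threshold=3.84 corresponds to p < 0.05 with one degree of freedom;
    it is an illustrative default, not a setting taken from the paper.
    """
    target, reference = Counter(target_tags), Counter(reference_tags)
    total_t, total_r = sum(target.values()), sum(reference.values())
    results = []
    for tag, freq in target.items():
        ref_freq = reference.get(tag, 0)
        # Keep only tags that are relatively more frequent in the target.
        if freq / total_t <= ref_freq / total_r:
            continue
        score = log_likelihood(freq, total_t, ref_freq, total_r)
        if score >= threshold:
            results.append((tag, score))
    return sorted(results, key=lambda pair: pair[1], reverse=True)


# Hypothetical tag lists for two newspapers (illustrative values only).
paper_a = ["food"] * 30 + ["person"] * 10
paper_b = ["food"] * 5 + ["person"] * 35
for tag, score in key_tags(paper_a, paper_b):
    print(tag, round(score, 2))
```

Swapping the two arguments to `key_tags` gives the tags that are key in the second newspaper relative to the first. The same function could be applied to word tokens, so words and image tags can be compared with one procedure.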

Item Type:

Journal Article

Journal or Publication Title:

Applied Corpus Linguistics

Uncontrolled Keywords:

Research Output Funding/yes_externally_funded

Subjects:

news; annotation; visual; image; obesity; discourse; yes - externally funded; no

Departments:

Faculty of Arts & Social Sciences > Linguistics & English Language

ID Code:

186268

Deposited By:

ep_importer_pure

Deposited On:

14 Feb 2023 11:45

Refereed?:

Yes

Published?:

Published

Last Modified:

11 Dec 2025 07:14

URI:

https://eprints.lancs.ac.uk/id/eprint/186268