A Survey of Multimodal Sarcasm Detection

Farabi, Shafkat and Ranasinghe, Tharindu and Kanojia, Diptesh and Kong, Yu and Zampieri, Marcos (2024) A Survey of Multimodal Sarcasm Detection. In: Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence. International Joint Conferences on Artificial Intelligence Organization, KOR, pp. 8020-8028. ISBN 9781956792041

IJCAI_2024_Multimodal_Sarcasm_Detection_A_Survey.pdf - Accepted Version
Available under License Creative Commons Attribution.

Abstract

Sarcasm is a rhetorical device used to convey the opposite of the literal meaning of an utterance. It is widely used on social media and in other forms of computer-mediated communication, which motivates the use of computational models to identify it automatically. While the clear majority of approaches to sarcasm detection rely on text only, detecting sarcasm often requires additional information present in tonality, facial expressions, and contextual images. This has led to the introduction of multimodal models, which make it possible to detect sarcasm across modalities such as audio, images, text, and video. In this paper, we present the first comprehensive survey of multimodal sarcasm detection (henceforth MSD) to date. We survey papers on the topic published between 2018 and 2023 and discuss the models and datasets used for this task. We also present future research directions in MSD.

Item Type:
Contribution in Book/Report/Proceedings
Uncontrolled Keywords:
Research Output Funding/no_not_funded
ID Code:
226301
Deposited On:
02 Jan 2025 16:30
Refereed?:
Yes
Published?:
Published
Last Modified:
04 Jan 2025 01:47