Burdick, Laura and Mihalcea, Rada and Boyd, Ryan and Pennebaker, James W. (2021) Analyzing Connections Between User Attributes, Images, and Text. Cognitive Computation, 13. pp. 241-260. ISSN 1866-9956
Accepted_Draft.pdf - Accepted Version
Available under License Creative Commons Attribution-NonCommercial.
Abstract
This work explores the relationship between a person’s demographic and psychological traits (e.g., gender, personality) and their self-identity images and captions. We use a dataset of images and captions provided by N = 1,350 individuals, and we automatically extract features from both the images and captions. We identify several visual and textual properties that show reliable relationships with individual differences between participants. The automated techniques presented here allow us to draw conclusions from our data that would be difficult to identify manually, and these techniques are extensible to other large datasets. We believe that our work on the relationship between user characteristics and user data has relevance in online settings, where users upload billions of images each day (Meeker M, 2014. Internet trends 2014–Code conference. Retrieved May 28, 2014).