Jagfeld, Glorianna and Lobban, Fiona and Rayson, Paul and Jones, Steven (2021) Understanding who uses Reddit : Profiling individuals with a self-reported bipolar disorder diagnosis. In: Computational Linguistics and Clinical Psychology: Improving Access : Proceedings of the Seventh Workshop. Association for Computational Linguistics (ACL Anthology), Stroudsberg, PA, pp. 1-14. ISBN 9781954085411
BipolarOnReddit_CLPsych21_CameraReady.pdf - Accepted Version
Available under License Creative Commons Attribution.
Download (383kB)
Abstract
Recently, research on mental health conditions using public online data, including Reddit, has surged in NLP and health research but has not reported user characteristics, which are important to judge generalisability of findings. This paper shows how existing NLP methods can yield information on clinical, demographic, and identity characteristics of almost 20K Reddit users who self-report a bipolar disorder diagnosis. This population consists of slightly more feminine- than masculine-gendered mainly young or middle-aged US-based adults who often report additional mental health diagnoses, which is compared with general Reddit statistics and epidemiological studies. Additionally, this paper carefully evaluates all methods and discusses ethical issues.