Gad, Mahmoud and Park, Gitae and Rawsthorne, Sam and Young, Steven (2025) When Methods Matter : How Implementation Choices Shape Topic Discovery in Financial Text. Working Paper. SSRN Working Paper.
Full text not available from this repository.Abstract
This paper examines the application of LDA topic modelling to risk disclosures in FTSE350 firms' annual reports. We show that LDA implementation choices significantly impact topic representations and subsequent inferences. Using a corpus of FTSE350 annual reports, we show that preprocessing decisions, multiword expressions, and labelling strategies materially affect topic interpretability and granularity. Our analysis reveals that while risk reporting addresses key business risks at an aggregate level, the degree of firm-specific commentary is sensitive to topic granularity. Hierarchical linear modelling suggests that 27% of topic variation is within firms for broad topics, increasing to 75% for granular topics. We leverage GPT to enhance topic labelling, showcasing the potential of LLMs in financial text analysis. These findings challenge the claims of LDA objectivity and highlight the importance of domain expertise. We propose a practical checklist for LDA implementation in accounting and finance research emphasizing transparency and robustness checks.