El Haj, Mahmoud and Rayson, Paul Edward and Young, Steven Eric and Alves, Paulo and Herrero Zorita, Carlos (2019) Multilingual Financial Narrative Processing : Analysing Annual Reports in English, Spanish and Portuguese. In: Multilingual Text Analysis : Challenges, Models, and Approaches. World Scientific Publishing. ISBN 9789813274877
Full text not available from this repository.Abstract
This chapter describes and evaluates the use of Information Extraction and Natural Language Processing methods for extraction and analysis of financial annual reports in three languages: English, Spanish and Portuguese. The work described retains information on document structure which is needed to enable a clear distinction between narrative and financial statement components of annual reports and between individual sections within the narratives component. Extraction accuracy varies between languages with English exceeding 95 %. We apply the extraction methods on a comprehensive sample of annual reports published by UK, Spanish and Portuguese non-financial firms between 2003 and 2014.