Nundloll, Vatsala and Lamb, Rob and Hankin, Barry and Blair, Gordon (2021) A semantic approach to enable data integration for the domain of flood risk management. Environmental Challenges, 3: 100064. ISSN 2667-0100
1_s2.0_S2667010021000433_main.pdf - Published Version
Available under License Creative Commons Attribution.
Abstract
With so many things around us continuously producing and processing data, be they mobile phones, sensors attached to devices, or satellites orbiting thousands of kilometres above our heads, data is becoming increasingly heterogeneous. Scientists inevitably face data challenges, commonly described as the 4 V’s of data: volume, variety, velocity and veracity. In this paper, we address the issue of data variety. The task of integrating and querying such heterogeneous data is further compounded if the data is in unstructured form. We hence propose an approach that uses Semantic Web and Natural Language Processing techniques to resolve the heterogeneity arising in data formats, bring together structured and unstructured data, and provide a unified data model for querying disparate data sets.