Sampling labelled profile data for identity resolution

Edwards, Matthew and Wattam, Stephen Michael and Rayson, Paul Edward and Rashid, Awais (2016) Sampling labelled profile data for identity resolution. In: Proceedings of IEEE International Conference on Big Data (IEEE BigData 2016) :. IEEE, USA, pp. 540-547. ISBN 9781467390064

[thumbnail of crosssampling_camera]
Preview
PDF (crosssampling_camera)
crosssampling_camera.pdf - Accepted Version
Available under License Creative Commons Attribution-NonCommercial.

Download (277kB)

Abstract

Identity resolution capability for social networking profiles is important for a range of purposes, from open-source intelligence applications to forming semantic web connections. Yet replication of research in this area is hampered by the lack of access to ground-truth data linking the identities of profiles from different networks. Almost all data sources previously used by researchers are no longer available, and historic datasets are both of decreasing relevance to the modern social networking landscape and ethically troublesome regarding the preservation and publication of personal data. We present and evaluate a method which provides researchers in identity resolution with easy access to a realistically-challenging labelled dataset of online profiles, drawing on four of the currently largest and most influential online social networks. We validate the comparability of samples drawn through this method and discuss the implications of this mechanism for researchers as well as potential alternatives and extensions.

Item Type:
Contribution in Book/Report/Proceedings
Additional Information:
©2016 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
ID Code:
82789
Deposited By:
Deposited On:
11 Nov 2016 13:10
Refereed?:
Yes
Published?:
Published
Last Modified:
01 Oct 2024 01:14