Self-organised direction aware data partitioning algorithm

Gu, Xiaowei and Angelov, Plamen Parvanov and Kangin, Dmitry and Principe, Jose (2018) Self-organised direction aware data partitioning algorithm. Information Sciences, 423. pp. 80-95. ISSN 0020-0255

[thumbnail of SODA_FR_v1]
Preview
PDF (SODA_FR_v1)
SODA_FR_v1.pdf - Accepted Version
Available under License Creative Commons Attribution-NonCommercial-NoDerivs.

Download (1MB)

Abstract

In this paper, a novel fully data-driven algorithm, named Self-Organised Direction Aware (SODA), for data partitioning and forming data clouds is proposed. The proposed SODA algorithm employs an extra cosine similarity-based directional component to work together with a traditional distance metric, thus, takes the advantages of both the spatial and angular divergences. Using the nonparametric Empirical Data Analytics (EDA) operators, the proposed algorithm automatically identifies the main modes of the data pattern from the empirically observed data samples and uses them as focal points to form data clouds. A streaming data processing extension of the SODA algorithm is also proposed. This extension of the SODA algorithm is able to self-adjust the data clouds structure and parameters to follow the possibly changing data patterns and processes. Numerical examples provided as a proof of the concept testify the proposed algorithm as an autonomous algorithm and demonstrate its high clustering performance and computational efficiency.

Item Type:
Journal Article
Journal or Publication Title:
Information Sciences
Additional Information:
This is the author’s version of a work that was accepted for publication in Information Sciences. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Information Sciences, 423, 2018 DOI: 10.1016/j.ins.2017.09.025
Uncontrolled Keywords:
/dk/atira/pure/subjectarea/asjc/1700/1702
Subjects:
?? autonomous learningnonparametricclusteringempirical data analytics (eda)cosine similarity traditional distance metricartificial intelligencetheoretical computer sciencesoftwareinformation systems and managementcontrol and systems engineeringcomputer scien ??
ID Code:
87745
Deposited By:
Deposited On:
13 Sep 2017 13:04
Refereed?:
Yes
Published?:
Published
Last Modified:
20 Mar 2024 00:38