Gu, Xiaowei and Angelov, Plamen Parvanov (2016) Autonomous data-driven clustering for live data stream. In: 2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC 2016) :. IEEE, pp. 1128-1135. ISBN 9781509018987
Autonomous_data_driven_real_time_clustering_v3.pdf - Accepted Version
Download (927kB)
Abstract
In this paper, a novel autonomous data-driven clustering approach, called AD_clustering, is presented for live data streams processing. This newly proposed algorithm is a fully unsupervised approach and entirely based on the data samples and their ensemble properties, in the sense that there is no need for user-predefined or problem-specific assumptions and parameters, which is a problem most of the current clustering approaches suffer from. Moreover, the proposed approach automatically evolves its structure according to the experimentally observable streaming data and is able to recursively update its self-defined parameters using only the current data sample, meanwhile, discards all the previous data samples. Experimental results based on benchmark datasets exhibit the higher performance of the proposed fully autonomous approach compared with the comparative approaches with user- and problem- specific parameters to be predefined. This new clustering algorithm is a promising tool for further applications in the field of real-time streaming data analytics.