Authors A. N. Natskevich, I. O. Kursitys
Month, Year 04, 2018 @en
Index UDC 002.53:004.89
Abstract The article is devoted to solving the clustering problem, which is one of the most important and popular problem in intelligent data analysis. Clustering, which means uniting the similar elements in groups, is one of the fundamental problem in Data Mining. Application of solving this problem includes image segmentation, marketing, protection from financial fraud, forecasting, text analysis and many other fields. A constantly growing scope of generated, transferred and processed data determines the significance of the problem. The authors investigate the clustering problem, provide the problem statement, the main mathematical formulas and the objective function needed for solving. The article consists of the analytical review of the popular algorithms, such as hierarchical optimization, squared error, k-means, c-means and graph-based algorithms. The authors note their benefits and shortcomings. The clustering problem is NP-complete, which determines the advantages of application of bioinspired models and methods for solving the mentioned problem. The related works of famous researchers are given in the article, such as: particle swarm optimization, ant colony optimization, artificial bee colony algorithm, bacteria colony optimization, cuckoo search algorithm, which demonstrate the effectiveness of bioinspired algorithms in terms of solving the clustering problem. The authors propose a combined bioinspired algorithm which applies the ant colony optimization and the bat algorithm successively. The main ideas of the algorithms, their flowcharts and solutions encoding schemes are provided herein. The local search method is implemented in the bat algorithm. The experiments carried out with benchmarks demonstrate the effectiveness of the proposed algorithm in comparison with the k-means algorithm and the genetic one. During the experimental research the authors managed to define the developed combined bioinspired algorithm time complexity. The authors are planning to apply the suggested combined solution for boosting of the algorithms, which works with several algorithms and reveals the best solution among several ones obtained with different bioinspired algorithms.

Download PDF

Keywords Clustering; bioinspired algorithm; ant colony optimization; bat algorithm; swarm intelligence, artificial intelligence.
References 1. Ka-Chun Wong. A Short Survey on Data Clustering Algorithms, IEEE Second International Conference on Soft Computing and Machine Intelligence, 2015.
2. Kravchenko Yu.A., Natskevich A.N. Model' resheniya zadachi klasterizatsii dannykh na osnove ispol'zovaniya bustinga algoritmov adaptivnogo povedeniya murav'inoy kolonii i k-srednikh [A model for solving the problem of data clustering based on the use of boosting algorithms of adaptive behavior of ant colony and k-means], Izvestiya YUFU. Tekhnicheskie nauki [Izvestiya SFedU. Engineering Sciences], 2017, No. 7 (192), pp. 90-102.
3. Ershov K.S., Romanova T.N. Analiz i klassifikatsiya algoritmov klasterizatsii [Analysis and classification of clustering algorithms], Novye informatsionnye tekhnologii v avtomatizirovannykh sistemakh [New information technologies in automated systems], 2016, Issue 19, pp. 274-279.
4. Van D.M. and Engelbrecht A.P. Data clustering using particle swarm optimization, Proceedings of the Congress on Evolutionary Computation, 2003, pp. 215-220.
5. Shelokar P.S., Jayaraman V.K. and Kulkarni B.D. An Ant Colony Approach for Clustering, Analytica Chimica Acta, 2004, Vol. 509, No. 2, pp. 187-195.
6. Yi-Tung Kao, Erwie Zahara and I-Wei Kao. A hybridized approach to data clustering, Expert Systems with Applications, 2008, Vol. 34, No. 3, pp. 1754-1762.
7. Changsheng Zhang, Dantong Ouyang and Jiaxu Ning. An artificial bee colony approach for clustering, Expert Systems with Applications, 2010, Vol. 37, No. 7, pp. 4761-4767,
8. Miao Wan, Lixiang Li, Jinghua Xiao, Cong Wang and Yixian Yang, Data clustering using bacterial foraging optimization, Journal of Intelligent Information Systems, 2012, Vol. 38, No. 2, pp. 321-341.
9. Senthilnath J., Vipul Das, Omkar S.N. and Mani V. Clustering using Levy Flight Cuckoo Search, Proceedings of Seventh International Conference on Bio-Inspired Computing: Theories and Applications, Advances in Intelligent Systems and Computing, 2012, Vol. 202, pp. 65-75.
10. Jensi R., Wiselin Jiji G. MBA-LF: a new data clustering method using modified bat algorithm and Levy flight, ICTACT J. Soft Comput., 2015, No. 6, pp. 1093-1101.
11. Kuliev E.V., Lezhebokov A.A., Kravchenko Yu.A. Roevoy algoritm poiskovoy optimizatsii na osnove modelirovaniya povedeniya letuchikh myshey [Swarm algorithm search engine optimization is based on modeling the behavior of bats], Izvestiya YUFU. Tekhnicheskie nauki [Izvestiya SFedU. Engineering Sciences], 2016, No. 7 (180), pp. 53-62.
12. Chastikova V.A., Novikova E.F. Algoritm letuchikh myshey dlya resheniya zadachi global'noy optimizatsii [The bats algorithm for solving global optimization problems], Nauchnye trudy KubGTU (elektronnyy setevoy politematicheskiy zhurnal) [Научные труды КубГТУ (электронный сетевой политематический журнал)], 2015, No. 2. Available at: (accessed 15 December2017).
13. Krasnoshlyk N.A. Reshenie zadachi global'noy optimizatsii modifitsirovannym algoritmom letuchikh myshey [The global optimization problem the modified algorithm of bats], Radіoelektronіka, іnformatika, upravlіnnya [Radoactive, Informatics, management], 2015, No. 4 (35), pp. 96-103.
14. Kravchenko Yu.A., Natskevich A.N., Kursitys I.O. Busting bioinspirirovannykh algoritmov dlya resheniya zadachi klasterizatsii [Boosting bioinspired algorithms for solving the clustering problem], Mezhdunarodnaya konferentsiya po myagkim vy-chisleniyam i izmereniyam [International conference on soft computing and measurements], 2018, Vol. 1, pp. 777-780.
15. Bova V.V., Kravchenko Y.A., Kureichik V.V. Development of distributed information systems: ontological approach, Advances in Intelligent Systems and Computing, 2015, Vol. 349,
pp. 113-122.
16. Kravchenko Yu.A. Tekhnologiya analiza nadezhnosti adaptivnykh informatsionnykh sred [Technology reliability analysis of adaptive information environments], Izvestiya YUFU. Tekhnicheskie nauki [Izvestiya SFedU. Engineering Sciences], 2010, No. 12 (113), pp. 103-108.
17. Kravchenko Yu.A. Otsenka kognitivnoy aktivnosti pol'zovatelya v sistemakh podderzhki prinyatiya resheniy [Evaluation of cognitive activity of the user in decision support systems], Izvestiya YUFU. Tekhnicheskie nauki [Izvestiya SFedU. Engineering Sciences], 2009, No. 4 (93), pp. 113-117.
18. Rodzin S.I., Kureychik V.V. Teoreticheskie voprosy i sovremennye problemy razvitiya kognitivnykh bioinspirirovannykh algoritmov optimizatsii [Theoretical questions and contemporary problems of the development of cognitive bio-inspired algorithms for optimization], Kibernetika i programmirovanie [Cybernetics and programming], 2017, No. 3, pp. 51-79.
19. Donkuan X. Yingjie T.A. comprehensive survey of clustering algorithms, Annals of Data Science, 2015, Vol. 2, Issue 2, pp. 165-193.
20. Kravchenko Y.A., Kuliev E.V., Kursitys I.O.: Information's semantic search, classification, structuring and integration objectives in the knowledge management context problems, In: 8th IEEE International Conference on Application of Information and Communication Technologies (AICT 2016). IEEE Press, Baku, Azerbaijan, 2016, pp. 136-141.

Comments are closed.