Professional Documents
Culture Documents
Review and Comparative Analysis of Data Clustering Algorithms
Review and Comparative Analysis of Data Clustering Algorithms
not scale properly because of time complexity and as such has the basic hierarchical structure within the data [7]. Also,
detecting noise and outliers is poor (Saket and Pandya, 2016). BIRCH is designed to reduce the numerous input and output
operations as it incrementally and dynamically groups
Hence, the present day data mining deals on larger dataset
incoming data points and attempt to bring out the most
mined at a high level of speed, time, accuracy and detecting
appropriate quality clustering with the resources available
noisy data and outliers, hence this algorithm cannot function
such as time and memory (Rani and Rohil, 2013). One of the
effectively.
advantages of this algorithm is ease of managing different
Moreover, the hierarchical clustering algorithm shapes of distance or similarity.
(agglomerative or divisive) is applied in analyzing data which
Agglomerative or Bottom up
R
Q,
R
M, Q
N, P
O, O, P
P,
Q, O
R
N
M,
N
M
[3] Baviskar, C. T. and Patil, S. S., (2016). Dealing with Overlapping [12] Mann, K. A. and Kaur, N., (2013). Review Paper on Clustering
of Clusters by usingFuzzy K-Means Clustering. International Techniques. Global Journal of Computer Science and Technology
Conference on Global Trends in Engineering, Technology and Software & Data Engineering, USA, 13(5), pp. 42-48.
Management (ICGTETM), India, pp. 45-51. [13] Popat, S. K. and Emmanuel, M., (2014). Review and Comparative
[4] Chaudhari, B. and Parikh, M., (2012). A Comparative Study of Study of Clustering Techniques. International Journal of
Clustering Algorithms UsingWeka Tools. International Journal of Computer Science and Information Technologies (IJCSIT), 5(1),
Application or Innovation in Engineering & Management pp. 805-812.
(IJAIEM), 1(2), pp. 154-158. [14] Rai, P. and Singh, S., (2010). A Survey of Clustering Techniques.
[5] Chris, D. and Xiaofeng, H., (2002). Cluster Merging and Splitting International Journal of Computer Applications, 7(12), pp. 1-5.
in Hierarchical Clustering Algorithms. [15] Rani, Y. and Rohil, H., (2013). A Study of Hierarchical Clustering
[6] DeFreitas, K. and Bernard, M., (2015). Comparative Performance Algorithm. International Journal of Information and Computation
Analysis of Clustering Techniques in Educational Data Mining. Technology, India, 3(11), pp. 1225-1232.
International Journal on Computer Science and Information [16] Rokach, L. and Maimon, O., (2002). Clustering Methods.
Systems,Trinidad and Tobago, 10(2), pp. 65-78. Department of Industrial Engineering Tel-Aviv University.
[7] DeFreitas, K. and Bernard, M., (2014). A Framework for Flexible [17] Salgado, C. M., Azevedo, C., Proenca, H. and VSM., (2016).
Educational Data Mining. In The 2014 International Conference Secondary Analysis of ElectronicHealth Records: Noise versus
on Data Mining. Las Vegas, USA, pp. 176–180. Outliers. pp 163 -183.
[8] Firdaus, S. and Uddin Md. A., (2015). A Survey on Clustering [18] Saket, S. J. and Pandya, S., (2016). An Overview of Partitioning
Algorithms and ComplexityAnalysis. International Journal of Algorithms in Clustering Techniques. International Journal of
Computer Science Issues (IJCSI), Bangladesh, 12(2), pp. 62-85. Advanced Research in Computer Engineering &Technology
[9] Gupta, G. K., (2006). Introduction to data mining with case (IJARCET), 5(6), pp. 1943-1946.
studies.PHI Learning Pvt. Ltd. [19] Tayal, M. A. and Raghuwanshi, M. M., (2010). Review on
[10] Han, J., Kamber, M. and Pei, J., (2012). Data Mining: Concepts Various Clustering Methods for the Image Data. Journal of
and Techniques 3rd ed., Massachusetts, USA: Morgan Kaufmann. Emerging Trends in Computing and Information Sciences, India,
[11] Jain, S., Aalam, M. A. and Doja, M. N., (2010). K-means 2, pp. 34-38.
Clustering Using Weka Interface.Proceedings of the 4 th National [20] Vijayarani, S. and Jothi, P., (2014). Hierarchical and Partitioning
Conference, India. Clustering Algorithms for Detecting Outliers in Data Streams.
International Journal of Advanced Research in Computer and
Communication Engineering, 3(4), pp. 6204-6207.