Copyright © 2023 The Author(s): This is an open-access article distributed under the terms of the Creative
Commons Attribution 4.0 International License (CC BY-NC 4.0) which permits unrestricted use, distribution, and
reproduction in any medium for non-commercial use provided the original author and source are credited.
International Journal of Scientific Research in Computer Science, Engineering and
Information Technology
ISSN : 2456-3307 Available Online at : www.ijsrcseit.com
doi : https://doi.org/10.32628/IJSRCSEIT
30
An in-Depth Review of Big Data Analytic Models for Clustering
Operations
Trishali Dhote, Prof. Pragati Patil
Department of Computer Science and Engineering, Tulsiramji Gaikwad-Patil of Engineering &Technology,
Nagpur, Maharashtra, India
A R T I C L E I N F O A B S T R A C T
Article History:
Accepted: 01 Sep 2023
Published: 08 Sep 2023
The ever-increasing volume, pace, and variety of data in the present
environment of data-driven decision-making need creative ways for extracting
insightful information sets. A key component of data analysis, clustering
techniques are essential for identifying latent patterns and structures in huge
datasets. This work conducts a thorough investigation of big data analytic models
for clustering operations in response to the pressing requirement to harness the
power of big data analytics for effective and accurate clustering process. The
necessity for this effort derives from the expanding levels of data volume and
complexity that characterise modern information ecosystems. While effective in
smaller datasets, conventional clustering approaches fall short when faced with
the enormous datasets typical in contemporary applications. As a result, choosing
and using the right big data analytic models for clustering have become crucial
tasks for both researchers and practitioners. The review procedure used here is
defined by a thorough and comprehensive approach. The first stage includes a
thorough literature review in which a wide range of big data analytical models
are methodically developed. These models cover a broad range of strategies, from
hierarchical and model-based approaches to density-based and partitioning
techniques. The foundation for the future research is laid by this thorough
assessment, which focuses on a detailed analysis of the characteristics and
performance measures of each model process. The empirical assessment
considers a wide range of factors, including accuracy, computational complexity,
scalability, and applicability for various application areas. A comprehensive
knowledge of each model's potential and constraints is revealed by closely
examining each model's performance across these aspects. This not only
encourages a thorough understanding of the models' capabilities but also equips
Publication Issue
Volume 9, Issue 5
September-October-2023
Page Number
30-47