Big data is usually defined by five (05) characteristics called 5Vs +1C (Volume, Velocity, Variety, Veracity, Value and Complexity). It means to data that are too large, dynamic and complex with certain degree of accuracy. For that, data become difficult to analyze using traditional data analysis techniques because of their high complexity and computational cost. Clustering analysis technique is the most used method for cope with huge amount of data. The main goal of clustering is to classify data into clusters in manner that data grouped are more similar. In this paper, we provide an overview of various clustering techniques used for data analysis.
A Review of Clustering Algorithms for Big Data
Kheyreddine Djouzi,Kadda Beghdad-Bey
Published 2019 in 2019 International Conference on Networking and Advanced Systems (ICNAS)
ABSTRACT
PUBLICATION RECORD
- Publication year
2019
- Venue
2019 International Conference on Networking and Advanced Systems (ICNAS)
- Publication date
2019-06-01
- Fields of study
Mathematics, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-37 of 37 references · Page 1 of 1
CITED BY
Showing 1-29 of 29 citing papers · Page 1 of 1