Clustering helps in understanding the patterns present in networks and thus helps in getting useful insights. In real‐world complex networks, analysing the structure of the network plays a vital role in clustering. Most of the existing clustering algorithms identify disjoint clusters, which do not consider the structure of the network. Moreover, the clustering results do not provide consistency and precision. This paper presents an efficient parallel fuzzy clustering algorithm named “PFCA” for large complex networks using Hadoop and Pregel (parallel processing framework for large graphs). The proposed algorithm first selects the candidate cluster heads on the basis of their influence in the network and then determines the number of clusters by analysing the graph structure using PageRank algorithm. The proposed algorithm identifies both disjoint and fuzzy clusters efficiently and finds membership of only those vertices, which are the part of more than one cluster. The performance is validated on 6 real‐life networks having up to billions of connections. The experimental results show that the proposed algorithm scales up linearly with the increase in size of network. It is also shown that the proposed algorithm is efficient and has high precision in comparison with the other state‐of‐art fuzzy clustering algorithms in terms of F score and modularity.
PFCA: An influence‐based parallel fuzzy clustering algorithm for large complex networks
Published 2018 in Expert Syst. J. Knowl. Eng.
ABSTRACT
PUBLICATION RECORD
- Publication year
2018
- Venue
Expert Syst. J. Knowl. Eng.
- Publication date
2018-07-18
- Fields of study
Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-49 of 49 references · Page 1 of 1
CITED BY
Showing 1-1 of 1 citing papers · Page 1 of 1