The clustering on categorical variables has received intensive attention. In dataset with categorical features, some features show the superior performance on clustering procedure. In this paper, we propose a simple method to find such distinctive features by comparing pooled within-cluster mean relative difference and then partition the data upon such features and give subspace of the subgroups. The applications on zoo data and soybean data illustrate the performance of the proposed method.
Clustering Categorical Data Based on Within-Cluster Relative Mean Difference
Published 2017 in Open Journal of Statistics
ABSTRACT
PUBLICATION RECORD
- Publication year
2017
- Venue
Open Journal of Statistics
- Publication date
2017-04-20
- Fields of study
Mathematics, Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-12 of 12 references · Page 1 of 1
CITED BY
Showing 1-2 of 2 citing papers · Page 1 of 1