Optimising Image Feature Extraction and Selection: A Comprehensive Review With Spark Case Studies
J. G. Figueira-Domínguez, Beatriz Remeseiro, Verónica Bolón-Canedo
Published 2026 in Expert Syst. J. Knowl. Eng.
ABSTRACT
As benchmark image datasets grow in sample size and feature complexity, managing the resulting dimensionality becomes a challenge. Contrary to the expectation that more features mean more information and better outcomes, the curse of dimensionality often hampers performance. This paper reviews the literature on filter feature selection techniques applied to image features, highlighting their use with both classical and deep-learning-based feature extraction methods. Building on these findings, it proposes a scalable approach to image feature extraction and selection using Big Data technologies, specifically Apache Spark, to process large, high-dimensional datasets efficiently. The proposed framework integrates filter-based feature selection methods within a distributed environment to evaluate their effectiveness in image analysis tasks. Several experiments compare the results obtained with feature selection techniques at various reduction percentages. Results show that significant feature reduction can be achieved without compromising classification accuracy, demonstrating the potential of Spark-based distributed processing for large-scale image analytics.
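To make the idea of filter feature selection at a given reduction percentage concrete, the following is a minimal single-machine sketch. It is not the paper's Spark framework or any of the specific filters it evaluates: the scoring criterion (absolute Pearson correlation with the label), the function names `pearson_abs` and `select_features`, and the toy data are all assumptions chosen for illustration.

```python
from math import sqrt


def pearson_abs(xs, ys):
    """Absolute Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0  # a constant column carries no ranking signal
    return abs(cov) / sqrt(vx * vy)


def select_features(rows, labels, reduction):
    """Return sorted indices of the features kept after discarding
    roughly the `reduction` fraction with the lowest filter scores.

    Assumption: each feature is scored independently of any classifier,
    which is what makes this a filter method.
    """
    n_features = len(rows[0])
    n_keep = max(1, round(n_features * (1.0 - reduction)))
    scores = [
        pearson_abs([row[j] for row in rows], labels)
        for j in range(n_features)
    ]
    ranked = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:n_keep])


# Hypothetical toy data: 4 samples, 4 features, binary labels.
rows = [
    [0.1, 5.0, 1.0, 0.3],
    [0.2, 5.0, 0.0, 0.1],
    [0.9, 5.0, 1.0, 0.4],
    [0.8, 5.0, 0.0, 0.2],
]
labels = [0, 0, 1, 1]

kept = select_features(rows, labels, reduction=0.5)
print(kept)
```

With a 50% reduction on this toy data, the constant feature (index 1) and the feature uncorrelated with the label (index 2) are discarded, leaving indices 0 and 3. In a distributed setting the per-feature scores would be computed in parallel over partitions of the data, but the ranking and thresholding step stays the same.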
PUBLICATION RECORD
- Publication year
2026
- Venue
Expert Syst. J. Knowl. Eng.
- Publication date
2026-01-05
- Fields of study
Computer Science
- Source metadata
Semantic Scholar