Optimising Image Feature Extraction and Selection: A Comprehensive Review With Spark Case Studies

J. G. Figueira‐Domínguez,Beatriz Remeseiro,Verónica Bolón-Canedo

Published 2026 in Expert Syst. J. Knowl. Eng.

ABSTRACT

As benchmark image datasets expand in sample size and feature complexity, the challenge of managing increased dimensionality becomes apparent. Contrary to the expectation that more features equate to enhanced information and improved outcomes, the curse of dimensionality often hampers performance. This paper reviews existing literature on filter feature selection techniques applied to image features, highlighting their use in both classical and deep‐learning‐based feature extraction methods. Building on these findings, this study proposes a scalable approach for image feature extraction and selection using Big Data technologies, specifically Apache Spark, to efficiently process large and high‐dimensional datasets. The proposed framework integrates filter‐based feature selection methods within a distributed environment to evaluate their effectiveness in image analysis tasks. Several experiments were performed to compare the results using feature selection techniques with various reduction percentages. Results show that significant feature reduction can be achieved without compromising classification accuracy, demonstrating the potential of Spark‐based distributed processing for large‐scale image analytics.

PUBLICATION RECORD

CITATION MAP

EXTRACTION MAP

CLAIMS

  • No claims are published for this paper.

CONCEPTS

  • No concepts are published for this paper.

REFERENCES

Showing 1-80 of 80 references · Page 1 of 1

CITED BY