Comparing filter and wrapper approaches for feature selection in handwritten character recognition

Cilia N. D.;D'Alessandro T.;De Stefano C.;Fontanella F.;Scotto di Freca A.
2023-01-01

Abstract

It is generally agreed that the selection of an appropriate set of features is a fundamental process in the development of any pattern recognition system. Its purpose is to identify the truly distinctive subset of features to reduce the size of the search space, without decreasing the classification performance. This problem is particularly relevant in the field of handwriting recognition, due to the enormous variability of character shape, which has led to the development of a large variety of feature sets that are becoming increasingly large in terms of the number of attributes. While promising, the results achieved so far have several limitations, which include, among others, the computational complexity of selecting and evaluating feature subsets and the difficulty in evaluating the interactions among features. In a previous study, we tried to overcome some of the above limitations by adopting a feature-ranking-based technique: a large study was carried out considering different filter-based techniques for feature subset evaluation. The aim of this work is to extend the previous study by presenting a broad comparison between filter and wrapper techniques for feature selection in the field of handwritten character recognition. In the experiments, we analysed one of the most effective and widely used sets of features in handwriting recognition, applied to standard real-world databases of handwritten characters. The experimental results confirmed that filter and wrapper approaches achieve similar performance, with the former selecting fewer features at a lower computational cost.
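
To make the filter/wrapper distinction in the abstract concrete, the following is a minimal sketch and not the procedure used in the paper. It assumes synthetic two-class data, a Fisher-style score as the filter criterion, and a nearest-centroid classifier inside a greedy forward wrapper; all function names are hypothetical and chosen only for illustration.

```python
import random
from statistics import mean, pstdev


def make_data(n=200, seed=0):
    """Synthetic data (assumption): features 0 and 1 are informative, 2-5 are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [rng.gauss(2.0 * label, 1.0), rng.gauss(-1.5 * label, 1.0)]
        row += [rng.gauss(0.0, 1.0) for _ in range(4)]
        X.append(row)
        y.append(label)
    return X, y


def fisher_score(X, y, j):
    """Filter criterion: class separation of feature j, computed without a classifier."""
    a = [r[j] for r, c in zip(X, y) if c == 0]
    b = [r[j] for r, c in zip(X, y) if c == 1]
    return (mean(a) - mean(b)) ** 2 / (pstdev(a) ** 2 + pstdev(b) ** 2 + 1e-12)


def filter_select(X, y, k):
    """Rank features one by one and keep the k best."""
    ranked = sorted(range(len(X[0])), key=lambda j: fisher_score(X, y, j), reverse=True)
    return sorted(ranked[:k])


def centroid_accuracy(Xtr, ytr, Xval, yval, feats):
    """Accuracy of a nearest-centroid classifier restricted to the given features."""
    cents = {}
    for c in set(ytr):
        rows = [r for r, l in zip(Xtr, ytr) if l == c]
        cents[c] = [mean(r[j] for r in rows) for j in feats]
    hits = 0
    for r, l in zip(Xval, yval):
        pred = min(cents, key=lambda c: sum((r[j] - m) ** 2 for j, m in zip(feats, cents[c])))
        hits += pred == l
    return hits / len(yval)


def wrapper_select(Xtr, ytr, Xval, yval, max_k):
    """Greedy forward selection: each candidate subset is scored by training the classifier."""
    chosen, best = [], 0.0
    while len(chosen) < max_k:
        cands = [j for j in range(len(Xtr[0])) if j not in chosen]
        acc, j = max((centroid_accuracy(Xtr, ytr, Xval, yval, chosen + [j]), j) for j in cands)
        if acc <= best:  # stop when no candidate improves validation accuracy
            break
        chosen.append(j)
        best = acc
    return sorted(chosen), best


X, y = make_data()
Xtr, ytr, Xval, yval = X[:100], y[:100], X[100:], y[100:]
print("filter :", filter_select(Xtr, ytr, 2))
print("wrapper:", wrapper_select(Xtr, ytr, Xval, yval, 3))
```

The cost difference is visible in the structure: the filter computes one score per feature, whereas the wrapper retrains and evaluates the classifier for every candidate subset it considers.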
Use this identifier to cite or link to this document: https://hdl.handle.net/11580/109203