International Journal of Computational Intelligence Research
ISSN 0973-1873 Volume 7, Number 3 (2011), pp. 271-294
© Research India Publications
http://www.ripublication.com/ijcir.htm
On Genetic Algorithm and Multiple Preprocessors
Assisted Feature Boosting for Electronic Nose Signal
Processing
Prabha Verma
1
, Divya Somvanshi
2
and R.D.S. Yadava
3
Sensors & Signal Processing Laboratory, Department of Physics,
Faculty of Science, Banaras Hindu University, Varanasi 221005, India.
E-mail:
1
pverma.bhu@gmail.com,
2
somvanshi.divya@gmail.com,
3
ardius@gmail.com
Abstract
The paper presents a method for feature extraction that explores data space
through different preprocessing strategies in combination with principal
component analysis (PCA) and genetic algorithm (GA). A preprocessor/PCA
combination transforms data space into feature space; a change in
preprocessor results in an alternate feature space. The proposed method first
fuses the feature spaces by simple concatenation of the alternate feature
vectors then allows them to evolve genetically. The genetic evolution of each
fused feature vector is based on treating the feature vector as chromosome and
the feature components as genes. The initial population is created on the basis
of a probability distance metric. The fitness and ranking is done by using PCA
generated variances as measure of significance. In the terminal population the
frequency of a gene (principal component) occurrence is interpreted as a
measure of its significance in defining the feature vector. Finally, the feature
components are given additional weight according to ( )
j j ij ij
p p z z
2
log 1 − =
where
ij
z denotes j-th feature component of i-th sample in fused feature space
and
j
p denotes the probability of j-th component occurring in the terminal
population. In order to demonstrate the efficacy of this idea we employed only
two well known preprocessing methods: vector autoscaling and dimensional
autoscaling. The feature vectors defined in this manner were used as input a
radial basis neural network classifier for validation. Several benchmark
datasets (both chemical and non-chemical) available from open sources were
used in the analysis for validation. It is found that the scheme of feature fusion
and weighting enhances classification rate in most cases analyzed.
Keywords: Feature extraction, electronic nose, feature fusion, genetic
algorithm, pattern recognition.