Turkish Journal of Electrical Engineering and Computer Sciences
DOI
10.3906/elk-1301-181
Abstract
This paper presents a classification system in which learning, feature selection, and classification for incomplete data are simultaneously carried out in an online manner. Learning is conducted on a predefined model including the class-dependent mean vectors and correlation coefficients, which are obtained by incrementally processing the incoming observations with missing features. A nearest neighbor with a Gaussian mixture model, whose parameters are also estimated from the trained model, is used for classification. When a testing observation is received, the algorithm discards the missing attributes on the observation and ranks the available features by performing feature selection on the model that has been trained so far. The developed algorithm is tested on a benchmark dataset. The effect of missing features for online feature selection and classification are discussed and presented. The algorithm easily converges to the stable state of feature selection with similar accuracy results as those when using the complete and incomplete feature set with up to 50% missing data.
Keywords
Online feature selection, classification, missing data, incremental learning
First Page
1625
Last Page
1636
Recommended Citation
KALKAN, HABİL
(2014)
"Online feature selection and classification with incomplete data,"
Turkish Journal of Electrical Engineering and Computer Sciences: Vol. 22:
No.
6, Article 17.
https://doi.org/10.3906/elk-1301-181
Available at:
https://journals.tubitak.gov.tr/elektrik/vol22/iss6/17
Included in
Computer Engineering Commons, Computer Sciences Commons, Electrical and Computer Engineering Commons