HANDLING MISSING VALUES VIA A NEURAL SELECTIVE INPUT MODEL Noel Lopes * , Bernardete Ribeiro † Abstract: Missing data represent an ubiquitous problem with numerous and di- verse causes. Handling Missing Values (MVs) properly is a crucial issue, in partic- ular in Machine Learning (ML) and pattern recognition. To date, the only option available for standard Neural Networks (NNs) to handle this problem has been to rely on pre-processing techniques such as imputation for estimating the missing data values, which limited considerably the scope of their application. To cir- cumvent this limitation we propose a Neural Selective Input Model (NSIM) that accommodates different transparent and bound models, while providing support for NNs to handle MVs directly. By embedding the mechanisms to support MVs we can obtain better models that reflect the uncertainty caused by unknown val- ues. Experiments on several UCI datasets with both different distributions and proportion of MVs show that the NSIM approach is very robust and yields good to excellent results. Furthermore, the NSIM performs better than the state-of-the- art imputation techniques either with higher prevalence of MVs in a large number of features or with a significant proportion of MVs, while delivering competitive performance in the remaining cases. We demonstrate the usefulness and validity of the NSIM, making this a first-class method for dealing with this problem. Key words: Missing Values, Neural Networks, Back-Propagation, Multiple Back-Propagation Received: April 17, 2012 Revised and accepted: July 10, 2012 1. Introduction Incomplete data pose an unavoidable problem for most real-world databases which often contain missing data [1, 2]. In particular, in domains such as gene expression * Noel Lopes UDI/IPG – Research Unit, Polytechnic Institute of Guarda, Portugal; CISUC – Center for Infor- matics and Systems of University of Coimbra, Portugal, E-mail: noel@ipg.pt † Bernardete Ribeiro Department of Informatics Engineering, University of Coimbra, Portugal; CISUC – Center for Informatics and Systems of University of Coimbra, Portugal, E-mail: bribeiro@dei.uc.pt c ICS AS CR 2012 357