International Journal of Electrical and Computer Engineering (IJECE) Vol. 10, No. 3, Juni 2020, pp. 2742~2754 ISSN: 2088-8708, DOI: 10.11591/ijece.v10i3.pp2742-2754 2742 Journal homepage: http://ijece.iaescore.com/index.php/IJECE A comprehensive insight towards pre-processing methodologies applied on GPS data R. Prabha 1 , Mohan G. Kabadi 2 1 Visvesvaraya Technological University, India 2 Department of Computer Science and Engineering, Sai Vidya Institute of Technology, India Article Info ABSTRACT Article history: Received Apr 30, 2019 Revised Nov 6, 2019 Accepted Nov 24, 2019 Reliability in the utilization of the Global Positioning System (GPS) data demands a higher degree of accuracy with respect to time and positional information required by the user. However, various extrinsic and intrinsic parameters disrupt the data transmission phenomenon from GPS satellite to GPS receiver which always questions the trustworthiness of such data. Therefore, this manuscript offers a comprehensive insight into the data preprocessing methodologies evolved and adopted by present-day researchers. The discussion is carried out with respect to standard methods of data cleaning as well as diversified existing research-based approaches. The review finds that irrespective of a good number of work carried out to address the problem of data cleaning, there are critical loopholes in almost all the existing studies. The paper extracts open end research problems as well as it also offers an evidential insight using use-cases where it is found that still there is a critical need to investigate data cleaning methods. Keywords: Data cleaning Data pre-processing Global positioning system Receiver Satellite Copyright © 2020 Institute of Advanced Engineering and Science. All rights reserved. Corresponding Author: R. Prabha, Visvesvaraya Technological University, Belagavi, Karnataka, India Email: research.prabha.r@gmail.com 1. INTRODUCTION The utilization of the Global Positioning System (GPS) has been increasing since the last decade as it is one of the most cost-effective navigational assistance [1]. With the proliferated usage of smartphone, various navigational applications and location services are directly dependent on the GPS data. The GPS system extracts the signal information from the satellites in order to obtain the location-specific information. On the basis of the usage of the different GPS receiver, the information of the location is generically provided in the form of Longitude, Latitude, and altitude [2]. The interesting factor about the GPS signal is its publicly availability and accessibility. From a technical viewpoint, the time factor and the spatial factor are the sole backbones of GPS satellites that bear an atomic clock with superior synchronization capability. They are also capable of rectifying and compensating any form of drift in the clock timing with the ground devices very spontaneously. A radio signal is being consistently transmitted by all the satellites of GPS that consists of updated positional data and time information of that position. It is also believed that latency between the GPS satellite is highly dependent on the distance from the earthly receiver and GPS satellite as it is free from any dependency of the speed of satellite and moreover the radio-waves have uniform velocity [3]. There is a typical computation carried out by the earthly receiver too which is responsible for computing the appropriate positional information after it obtained multiple data from multiple satellites. The computation will need to be carried out with higher accuracy. It is also believed that in order to compute the position information by the GPS receiver, there should be the presence of at least four GPS satellite within a line of sight. Although, this is a condition and this condition is quite hard to be satisfied in many real-time cases. The signal forwarded by the GPS satellite consists of much essential information. The first