Data mining is process of analyzing enormous sets of data and extracting meaningful information from data. Temporal data mining deals with data which has time information.In practical, the data collected may contain noisy, inconsistent data and in many cases the data may be missing. So one of the important step that need to be done in data mining is data pre-processing. Incomplete data may generate biased results and impact the accuracy of analysis. In order to rectify this it is important to predict the missing values based on other details in the dataset. The work focuses on predicting missing data using mean imputation, hotdecking and Inverse Distance Weighted Interpolation methods and compare the results of each of these methods. Machine learning methods applied to the imputed dataset will give better accuracy than that of the incomplete dataset.
Missing values, imputation, inverse distance weighted interpolation.
Arumuga Nainar S., A Comparative Study of Missing Value Imputation Methods on Time Series Data, HCTL Open International Journal of Technology Innovations and Research (IJTIR), Volume 14, April 2015, eISSN: 2321-1814, ISBN (Print): 978-1-62951-946-3.