Content deleted Content added
m moved Data Pre-processing to Data pre-processing |
|||
Line 1:
{{Context|date=October 2009}}
'''Data pre-processing''' is an often neglected but important step in the data mining process. The phrase [[GIGO|"
If there is much irrelevant and redundant information present or noisy and unreliable data, then [[knowledge discovery]] during the training phase is more difficult. Data preparation and filtering steps can take considerable amount of processing time. Data pre-processing includes [[Data cleaning|cleaning]], normalization, transformation, [[feature extraction]] and selection, etc. The product of data pre-processing is the final [[training set]]. Kotsiantis et al. (2006) present a well-known algorithm for each step of data pre-processing.<ref>S. Kotsiantis, D. Kanellopoulos, P. Pintelas, "Data Preprocessing for Supervised Leaning", ''International Journal of Computer Science'', 2006, Vol 1 N. 2, pp
==References==
|