Content deleted Content added
added intro paragraph |
m wikilinks |
||
Line 1:
{{context}}
'''Data pre-processing
If there is much irrelevant and redundant information present or noisy and unreliable data, then [[knowledge discovery]] during the training phase is more difficult.
▲If there is much irrelevant and redundant information present or noisy and unreliable data, then knowledge discovery during the training phase is more difficult. It is well known that data preparation and filtering steps take considerable amount of processing time in ML problems. Data [[Preprocessing|pre-processing]] includes data cleaning, normalization, transformation, [[feature extraction]] and selection, etc. The product of data pre-processing is the final [[training set]]. Kotsiantis et al. (2006) present a well know algorithm for each step of data pre-processing.<ref>S. Kotsiantis, D. Kanellopoulos, P. Pintelas, Data Preprocessing for Supervised Leaning, International Journal of [[Computer science|Computer Science]], 2006, Vol 1 N. 2, pp 111-117.</ref>
==References==
{{reflist}}
[[Category:Machine learning]]
|