Revision as of 14:35, 4 April 2016 by Dcirovic (Reverted 1 good faith edit by 1.187.94.163 using STiki); revision as of 02:29, 22 June 2016 by DanielPenfield (+see also)
If the data contain much irrelevant or redundant information, or are noisy and unreliable, then [[knowledge discovery]] during the training phase becomes more difficult. Data preparation and filtering steps can take a considerable amount of processing time. Data pre-processing includes [[Data cleaning|cleaning]], [[data normalization|normalization]], [[data transformation|transformation]], [[feature extraction]] and [[Feature selection|selection]], etc. The product of data pre-processing is the final [[training set]]. Kotsiantis et al. (2006) present a well-known algorithm for each step of data pre-processing.<ref>S. Kotsiantis, D. Kanellopoulos, P. Pintelas, "Data Preprocessing for Supervised Learning", ''International Journal of Computer Science'', 2006, Vol. 1, No. 2, pp. 111–117.</ref>

==See also==
* [[Data cleansing]]
* [[Data editing]]
* [[Data reduction]]
* [[Data wrangling]]

==References==

Data preprocessing: Difference between revisions