Content deleted Content added
Tags: Reverted Mobile edit Mobile web edit |
m Reverted edits by 2001:8F8:1DD8:B422:E4EB:F6AD:5F16:FBCC (talk): editing tests (HG) (3.4.12) |
||
Line 7:
[[Data mining]] is a particular data analysis technique that focuses on statistical modeling and knowledge discovery for predictive rather than purely descriptive purposes, while [[business intelligence]] covers data analysis that relies heavily on aggregation, focusing mainly on business information. In statistical applications, data analysis can be divided into [[descriptive statistics]], [[exploratory data analysis]] (EDA), and [[Statistical hypothesis testing|confirmatory data analysis]] (CDA).<ref>{{Citation|title=Data Coding and Exploratory Analysis (EDA) Rules for Data Coding Exploratory Data Analysis (EDA) Statistical Assumptions|date=2004-08-16 |url=http://dx.doi.org/10.4324/9781410611420-6|work=SPSS for Intermediate Statistics|pages=42–67 |publisher=Routledge|doi=10.4324/9781410611420-6|isbn=978-1-4106-1142-0|access-date=2021-05-29}}</ref> EDA focuses on discovering new features in the data while CDA focuses on confirming or falsifying existing [[hypotheses]].<ref>{{Cite book |last1=Samandar|first1=Petersson|first2=Sofia|last2=Svantesson|title=Skapandet av förtroende inom eWOM : En studie av profilbildens effekt ur ett könsperspektiv |date=2017|publisher=Högskolan i Gävle, Företagsekonomi|oclc=1233454128}}</ref> [[Predictive analytics]] focuses on the application of statistical models for predictive forecasting or classification, while [[text analytics]] applies statistical, linguistic, and structural techniques to extract and classify information from textual sources, a variety of [[unstructured data]]. All of the above are varieties of data analysis.<ref>{{Cite journal|last=Goodnight|first=James|date=2011-01-13 |title=The forecast for predictive analytics: hot and getting hotter |url=http://dx.doi.org/10.1002/sam.10106|journal=Statistical Analysis and Data Mining: The ASA Data Science Journal|volume=4|issue=1|pages=9–10|doi=10.1002/sam.10106|s2cid=38571193 |issn=1932-1864}}</ref>
==Data
[[File:Data visualization process v1.png|right|350px|thumb|Data science process flowchart from ''Doing Data Science'', by Schutt & O'Neil (2013)]]
|