Data analysis: Difference between revisions

Content deleted Content added
S
Tags: Reverted Visual edit Mobile edit Mobile web edit
m brackets fixed
 
(7 intermediate revisions by 6 users not shown)
Line 1:
{{short description|Thenone}} process<!-- of"none" analyzingis datapreferred towhen discoverthe usefultitle informationis andsufficiently supportdescriptive; see [[WP:SDNONE]] decision-making}}->
{{Data Visualization}}
{{Computational physics}}
Line 22:
The data is necessary as inputs to the analysis, which is specified based upon the requirements of those directing the analytics (or customers, who will use the finished product of the analysis).<ref>{{Citation|title=USE OF THE DATA|date=2015-02-06|url=http://dx.doi.org/10.1002/9781118986370.ch18|work=Handbook of Petroleum Product Analysis|pages=296–303|place=Hoboken, NJ|publisher=John Wiley & Sons, Inc|doi=10.1002/9781118986370.ch18|isbn=978-1-118-98637-0|access-date=2021-05-29}}</ref> The general type of entity upon which the data will be collected is referred to as an [[Statistical unit|experimental unit]] (e.g., a person or population of people). Specific variables regarding a population (e.g., age and income) may be specified and obtained. Data may be numerical or categorical (i.e., a text label for numbers).<ref name="Schutt & O'Neil"/>
 
===Data collectionscollection ===
Data may be collected from a variety of sources.<ref>{{Cite journal|title=Table 1: Data type and sources of data collected for this research.|journal=PeerJ|date=7 May 2021|volume=9|pages=e11387|doi=10.7717/peerj.11387/table-1|last1=Olusola|first1=Johnson Adedeji|last2=Shote|first2=Adebola Adekunle|last3=Ouigmane|first3=Abdellah|last4=Isaifan|first4=Rima J. |doi-access=free }}</ref> A [[List of datasets for machine-learning research|list of data sources]] are available for study & research. The requirements may be communicated by analysts to [[Data custodian|custodians]] of the data; such as, [[Information systems technician|Information Technology personnel]] within an organization.<ref>{{Citation|last=MacPherson|first=Derek|title=Information Technology Analysts' Perspectives|date=2019-10-16|url=http://dx.doi.org/10.4324/9780429437564-12|work=Data Strategy in Colleges and Universities|pages=168–183|publisher=Routledge|doi=10.4324/9780429437564-12|isbn=978-0-429-43756-4|s2cid=211738958|access-date=2021-05-29}}</ref> '''Data collection''' or '''data gathering''' is the process of gathering and [[measuring]] [[information]] on targeted variables in an established system, which then enables one to answer relevant questions and evaluate outcomes. The data may also be collected from sensors in the environment, including traffic cameras, satellites, recording devices, etc. It may also be obtained through interviews, downloads from online sources, or reading documentation.<ref name="Schutt & O'Neil"/>