Data processing: Difference between revisions

Content deleted Content added
m Added more wikilinks, fixed punctuation, removed empty spaces.
m Reverted 1 edit by 2603:8000:6C00:1A86:555:3CA:BC7C:82B6 (talk) to last revision by Milo8505
 
(19 intermediate revisions by 14 users not shown)
Line 1:
{{Short description|Collection and manipulation of items of data to produce meaningful information}}
 
{{Short description|Collection and manipulation of items of data to produce meaningful information}}
{{other uses}}
<!--
Line 31 ⟶ 30:
=== Manual data processing ===
 
Although widespread use of the term ''data processing'' dates only from the 1950s, <ref name=DPuse>{{cite book|title=Google N gram viewer|url=https://books.google.com/ngrams/graph?content=data+processing&year_start=1800&year_end=2000&corpus=15&smoothing=3&share=|access-date=June 26, 2013}}</ref> data processing functions have been performed manually for millennia. For example, [[bookkeeping]] involves functions such as posting transactions and producing reports like the [[balance sheet]] and the [[cash flow statement]]. Completely manual methods were augmented by the application of [[mechanical calculator|mechanical]] or electronic [[calculator]]s. A person whose job was to perform calculations manually or using a calculator was called a "[[Human computer|computer]]."
 
The [[1890 United States Censuscensus]] schedule was the first to gather data by individual rather than [[household]]. A number of questions could be answered by making a check in the appropriate box on the form. From 1850 to 1880 the Census Bureau employed "a system of tallying, which, by reason of the increasing number of combinations of classifications required, became increasingly complex. Only a limited number of combinations could be recorded in one tally, so it was necessary to handle the schedules 5 or 6 times, for as many independent tallies."<ref name=Truesdell65>{{cite book|author1-link=Leon E. Truesdell|last=Truesdell|first=Leon E.|title=The development of punch card tabulation in the Bureau of the Census, 1890|year=1965|publisher=United States Department of Commerce|url=https://play.google.com/books/reader?id=MGZqAAAAMAAJ&printsec=frontcover&output=reader&authuser=0&hl=en&pg=GBS.PR1}}</ref> "It took over 7 years to publish the results of the 1880 census"<ref name=Bohme91>{{cite book|last1=Bohme|first1=Frederick|last2=Wyatt|first2=J. Paul|last3=Curry|first3=James P.|title=100 Years of Data Processing: The Punchcard Century|year=1991|publisher=United States Bureau of the Census|url=https://play.google.com/store/books/details?id=uCeu4sHRLfgC&rdid=book-uCeu4sHRLfgC&rdot=1}}</ref> using manual processing methods.
 
=== Automatic data processing ===
 
The term ''[[Electronic data processing|automatic data processing]]'' was applied to operations performed by means of [[unit record equipment]], such as [[Herman Hollerith]]'s application of [[punched card]] equipment for the [[1890 United States Censuscensus]]. "Using Hollerith's punchcard equipment, the Census Office was able to complete tabulating most of the 1890 census data in 2 to 3 years, compared with 7 to 8 years for the 1880 census. It is estimated that using Hollerith's system saved some $5 million in processing costs"<ref name=Bohme91 /> in 1890 dollars even though there were twice as many questions as in 1880.
 
=== Computerized data processing ===
 
Computerized data processing, or [[electronic data processing]] represents a later development, with a computer used instead of several independent pieces of equipment. The Census Bureau first made limited use of [[electronic computers]] for the [[1950 United States Censuscensus]], using a [[UNIVAC I]] system,<ref name=Truesdell65 /> delivered in 1952.
 
=== Other developments ===
Line 60 ⟶ 59:
In science and engineering, the terms ''data processing'' and ''[[information system]]s'' are considered too broad, and the term ''data processing'' is typically used for the initial stage followed by a [[data analysis]] in the second stage of the overall data handling.
 
Data analysis uses specialized [[algorithm]]s and [[statistical]] calculations that are less often observed in a typical general business environment. For data analysis, software suites like [[SPSS]] or [[SAS (software)|SAS]], or their free counterparts such as [[DAP (software)|DAP]], [[gretl]], or [[PSPP]] are often used. These tools are usually helpful for processing various huge data sets, as they are able to handle enormous amount of statistical analysis.<ref>{{Cite journal |last1=V |first1=Jalajakshi |last2=A n |first2=Myna |date=2022-06-01 |title=Importance of statistics to data science |journal=Global Transitions Proceedings |series=International Conference on Intelligent Engineering Approach(ICIEA-2022) |volume=3 |issue=1 |pages=326–331 |doi=10.1016/j.gltp.2022.03.019 |issn=2666-285X|doi-access=free }}</ref>
 
==Systems==