Content deleted Content added
m Reverted edits by 2001:8F8:1D01:206F:25AB:D64C:B086:A9D0 (talk): disruptive edits (HG) (3.4.12) |
Symbol is already linked in the main data article |
||
(37 intermediate revisions by 33 users not shown) | |||
Line 2:
{{other uses|Data (disambiguation)|Datum (disambiguation)}}
{{broader|Data}}
{{Merge to|Digital data|discuss=Talk:Digital data#Proposed merge of Data (computer science) into Digital data|date=March 2025}}
[[File:Data types - en.svg|thumb|Various types of data which can be visualized through a computer device.]]
In [[computer science]], '''data''' (treated as singular, plural, or as a [[mass noun]]) is [[Data|any sequence of one or more
Data exists in three states: [[data at rest]], [[data in transit]] and [[data in use]].
Physical [[computer memory]] elements consist of an address and a byte/word of data storage. Digital data are often stored in [[Relational database#RDBMS|relational databases]], like [[table (database)|tables]] or SQL databases, and can generally be represented as abstract key/value pairs. Data can be organized in many different types of [[data structure]]s, including arrays, [[Graph (abstract data type)|graphs]], and [[Object (computer science)|objects]]. Data structures can store data of many different [[data type|types]], including [[
==Characteristics==
[[Metadata]] helps translate data to information. Metadata is data about the data. Metadata may be implied, specified or given.
Data relating to physical events or processes will have a temporal component. This temporal component may be implied. This is the case when a device such as a temperature logger receives data from a temperature [[sensor]]. When the temperature is received it is assumed that the data has a temporal reference of ''now''. So the device records the date, time and temperature together.
Fundamentally, computers follow a sequence of instructions they are given in the form of data.
To store data [[byte]]s in a file, they have to be [[Serialization|serialized]] in a [[file format]]. Typically, programs are stored in special file types, different from those used for other data.
The line between program and data can become blurry.
For example, a [[user (computing)|user]] might first instruct the [[operating system]] to load a [[word processor]] program from one file, and then use the running program to open and edit a [[Document file format|document]] stored in another file.
In an alternate usage, [[binary file]]s (which are not [[Human-readable medium|human-readable]]) are sometimes called ''data'' as distinguished from human-readable ''[[text file|text]]''.<ref>{{cite web|url=https://man.openbsd.org/file.1|title=file(1)|work=OpenBSD manual pages|date=24 December 2015|access-date=4 February 2018|url-status=live|archive-url=https://web.archive.org/web/20180205000843/https://man.openbsd.org/file.1|archive-date=5 February 2018}}</ref>
Line 28 ⟶ 29:
==Data keys and values, structures and persistence==
Keys in data provide the context for values.
Data can be represented in computers in multiple ways, as per the following examples:
===RAM===
* [[Random access memory]] (RAM) holds data that the CPU has direct access to. A CPU may only manipulate data within its [[processor register]]s or memory.
===Keys===
* Data keys need not be a direct hardware address in memory. [[Indirection|Indirect]], abstract and logical keys codes can be stored in association with values to form a [[data structure]].
===Organised recurring data structures===
* The [[Table (information)|tabular]] view of repeating data structures is only one of many possibilities.
===Sorted or ordered data===
Line 48 ⟶ 49:
===Indexed data===
* Retrieving a small subset of data from a much larger set may imply inefficiently searching through the data sequentially.
===Abstraction and indirection===
Line 60 ⟶ 61:
===Parallel distributed data processing===
* Modern scalable and high-performance data persistence technologies, such as [[Apache Hadoop]], rely on massively parallel distributed data processing across many commodity computers on a high bandwidth network. In such systems, the data is distributed across multiple computers and therefore any particular computer in the system must be represented in the key of the data, either directly, or indirectly.
==See also==
Line 81 ⟶ 82:
{{Authority control}}
[[Category:Computer data| ]]
|