Data set: Difference between revisions

Content deleted Content added
Bender the Bot (talk | contribs)
m Classic data sets: HTTP to HTTPS for Wayback Machine, replaced: http://web.archive.org/ → https://web.archive.org/ (7)
Delete sentence that makes no sense. Feel free to reword and restore if you know what it was trying to say.
Tag: references removed
Line 4:
A '''data set''' (or '''dataset''') is a collection of [[data]]. In the case of tabular data, a data set corresponds to one or more [[table (database)|database tables]], where every [[column (database)|column]] of a table represents a particular [[Variable (computer science)|variable]], and each [[row (database)|row]] corresponds to a given [[Record (computer science)|record]] of the data set in question. The data set lists values for each of the variables, such as the height and weight of an object, for each member of the data set. Data sets can also consist of a collection of documents or files.<ref name="Editorial">{{cite journal | last1 = Snijders | first1 = C. | last2 = Matzat | first2 = U. | last3 = Reips | first3 = U.-D. | year = 2012 | title = 'Big Data': Big gaps of knowledge in the field of Internet | url = http://www.ijis.net/ijis7_1/ijis7_1_editorial.html | journal = International Journal of Internet Science | volume = 7 | pages = 1–5 }}</ref>
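
The tabular case described above can be illustrated with a minimal sketch. The variable names, the sample values, and the helper functions <code>column_values</code> and <code>record_as_dict</code> are hypothetical and chosen only for illustration; they do not come from any particular data set or library.

```python
# Hypothetical tabular data set: each column is a variable,
# each row is one record (one member of the data set).
columns = ["name", "height_cm", "weight_kg"]
rows = [
    ("object_a", 170.0, 65.0),
    ("object_b", 182.5, 80.2),
]


def column_values(rows, columns, variable):
    """Return the values of one variable across all records."""
    index = columns.index(variable)
    return [row[index] for row in rows]


def record_as_dict(row, columns):
    """Pair each value in a record with the variable it belongs to."""
    return dict(zip(columns, row))


heights = column_values(rows, columns, "height_cm")
first_record = record_as_dict(rows[0], columns)
```

Reading down a column gives every value of one variable, while reading across a row gives every variable's value for one record.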
 
In the [[open data]] discipline, a data set is the unit used to measure the information released in a public open data repository. The European [[data.europa.eu]] portal aggregates more than a million data sets.<ref>{{Cite web|url=http://www.europeandataportal.eu/data/en/dataset|title=European open data portal|website=European open data portal|publisher=European Commission|access-date=2016-09-23}}</ref> Other issues ([[Real-time data|real-time data sources]],<ref name=":0">{{Cite journal|last=Atz|first=U|date=2014|title=The tau of data: A new metric to assess the timeliness of data in catalogues|url=http://duweb.donau-uni.ac.at/imperia/md/content/department/gpa/zeg/bilder/cedem/cedem14/cedem14_proceedings.pdf#page=258 |archive-url=https://web.archive.org/web/20160820031406/http://duweb.donau-uni.ac.at/imperia/md/content/department/gpa/zeg/bilder/cedem/cedem14/cedem14_proceedings.pdf |archive-date=2016-08-20 |url-status=live|journal=CEDEM 2014 Proceedings|access-date=2016-08-01}}</ref> [[NoSQL|non-relational]] data sets, etc.) increase the difficulty of reaching a consensus about it.<ref name=":0" />
 
==Properties==