Content deleted Content added
Tag: references removed |
|||
Line 16:
Several classic data sets have been used extensively in the [[statistical]] literature:
* [[Iris flower data set]] – Multivariate data set introduced by [[Ronald Fisher]] (1936).<ref name="fisher36">{{cite journal|author=Fisher, R.A. |title=The Use of Multiple Measurements in Taxonomic Problems| journal=[[Annals of Eugenics]]| volume=7 |pages=179–188| year=
* [[MNIST database]] – Images of handwritten digits commonly used to test classification, clustering, and image processing algorithms
* ''[[Categorical data analysis]]'' – Data sets used in the book, ''An Introduction to Categorical Data Analysis''.
*''[[Robust statistics]]'' – Data sets used in ''[[Robust Regression and Outlier Detection]]'' ([[Peter Rousseeuw|Rousseeuw]] and Leroy,
*''[[Time series]]'' – Data used in Chatfield's book, ''The Analysis of Time Series'', are [http://lib.stat.cmu.edu/modules.php?op=modload&name=PostWrap&file=index&page=datasets/ provided on-line by StatLib.]
*''Extreme values'' – Data used in the book, ''An Introduction to the Statistical Modeling of Extreme Values'' are [https://web.archive.org/web/20060910161517/http://homes.stat.unipd.it/coles/public_html/ismev/ismev.dat a snapshot of the data as it was provided on-line by Stuart Coles], the book's author.
|