m Open access bot: add arxiv identifier to citation with #oabot. |
Citation bot (talk | contribs) m Alter: isbn, journal. Add: series, pmc, pmid. Removed accessdate with no specified URL. Removed parameters. You can use this bot yourself. Report bugs here. | User-activated. |
Line 3:
==History==
In 1925, [[Ronald Fisher]] mentioned the two-way ANOVA in his celebrated book ''[[Statistical Methods for Research Workers]]'' (chapters 7 and 8). In 1934, [[Frank Yates]] published procedures for the unbalanced case.<ref>{{cite journal |last=Yates |first=Frank |date=March 1934 |title=The analysis of multiple classifications with unequal numbers in the different classes |jstor=2278459 |journal=Journal of the
==Data set==
Line 13:
==Model==
Upon observing variation among all <math>n</math> data points, for instance via a [[histogram]], "[[Probability theory|probability]] may be used to describe such variation".<ref>{{cite journal |last=Kass |first=Robert E |date=1 February 2011 |title=Statistical inference: The big picture |url=http://projecteuclid.org/euclid.ss/1307626554 |journal=[[Statistical Science
<math>Y_{ijk} \, | \, \mu_{ij}, \sigma^2 \; \overset{i.i.d.}{\sim} \; \mathcal{N}(\mu_{ij}, \sigma^2)</math>.
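The sampling model above can be illustrated with a short simulation. This is only a minimal sketch under stated assumptions: the design is taken to be balanced, and the function name <code>simulate_two_way</code>, the cell means in <code>mu</code>, the standard deviation and the number of replicates are all hypothetical values chosen for illustration, not taken from any source.

```python
import random
from statistics import fmean


def simulate_two_way(mu, sigma, k, seed=0):
    """Draw k i.i.d. observations per cell from N(mu[i][j], sigma^2).

    mu    : nested list of cell means, one row per level of the first factor
            and one column per level of the second (hypothetical values)
    sigma : standard deviation shared by every cell
    k     : replicates per cell (a balanced design is assumed here)
    Returns y with y[i][j][r] playing the role of Y_ijk.
    """
    rng = random.Random(seed)
    return [[[rng.gauss(m, sigma) for _ in range(k)] for m in row] for row in mu]


# Hypothetical 2 x 3 layout of cell means.
mu = [[10.0, 12.0, 14.0],
      [11.0, 15.0, 13.0]]

y = simulate_two_way(mu, sigma=1.0, k=2000)

# With many replicates, each sample cell mean should lie close to its mu[i][j].
cell_means = [[fmean(cell) for cell in row] for row in y]
```

Because every observation within a cell shares the same mean and variance, the sample cell means serve as natural estimates of the <math>\mu_{ij}</math>.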
Line 28:
==Assumptions==
Following Gelman and Hill, the assumptions of the ANOVA, and more generally the [[general linear model]], are, in decreasing order of importance:<ref>{{cite book |last=Gelman |first=Andrew |last2=Hill |first2=Jennifer |date=18 December 2006 |title= Data Analysis Using Regression and Multilevel/Hierarchical Models |url=http://www.cambridge.org/us/academic/subjects/statistics-probability/statistical-theory-and-methods/data-analysis-using-regression-and-multilevelhierarchical-models |publisher=[[Cambridge University Press]] |pages=45–46 |isbn=978-0521867061 }}</ref>
# the data points are relevant with respect to the scientific question under investigation;
# the mean of the response variable is influenced additively (if there is no interaction term) and linearly by the factors;
Line 48:
-->
Testing whether the interaction term is significant can be difficult because of the potentially large number of [[degrees of freedom (statistics)|degrees of freedom]].<ref>{{cite journal |author=Yi-An Ko|date=September 2013 |title=Novel Likelihood Ratio Tests for Screening Gene-Gene and Gene-Environment Interactions with Unbalanced Repeated-Measures Data |journal=Genetic
==See also==
Line 62:
{{Reflist}}
== References ==
* {{cite book |author=[[George Casella]] |date=18 April 2008 |title=Statistical design |url=https://www.springer.com/statistics/statistical+theory+and+methods/book/978-0-387-75964-7 |publisher=[[Springer Science+Business Media|Springer]] |isbn=978-0-387-75965-4 |series=Springer Texts in Statistics }}
[[Category:Analysis of variance]]