Restricted randomization: Difference between revisions

Browse history interactively

← Previous edit

Content deleted Content added

VisualWikitext

Revision as of 12:38, 2 April 2012 edit Northamerica1000 (talk \| contribs) Autopatrolled, Administrators 708,152 edits wf: statistics ← Previous edit		Latest revision as of 20:59, 20 August 2025 edit undo Citation bot (talk \| contribs) Bots 5,868,562 edits Add: doi, issue, date. \| Use this bot. Report bugs. \| Suggested by Abductive \| Category:Articles with incomplete citations from November 2012 \| #UCB_Category 31/217
(33 intermediate revisions by 24 users not shown)
Line 1: {{more footnotes\|date=April 2012}} ~~{{articleissues~~ ~~\|lead rewrite=September 2009~~ ~~\|expert=March 2012~~ ~~\|howto=April 2012}}~~ In [[statistics]], '''restricted randomization''' occurs in the [[design of experiments]] and in particular in the context of [[randomized experiment]]s and [[randomized controlled trial]]s. Restricted randomization allows intuitively poor allocations of treatments to experimental units to be avoided, while retaining the theoretical benefits of randomization.<ref>{{cite book\|last1=Dodge\|first1=Y.\|title=The Oxford Dictionary of Statistical Terms\|publisher=OUP\|year=2006\|isbn=978-0-19-920613-1\|url-access=registration\|url=https://archive.org/details/oxforddictionary0000unse}}</ref><ref>{{cite journal\|last1=Grundy\|first1=P.M.\|last2=Healy\|first2=M.J.R.\|author-link2=Michael Healy (statistician)\|title=Restricted randomization and quasi-Latin squares\|journal=Journal of the Royal Statistical Society, Series B\|date=1950 \|volume=12\|issue=2 \|pages=286–291 \|doi=10.1111/j.2517-6161.1950.tb00062.x }}</ref> For example, in a [[clinical trial]] of a new proposed treatment of obesity compared to a control, an experimenter would want to avoid outcomes of the randomization in which the new treatment was allocated only to the heaviest patients. In [[statistics]], '''restricted randomization''' occurs in [[experimental design]]s with more than one source of [[statistical dispersion\|variability]]. The concept was introduced by [[Frank Yates]] (1948) and [[William J. Youden]] (1972) "as a way of avoiding bad spatial patterns of treatments in designed experiments."<ref name="ref1">Bailey, R. A. [http://www.jstor.org/discover/10.2307/2288775?uid=3739808&uid=2&uid=4&uid=3739256&sid=21100687318461 Restricted Randomization: A Practical Example], ''Journal of the American Statistical Association'', Vol. 82, No. 399 (Sep., 1987), pp. 712–719, at 712</ref> The concept was introduced by [[Frank Yates]] (1948){{full citation needed\|date=November 2012}} and [[William J. Youden]] (1972){{full citation needed\|date=November 2012}} "as a way of avoiding bad spatial patterns of treatments in designed experiments."<ref name="ref1">{{Cite journal \|jstor = 2288775\|title = Restricted Randomization: A Practical Example\|last1 = Bailey\|first1 = R. A.\|journal = Journal of the American Statistical Association\|year = 1987\|volume = 82\|issue = 399\|pages = 712–719\|doi = 10.1080/01621459.1987.10478487}}</ref> In order to [[variance reduction\|reduce variation]] in processes, these multiple sources must be understood, and that often leads to the concept of nested or hierarchical data structures. For example, in the [[semiconductor]] industry, a [[batch production\|batch process]] may operate on several [[wafer (electronics)\|wafers]] at a time (wafers are said to be '''nested''' within batch). Understanding the input variables that control variation among those wafers, as well as understanding the variation across each wafer in a run, is an important part of the strategy for minimizing the total variation in the system. ==Example of nested data== Line 89 ⟶ 86: There are 16 subplot experimental units for this experiment. Solution temperature and current are the subplot factors in this experiment. There are four whole-plot experimental units in this experiment. Solution concentration is the whole-plot factor in this experiment. Since there are two sizes of experimental units, there are two error terms in the model, one that corresponds to the whole-plot error or whole-plot experimental unit and one that corresponds to the subplot error or subplot experimental unit. The [[ANOVA]] table for this experiment would look, in part, as follows: {\| class="wikitable" \|+ Partial ANOVA table Line 102 ⟶ 99: \|1 \|- \|Error (~~Whole~~whole plot) = Rep* × Conc \|1 \|- Line 108 ⟶ 105: \|1 \|- \|Rep* × Temp \|1 \|- Line 114 ⟶ 111: \|1 \|- \|Rep* × Current \|1 \|- \|Temp* × Conc \|1 \|- \|Rep* × Temp* × Conc \|1 \|- \|Temp* × Current \|1 \|- \|Rep* × Temp* × Current \|1 \|- \|Current* × Conc \|1 \|- \|Rep* × Current* × Conc \|1 \|- \|Temp* × Current* × Conc \|1 \|- \|Error (Subplot) = Rep* × Temp* × Current* × Conc \|1 \|} Line 170 ⟶ 167: \|1 \|- \|Error (~~Whole~~whole plot) = Conc* × Temp \|1 \|- Line 176 ⟶ 173: \|1 \|- \|Conc* × Current \|1 \|- \|Temp* × Current \|1 \|- \|Conc* × Temp* × Current \|1 \|- \|Error (~~Subplot~~subplot) \|8 \|} Line 214 ⟶ 211: \|1 \|- \|Error (~~Whole~~whole plot) \|4 \|- Line 220 ⟶ 217: \|1 \|- \|Conc* × Current \|1 \|- \|Temp* × Current \|1 \|- \|Conc* × Temp* × Current \|1 \|- \|Error (~~Subplot~~subplot) \|4 \|} Line 254 ⟶ 251: ==See also== {{~~portal~~Portal\|~~Statistics~~Mathematics}} * [[Hierarchical linear modeling]] * [[Mixed-design analysis of variance]] Line 262 ⟶ 259: ==References== {{reflist}} * {{cite web \| url=http://www.itl.nist.gov/div898/handbook/pri/section5/pri55.htm \| title=How can I account for nested variation (restricted randomization)? \| publisher=(U.S.) National Institute of Standards and Technology: Information Technology Laboratory \| ~~accessdate~~access-date=March 26, 2012}} ==Further reading== For a more detailed discussion of these designs and the appropriate analysis procedures, see: * {{cite book \|~~author~~author1 = Milliken, G. A. \|~~coauthors~~author2 = Johnson, D. E. \|year = 1984 \|title = Analysis of Messy Data Line 281 ⟶ 278: \|volume = 39 \|issue = 2 ~~\|page = 153~~ \|doi = 10.2307/1270903 ~~\|publisher = Technometrics, Vol. 39, No. 2~~ \|jstor = 1270903 \|pages = 153–161}} ==External links== * [https://www.southampton.ac.uk/~cpd/anovas/datasets/index.htm Examples of all ANOVA and ANCOVA models with up to three treatment factors, including randomized block, split plot, repeated measures, and Latin squares, and their analysis in R] {{Statistics}} {{Experimental design}} Line 295 ⟶ 292: [[Category:Analysis of variance]] [[Category:Design of experiments]] ~~[[Category:Statistical models]]~~