Content deleted Content added
→Tukey-Lambda PPCC Plot for Symmetric Distributions: fixed incorrect capitals per WP:MOS |
m Disambiguating links to Probability plot (link removed; intentional link to DAB) using DisamAssist. |
||
(22 intermediate revisions by 15 users not shown) | |||
Line 1:
The '''probability plot correlation coefficient
Many [[statistical analysis|statistical analyses]] are based on distributional assumptions about the [[population (statistics)|population]] from which the data have been obtained. However, distributional families can have radically different shapes depending on the value of the [[shape parameter]]. Therefore, finding a reasonable choice for the shape parameter is a necessary step in the analysis. In many analyses, finding a good distributional model for the data is the primary focus of the analysis.
The technique is simply "plot the [[probability plot correlation coefficient]]s for different values of the shape parameter, and choose whichever value yields the best fit".
▲The '''probability plot correlation coefficient''' (PPCC) plot is a [[graphical technique]] for identifying the shape parameter for a distributional family that best describes the data set. This technique is appropriate for families, such as the [[Weibull distribution|Weibull]], that are defined by a single shape parameter and [[___location parameter|___location]] and [[scale parameter]]s, and it is not appropriate for distributions, such as the [[normal distribution|normal]], that are defined only by ___location and scale parameters.
==Definition==
The PPCC plot is formed by:
*Vertical axis: [[Probability plot correlation coefficient]];
*Horizontal axis: Value of shape parameter.
That is, for a series of values of the shape parameter, the [[Pearson product-moment correlation coefficient|correlation coefficient]] is computed for the
The PPCC plot is used first to find a good value of the shape parameter. The probability plot is then generated to find estimates of the ___location and scale parameters and in addition to provide a graphical assessment of the adequacy of the distributional fit.
Line 15 ⟶ 17:
#Does the best-fit member provide a good fit (in terms of generating a probability plot with a high correlation coefficient)?
#Does this distributional family provide a good fit compared to other distributions?
#How sensitive is the choice of the shape parameter?
==Comparing distributions==
In addition to finding a good choice for estimating the shape parameter of a given distribution, the PPCC plot can be useful in deciding which distributional family is most appropriate. For example, given a set of [[Reliability (statistics)|reliability]] data, one might generate PPCC plots for a Weibull, [[lognormal distribution|lognormal]], [[gamma distribution|gamma]], and [[inverse Gaussian distribution]]s, and possibly others, on a single page. This one page would show the best value for the shape parameter for several distributions and would additionally indicate which of these distributional families provides the best fit (as measured by the maximum probability plot correlation coefficient). That is, if the maximum PPCC value for the Weibull is 0.99 and only 0.94 for the lognormal, then one could reasonably conclude that the Weibull family is the better choice.
When comparing distributional models, one should not simply choose the one with the maximum PPCC value. In many cases, several distributional fits provide comparable PPCC values. For example, a lognormal and Weibull may both fit a given set of reliability data quite well. Typically, one would consider the complexity of the distribution. That is, a simpler distribution with a marginally smaller PPCC value may be preferred over a more complex distribution. Likewise, there may be theoretical justification in terms of the underlying scientific model for preferring a distribution with a marginally smaller PPCC value in some cases. In other cases, one may not need to know if the distributional model is optimal, only that it is adequate for our purposes. That is, one may be able to use techniques designed for normally distributed data even if other distributions fit the data somewhat better.
==Tukey-
{{
The Tukey
#
#
#
#
#
If the Tukey
The Tukey-
==See also==
*[[Probability plot (disambiguation)|Probability plot]]
==External links==
Line 44 ⟶ 46:
|last=Filliben
|first=J. J.
|date=February 1975
|title = The Probability Plot Correlation Coefficient Test for Normality
|journal = Technometrics
|pages =
|doi = 10.2307/1268008
|volume = 17
|issue = 1
|jstor=1268008
}}
{{NIST-PD}}
[[Category:Statistical charts and diagrams]]
|