Probability distribution fitting: Difference between revisions

Browse history interactively

← Previous edit

Content deleted Content added

VisualWikitext

Revision as of 18:47, 5 February 2024 edit Citation bot (talk \| contribs) Bots 5,868,529 edits Alter: journal, pages. Add: issue, authors 1-1. Removed parameters. Some additions/deletions were parameter name changes. \| Use this bot. Report bugs. \| Suggested by Headbomb \| Category:CS1 errors: extra text: pages \| #UCB_Category 1/2 ← Previous edit		Latest revision as of 07:45, 17 April 2025 edit undo Helper201 (talk \| contribs) Extended confirmed users 93,372 edits Stopped an image intruding on the references section.
(4 intermediate revisions by 4 users not shown)
Line 78: == Uncertainty of prediction == [[File:BinomialConfBelts.jpg\|thumb\|<small>Uncertainty analysis with confidence belts using the binomial distribution </small><ref>Frequency predictions and their binomial confidence limits. In: International Commission on Irrigation and Drainage, Special Technical Session: Economic Aspects of Flood Control and non-Structural Measures, Dubrovnik, Yugoslavia, 1988. [http://www.waterlog.info/pdf/binomial.pdf On line]</ref>]] Predictions of occurrence based on fitted probability distributions are subject to [[uncertainty]], which arises from the following conditions: Line 94: A Bayesian approach can be used for fitting a model <math>P(x\|\theta)</math> having a prior distribution <math>P(\theta)</math> for the parameter <math>\theta</math>. When one has samples <math>X</math> that are independently drawn from the underlying distribution then one can derive the so-called posterior distribution <math>P(\theta\|X)</math>. This posterior can be used to update the probability mass function for a new sample <math>x</math> given the observations <math>X</math>, one obtains <math display="block">P_\theta (x \| X) := \int d\theta\ P(x\|\theta)\ P(\theta\|X) .</math>. The variance of the newly obtained probability mass function can also be determined. The variance for a Bayesian probability mass function can be defined as <math display="block">\sigma_{P_\theta(x\|X)}^2 := \int d\theta\ \left[ P(x\|\theta) - P_\theta(x\|X) \right]^2\ P(\theta\|X).</math>. This expression for the variance can be substantially simplified (assuming independently drawn samples). Defining the "self probability mass function" as <math display="block">P_\theta(x\|\left\{X,x\right\}) = \int d\theta\ P(x\|\theta)\ P(\theta\|\left\{X, x\right\}),</math>, one obtains for the variance <ref>{{Cite journal \|last1=Pijlman \|last2=Linnartz \|date=2023 \|title=Variance of Likelihood of data \|url=https://sitb2023.ulb.be/proceedings/ \|journal=SITB 2023 Proceedings \|pages=34}}</ref> <math display="block">\sigma_{P_\theta(x\|X)}^2 = P_\theta(x\|X) \left[ P_\theta(x\|\left\{X,x\right\}) - P_\theta(x\|X) \right].</math>. The expression for variance involves an additional fit that includes the sample <math>x</math> of interest.[[File:CumList.png\|thumb\|left\|List of probability distributions ranked by goodness of fit.<ref>[https://www.waterlog.info/cumfreq.htm Software for probability distribution fitting]</ref>]] [[File:GEVdistrHistogr+Density.png\|thumb\|220px\|Histogram and probability density of a data set fitting the [[GEV distribution]] ]] Line 125: * [[Mixture distribution]] * [[Product distribution]] {{clear}} == References ==