Universal approximation theorem

In the [[mathematics|mathematical]] theory of [[neural networks]], the '''universal approximation theorem''' states<ref>Balázs Csanád Csáji. Approximation with Artificial Neural Networks; Faculty of Sciences; Eötvös Loránd University, Hungary</ref> that a [[feedforward neural network|feed-forward]] network with a single hidden layer containing a finite number of [[neuron]]s (i.e., a [[multilayer perceptron]]) can approximate [[continuous functions]] on [[Compact_space|compact subsets]] of [[Euclidean space|'''R'''<sup>n</sup>]], under mild assumptions on the activation function. The theorem thus states that simple neural networks can ''represent'' a wide variety of interesting functions when given appropriate parameters; it does not touch upon the algorithmic [[Computational learning theory|learnability]] of those parameters.
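The statement can be illustrated numerically. The sketch below (an assumption-laden demonstration, not part of any proof) builds a single-hidden-layer sigmoid network for the continuous target sin(2πx) on the compact set [0, 1]: hidden weights and biases are fixed at randomly chosen values, and only the output-layer coefficients are fitted by least squares. The target function, the number of hidden units, and the weight scales are all illustrative choices.

```python
import numpy as np

# Illustrative sketch of the universal approximation theorem:
# approximate f(x) = sin(2*pi*x) on [0, 1] with a one-hidden-layer
# network  g(x) = sum_j c_j * sigma(w_j * x + b_j), sigma = logistic sigmoid.
rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

n_hidden = 200                                # number of hidden neurons (assumed)
w = rng.normal(scale=10.0, size=n_hidden)     # fixed random hidden weights
b = rng.uniform(-10.0, 10.0, size=n_hidden)   # fixed random hidden biases

x = np.linspace(0.0, 1.0, 500)
target = np.sin(2 * np.pi * x)

# Matrix of hidden activations; fit output weights c by least squares.
H = sigmoid(np.outer(x, w) + b)               # shape (500, n_hidden)
c, *_ = np.linalg.lstsq(H, target, rcond=None)

max_err = np.max(np.abs(H @ c - target))
print(max_err)
```

With enough hidden units the maximum error on the grid becomes small, in line with the theorem; note that the least-squares fit here is a convenience, since the theorem itself guarantees only the existence of good parameters, not a procedure for finding them.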
 
One of the first versions of the [[theorem]] was proved by [[George Cybenko]] in 1989 for [[sigmoid function|sigmoid]] activation functions.<ref name=cyb>Cybenko, G. (1989) [http://deeplearning.cs.cmu.edu/pdfs/Cybenko.pdf "Approximation by superpositions of a sigmoidal function"], ''[[Mathematics of Control, Signals, and Systems]]'', 2 (4), 303–314</ref>