Content deleted Content added
ClueBot NG (talk | contribs) m Reverting possible vandalism by 93.37.81.101 to version by Rlink2. Report False Positive? Thanks, ClueBot NG. (4159936) (Bot) |
Copy editing. |
||
Line 17:
<ol start=1>
<li>For all <math>0 \leq t \leq 1</math> and all <math>x_1, x_2 \in X</math>:
<math display=block>f\left(
The right hand side represents the straight line between <math>\left(
</li>
<li>For all <math>0 < t < 1</math> and all <math>x_1, x_2 \in X</math> such that <math>x_1 \neq x_2</math>:
<math display=block>f\left(
The difference of this second condition with respect to the first condition above is that this condition does not include the intersection points (
</li>
</ol>
The second statement characterizing convex functions that are valued in the real line <math>\R</math> is also the statement used to define '''{{em|convex functions}}''' that are valued in the [[extended real number line]] <math>[-\infty, \infty] = \R \cup \{
The second statement can also be modified to get the definition of {{em|strict convexity}}, where the latter is obtained by replacing <math>\,\leq\,</math> with the strict inequality <math>\,<.</math>
Explicitly, the map <math>f</math> is called '''{{em|strictly convex}}''' if and only if for all real <math>0 < t < 1</math> and all <math>x_1, x_2 \in X</math> such that <math>x_1 \neq x_2</math>:
<math display=block>f\left(
A strictly convex function <math>f</math> is a function that the straight line between any pair of points on the curve <math>f</math> is above the curve <math>f</math> except for the intersection points between the straight line and the curve.
Line 47:
* Suppose <math>f</math> is a function of one [[real number|real]] variable defined on an interval, and let <math display=block>R(x_1, x_2) = \frac{f(x_2) - f(x_1)}{x_2 - x_1}</math> (note that <math>R(x_1, x_2)</math> is the slope of the purple line in the above drawing; the function <math>R</math> is [[Symmetric function|symmetric]] in <math>(x_1, x_2),</math> means that <math>R</math> does not change by exchanging <math>x_1</math> and <math>x_2</math>). <math>f</math> is convex if and only if <math>R(x_1, x_2)</math> is [[monotonically non-decreasing]] in <math>x_1,</math> for every fixed <math>x_2</math> (or vice versa). This characterization of convexity is quite useful to prove the following results.
* A convex function <math>f</math> of one real variable defined on some [[open interval]]
* A differentiable function of one variable is convex on an interval if and only if its [[derivative]] is [[monotonically non-decreasing]] on that interval. If a function is differentiable and convex then it is also [[continuously differentiable]].
* A [[Differentiable function|differentiable]] function of one variable is convex on an interval if and only if its graph lies above all of its [[tangent]]s:<ref name="boyd">{{cite book| title=Convex Optimization| first1=Stephen P.|last1=Boyd |first2=Lieven| last2=Vandenberghe | year = 2004 |publisher=Cambridge University Press| isbn=978-0-521-83378-3| url= https://web.stanford.edu/~boyd/cvxbook/bv_cvxbook.pdf#page=83 |format=pdf | access-date=October 15, 2011}}</ref>{{rp|69}} <math display=block>f(x) \geq f(y) + f'(y) (x-y)</math> for all
* A twice differentiable function of one variable is convex on an interval if and only if its [[second derivative]] is non-negative there; this gives a practical test for convexity. Visually, a twice differentiable convex function "curves up", without any bends the other way ([[inflection point]]s). If its second derivative is positive at all points then the function is strictly convex, but the [[Theorem#Converse|converse]] does not hold. For example, the second derivative of <math>f(x) = x^4</math> is <math>f''(x) = 12x^{2}</math>, which is zero for <math>x = 0,</math> but <math>x^4</math> is strictly convex.
**This property and the above property in terms of "...its derivative is monotonically non-decreasing..." are not equal since if <math>f''</math> is non-negative on an interval <math>X</math> then <math>f'</math> is monotonically non-decreasing on <math>X</math> while its converse is not true, for example, <math>f'</math> is monotonically non-decreasing on <math>X</math> while its derivative <math>f''</math> is not defined at some points on <math>X</math>.
Line 57:
}}
* A function is midpoint convex on an interval <math>C</math> if for all <math>x_1, x_2 \in C</math> <math display=block>f\left(
=== Functions of several variables ===
* A function <math>f : X \to [-\infty, \infty]</math> valued in the [[extended real number]]s <math>[-\infty, \infty] = \R \cup \{
* A differentiable function <math>f</math> defined on a convex ___domain is convex if and only if <math>f(x) \geq f(y) + \nabla f(y) \cdot (x-y)</math> holds for all <math>x, y</math> in the ___domain.
* A twice differentiable function of several variables is convex on a convex set if and only if its [[Hessian matrix]] of second [[partial derivative]]s is [[Positive-definite matrix|positive semidefinite]] on the interior of the convex set.
* For a convex function <math>f,</math> the [[sublevel set]]s <math>\{
* Consequently, the set of [[Arg min|global minimisers]] of a convex function <math>f</math> is a convex set: <math>{\operatorname{argmin}}\,f</math> - convex.
* Any [[local minimum]] of a convex function is also a [[global minimum]]. A {{em|strictly}} convex function will have at most one global minimum.<ref>{{cite web | url=https://math.stackexchange.com/q/337090 | title=If f is strictly convex in a convex set, show it has no more than 1 minimum | publisher=Math StackExchange | date=21 Mar 2013 | access-date=14 May 2016}}</ref>
* [[Jensen's inequality]] applies to every convex function <math>f</math>. If <math>X</math> is a random variable taking values in the ___domain of <math>f,</math> then <math>\operatorname{E}(f(X)) \geq f(\operatorname{E}(X)),</math> where <math>\operatorname{
* A first-order [[homogeneous function]] of two positive variables <math>x</math> and <math>y,</math> (that is, a function satisfying <math>f(a x, a y) = a f(x, y)</math> for all positive real <math>a, x, y > 0</math>) that is convex in one variable must be convex in the other variable.<ref>Altenberg, L., 2012. Resolvent positive linear operators exhibit the reduction phenomenon. Proceedings of the National Academy of Sciences, 109(10), pp.3705-3710.</ref>
Line 92:
The concept of strong convexity extends and parametrizes the notion of strict convexity. A strongly convex function is also strictly convex, but not vice versa.
A differentiable function <math>f</math> is called strongly convex with parameter
<math display=block>(\nabla f(x) - \nabla f(y) )^T (x-y) \ge m \|x-y\|_2^2 </math>
or, more generally,
Line 101:
<math display=block>f(y) \ge f(x) + \nabla f(x)^T (y-x) + \frac{m}{2} \|y-x\|_2^2 </math>
It is not necessary for a function to be differentiable in order to be strongly convex. A third definition<ref name="nesterov"/> for a strongly convex function, with parameter
<math display=block>f(tx+(1-t)y) \le t f(x)+(1-t)f(y) - \frac{1}{2} m t(1-t) \|x-y\|_2^2</math>
Notice that this definition approaches the definition for strict convexity as
If the function <math>f</math> is twice continuously differentiable, then it is strongly convex with parameter
Assuming still that the function is twice continuously differentiable, one can show that the lower bound of <math>\nabla^2 f(x)</math> implies that it is strongly convex. Using [[Taylor's theorem|Taylor's Theorem]] there exists
Line 121:
The distinction between convex, strictly convex, and strongly convex can be subtle at first glance. If <math>f</math> is twice continuously differentiable and the ___domain is the real line, then we can characterize it as follows:
*<math>f</math> convex if and only if <math>f''(x) \ge 0</math> for all
*<math>f</math> strictly convex if <math>f''(x) > 0</math> for all
*<math>f</math> strongly convex if and only if <math>f''(x) \ge m > 0</math> for all
For example, let <math>f</math> be strictly convex, and suppose there is a sequence of points <math>(x_n)</math> such that <math>f''(x_n) = \tfrac{1}{n}</math>. Even though <math>f''(x_n) > 0</math>, the function is not strongly convex because <math>f''(x)</math> will become arbitrarily small.
Line 133:
===Uniformly convex functions===
A uniformly convex function,<ref name="Zalinescu">{{cite book|title=Convex Analysis in General Vector Spaces|author=C. Zalinescu|publisher=World Scientific|year=2002|isbn=9812380671}}</ref><ref name="Bauschke">{{cite book|page=[https://archive.org/details/convexanalysismo00hhba/page/n161 144]|title=Convex Analysis and Monotone Operator Theory in Hilbert Spaces |url=https://archive.org/details/convexanalysismo00hhba|url-access=limited|author=H. Bauschke and P. L. Combettes |publisher=Springer |year=2011 |isbn=978-1-4419-9467-7}}</ref> with modulus <math>\phi</math>, is a function <math>f</math> that, for all
<math display=block>f(tx+(1-t)y) \le t f(x)+(1-t)f(y) - t(1-t) \phi(\|x-y\|)
where <math>\phi</math> is a function that is non-negative and vanishes only at 0. This is a generalization of the concept of strongly convex function; by taking <math>\phi(\alpha) = \tfrac{m}{2} \alpha^2</math> we recover the definition of strong convexity.
Line 143:
* The function <math>f(x)=x^2</math> has <math>f''(x)=2>0</math>, so {{mvar|f}} is a convex function. It is also strongly convex (and hence strictly convex too), with strong convexity constant 2.
* The function <math>f(x)=x^4</math> has <math>f''(x)=12x^2\ge 0</math>, so {{mvar|f}} is a convex function. It is strictly convex, even though the second derivative is not strictly positive at all points. It is not strongly convex.
* The [[absolute value]] function <math>f(x)=|x|</math> is convex (as reflected in the [[triangle inequality]]), even though it does not have a derivative at the point
* The function <math>f(x)=|x|^p</math> for <math>p \ge 1</math> is convex.
* The [[exponential function]] <math>f(x)=e^x</math> is convex. It is also strictly convex, since <math>f''(x)=e^x >0 </math>, but it is not strongly convex since the second derivative can be arbitrarily close to zero. More generally, the function <math>g(x) = e^{f(x)}</math> is [[Logarithmically convex function|logarithmically convex]] if
* The function <math>f</math> with ___domain [0,1] defined by <math>f(0) = f(1) = 1, f(x) = 0</math> for <math>0 < x < 1</math> is convex; it is continuous on the open interval <math>(0,
* The function
* Examples of functions that are [[Monotonic function|monotonically increasing]] but not convex include <math>f(x)=\sqrt{x}</math> and <math>g(x)=\log x</math>.
* Examples of functions that are convex but not [[Monotonic function|monotonically increasing]] include <math>h(x)= x^2</math> and <math>k(x)=-x</math>.
* The function <math>f(x) = \tfrac{1}{x}</math> has <math>f''(x)=\tfrac{2}{x^3}</math> which is greater than 0 if
* The function <math>f(x)=\tfrac{1}{x^2}</math> with <math>f(0)=\infty</math>, is convex on the interval <math>(0, \infty)</math> and convex on the interval <math>(-\infty, 0)</math>, but not convex on the interval <math>(-\infty, \infty)</math>, because of the singularity at
===Functions of ''n'' variables===
* [[LogSumExp]] function, also called softmax function, is a convex function.
*The function <math>-\log\det(X)</math> on the ___domain of [[Positive-definite matrix|positive-definite matrices]] is convex.<ref name="boyd" />{{rp|74}}
* Every real-valued [[linear transformation]] is convex but not strictly convex, since if
* Every real-valued [[affine function]],
* Every [[norm (mathematics)|norm]] is a convex function, by the [[triangle inequality]] and [[Homogeneous function#Positive homogeneity|positive homogeneity]].
* The [[spectral radius]] of a [[nonnegative matrix]] is a convex function of its diagonal elements.<ref>Cohen, J.E., 1981. [https://www.ams.org/journals/proc/1981-081-04/S0002-9939-1981-0601750-2/S0002-9939-1981-0601750-2.pdf Convexity of the dominant eigenvalue of an essentially nonnegative matrix]. Proceedings of the American Mathematical Society, 81(4), pp.657-658.</ref>
Line 168:
* [[Convex optimization]]
* [[Geodesic convexity]]
* [[Hahn–Banach theorem]]
* [[Hermite–Hadamard inequality]]
* [[Invex function]]
|