Content deleted Content added
Link suggestions feature: 2 links added. |
|||
(44 intermediate revisions by 29 users not shown) | |||
Line 5:
This has two important [[corollary|corollaries]]:
# If a continuous function has values of opposite sign inside an interval, then it has a [[Zero of a function|root]] in that interval ('''Bolzano's theorem''').<ref>{{MathWorld |title=Bolzano's Theorem |urlname=BolzanosTheorem}}</ref>
# The [[image (mathematics)|image]] of a continuous
==Motivation==
Line 16:
The intermediate value theorem states the following:
Consider
*''Version I.'' if <math>u</math> is a number between <math>f(a)</math> and <math>f(b)</math>, that is, <math display="block">\min(f(a),f(b))<u<\max(f(a),f(b)),</math> then there is a <math>c\in (a,b)</math> such that <math>f(c)=u</math>.
*''Version II.'' the [[Image of a function|image set]] <math>f(I)</math> is also
'''Remark:''' ''Version II'' states that the [[Set (mathematics)|set]] of function values has no gap. For any two function values <math>c,d \in f(I)</math> with <math>c < d</math>
A subset of the real numbers with no internal gap is an interval. ''Version I'' is naturally contained in ''Version II''.
==Relation to completeness==
The theorem depends on, and is equivalent to, the [[completeness of the real numbers]]. The intermediate value theorem does not apply to the [[rational number]]s '''Q''' because gaps exist between rational numbers; [[irrational numbers]] fill those gaps. For example, the function <math>f(x) = x^
Despite the above, there is a version of the intermediate value theorem for polynomials over a [[real closed field]]; see the [[Weierstrass Nullstellensatz]].
==Proof==
=== Proof version A===
<!-- This section is linked from [[Continuity property]] -->
The theorem may be proven as a consequence of the [[completeness (order theory)|completeness]] property of the real numbers as follows:<ref>Essentially follows {{cite book |title=Foundations of Analysis|first=Douglas A.|last=Clarke|publisher=Appleton-Century-Crofts | year=1971|page=284}}</ref>
Line 33 ⟶ 36:
We shall prove the first case, <math>f(a) < u < f(b)</math>. The second case is similar.
Let <math>S</math> be the set of all <math>x \in [a,b]</math> such that <math>f(x)
Note that, due to the continuity of <math>f</math> at <math>a</math>, we can keep <math>f(x)</math> within any <math>\varepsilon>0</math> of <math>f(a)</math> by keeping <math>x</math> sufficiently close to <math>a</math>. Since <math>f(a)<u</math> is a strict inequality, consider the implication when <math>\varepsilon</math> is the distance between <math>u</math> and <math>f(a)</math>. No <math>x</math> sufficiently close to <math>a</math> can then make <math>f(x)</math> greater than or equal to <math>u</math>, which means there are values greater than <math>a</math> in <math>S</math>. A more detailed proof goes like this:
Choose <math>\varepsilon=u-f(a)>0</math>. Then <math>\exists \delta>0</math> such that <math>\forall x \in [a,b]</math>, <math display="block">|x-a|<\delta \implies |f(x)-f(a)|<u-f(a) \implies f(x)<u.</math>Consider the interval <math>[a,\min(a+\delta,b))=I_1</math>. Notice that <math>I_1 \subseteq [a,b]</math> and every <math>x \in I_1</math> satisfies the condition <math>|x-a|<\delta</math>. Therefore for every <math>x \in I_1</math> we have <math>f(x)<u</math>. Hence <math>c</math> cannot be <math>a</math>.
Likewise, due to the continuity of <math>f</math> at <math>b</math>, we can keep <math>f(x)</math> within any <math>\varepsilon > 0</math> of <math>f(b)</math> by keeping <math>x</math> sufficiently close to <math>b</math>. Since <math>u<f(b)</math> is a strict inequality, consider the similar implication when <math>\varepsilon</math> is the distance between <math>u</math> and <math>f(b)</math>. Every <math>x</math> sufficiently close to <math>b</math> must then make <math>f(x)</math> greater than <math>u</math>, which means there are values smaller than <math>b</math> that are upper bounds of <math>S</math>. A more detailed proof goes like this:
Choose <math>\varepsilon=f(b)-u>0</math>. Then <math>\exists \delta>0</math> such that <math>\forall x \in [a,b]</math>, <math display="block">|x-b|<\delta \implies |f(x)-f(b)|<f(b)-u \implies f(x)>u.</math>Consider the interval <math>(\max(a,b-\delta),b]=I_2</math>. Notice that <math>I_2 \subseteq [a,b]</math> and every <math>x \in I_2</math> satisfies the condition <math>|x-b|<\delta</math>. Therefore for every <math>x \in I_2</math> we have <math>f(x)>u</math>. Hence <math>c</math> cannot be <math>b</math>.
With <math>c \neq a</math> and <math>c \neq b</math>, it must be the case <math>c \in (a,b)</math>. Now we claim that <math>f(c)=u</math>.
Fix some <math>\varepsilon > 0</math>. Since <math>f</math> is continuous at <math>c</math>, <math>\exists \delta_1>0</math> such that <math>\forall x \in [a,b]</math>, <math>|x-c|<\delta_1 \implies |f(x) - f(c)| < \varepsilon</math>.
<math display="block">f(x)-\varepsilon<f(c)<f(x)+\varepsilon</math>
for all <math>x\in(c-\delta,c+\delta)</math>. By the properties of the supremum, there exists some <math>a^*\in (c-\delta,c]</math> that is contained in <math>S</math>, and so
<math display="block">f(c)<f(a^*)+\varepsilon
Picking <math>a^{**}\in(c,c+\delta)</math>, we know that <math>a^{**}\not\in S</math> because <math>c</math> is the supremum of <math>S</math>. This means that
<math display="block">f(c)>f(a^{**})-\varepsilon
Both inequalities
<math display="block">u-\varepsilon<f(c)< u+\varepsilon</math>
are valid for all <math>\varepsilon > 0</math>, from which we deduce <math>f(c) = u</math> as the only possible value, as stated.
===Proof version B===
We will only prove the case of <math>f(a)<u<f(b)</math>, as the <math>f(a)>u>f(b)</math> case is similar.<ref>Slightly modified version of {{cite book |title=Understanding Analysis|first=Stephen|last=Abbot|publisher=Springer | year=2015|page=123}}</ref>
Define <math>g(x)=f(x)-u</math> which is equivalent to <math>f(x)=g(x)+u</math> and lets us rewrite <math>f(a)<u<f(b)</math> as <math>g(a)<0<g(b)</math>, and we have to prove, that <math>g(c)=0</math> for some <math>c\in[a,b]</math>, which is more intuitive. We further define the set <math>S=\{x\in[a,b]:g(x)\leq 0\}</math>. Because <math>g(a)<0</math> we know, that <math>a\in S</math> so, that <math>S</math> is not empty. Moreover, as <math>S\subseteq[a,b]</math>, we know that <math>S</math> is bounded and non-empty, so by Completeness, the [[supremum]] <math>c=\sup(S)</math> exists.
There are 3 cases for the value of <math>g(c)</math>, those being <math>g(c)<0,g(c)>0</math> and <math>g(c)=0</math>. For contradiction, let us assume, that <math>g(c)<0</math>. Then, by the definition of continuity, for <math>\epsilon=0-g(c)</math>, there exists a <math>\delta>0</math> such that <math>x\in(c-\delta,c+\delta)</math> implies, that <math>|g(x)-g(c)|<-g(c)</math>, which is equivalent to <math>g(x)<0</math>. If we just chose <math>x=c+\frac{\delta}{N}</math>, where <math>N>\frac{\delta}{b-c}+1</math>, then as <math>1 < N</math>, <math>x<c+\delta</math>, from which we get <math>g(x)<0</math> and <math>c<x<b</math>, so <math>x\in S</math>. It follows that <math>x</math> is an upper bound for <math>S</math>. However, <math>x>c</math>, contradicting the '''upper bound''' property of the ''least upper bound'' <math>c</math>, so <math>g(c)\geq 0</math>. Assume then, that <math>g(c)>0</math>. We similarly chose <math>\epsilon=g(c)-0</math> and know, that there exists a <math>\delta>0</math> such that <math>x\in(c-\delta,c+\delta)</math> implies <math>|g(x)-g(c)|<g(c)</math>. We can rewrite this as <math>-g(c)<g(x)-g(c)<g(c)</math> which implies, that <math>g(x)>0</math>. If we now chose <math>x=c-\frac{\delta}{2}</math>, then <math>g(x)>0</math> and <math>a<x<c</math>. It follows that <math>x</math> is an upper bound for <math>S</math>. However, <math>x<c</math>, which contradict the '''least''' property of the ''least upper bound'' <math>c</math>, which means, that <math>g(c)>0</math> is impossible. If we combine both results, we get that <math>g(c)=0</math> or <math>f(c)=u</math> is the only remaining possibility.
'''Remark:''' The intermediate value theorem can also be proved using the methods of [[non-standard analysis]], which places "intuitive" arguments involving infinitesimals on a rigorous{{Clarify|reason=The placement and phrasing of this remark may suggest that the classical proof is somehow "intuitive" and not rigorous, which is not the case.|date=January 2023}} footing.<ref>{{cite arXiv |last=Sanders|first=Sam | eprint=1704.00281 | title=Nonstandard Analysis and Constructivism!|date=2017|class=math.LO}}</ref>
Line 57 ⟶ 79:
| series = Sources and Studies in the History of Mathematics and Physical Sciences
| title = Redefining Geometrical Exactness: Descartes' Transformation of the Early Modern Concept of Construction
| year = 2001| isbn = 978-1-4612-6521-4
Let <math>f, \
The equivalence between this formulation and the modern one can be shown by setting <math>\
Earlier authors held the result to be intuitively obvious and requiring no proof. The insight of Bolzano and Cauchy was to define a general notion of continuity (in terms of [[infinitesimal]]s in Cauchy's case and using real inequalities in Bolzano's case), and to provide a proof based on such definitions.
==Converse is false==▼
A [[Darboux function]] is a real-valued function {{mvar|f}} that has the "intermediate value property," i.e., that satisfies the conclusion of the intermediate value theorem: for any two values {{mvar|a}} and {{mvar|b}} in the ___domain of {{mvar|f}}, and any {{mvar|y}} between {{math|''f''(''a'')}} and {{math|''f''(''b'')}}, there is some {{mvar|c}} between {{mvar|a}} and {{mvar|b}} with {{math|1=''f''(''c'') = ''y''}}. The intermediate value theorem says that every continuous function is a Darboux function. However, not every Darboux function is continuous; i.e., the converse of the intermediate value theorem is false.▼
As an example, take the function {{math|''f'' : [0, ∞) → [−1, 1]}} defined by {{math|1=''f''(''x'') = sin(1/''x'')}} for {{math|''x'' > 0}} and {{math|1=''f''(0) = 0}}. This function is not continuous at {{math|1=''x'' = 0}} because the [[limit of a function|limit]] of {{math|1=''f''(''x'')}} as {{mvar|x}} tends to 0 does not exist; yet the function has the intermediate value property. Another, more complicated example is given by
In fact, [[Darboux's theorem (analysis)|Darboux's theorem]] states that all functions that result from the [[derivative|differentiation]] of some other function on some interval have the [[intermediate value property]] (even though they need not be continuous).▼
Historically, this intermediate value property has been suggested as a definition for continuity of real-valued functions;<ref>{{Cite book |last=Smorynski |first=Craig |url=https://books.google.com/books?id=lnuhDgAAQBAJ
==Generalizations==
=== Multi-dimensional spaces ===
The intermediate value theorem is closely linked to the [[topology|topological]] notion of [[Connectedness (topology)|connectedness]] and follows from the basic properties of connected sets in metric spaces and connected subsets of '''R''' in particular:▼
The [[Poincaré-Miranda theorem]] is a generalization of the Intermediate value theorem from a (one-dimensional) interval to a (two-dimensional) rectangle, or more generally, to an ''n''-dimensional [[N-cube|cube]].
* If <math>X</math> and <math>Y</math> are [[metric space]]s, <math>f \colon X \to Y</math> is a continuous map, and <math>E \subset X</math> is a [[Connected space|connected]] subset, then <math>f(E)</math> is connected. <math>(*)</math> ▼
* A subset <math>E \subset \R</math> is connected if and only if it satisfies the following property: <math>x,y\in E,\ x < r < y \implies r \in E</math>. <math>(**)</math>▼
Vrahatis<ref>{{Cite journal |last=Vrahatis |first=Michael N. |date=2016-04-01 |title=Generalization of the Bolzano theorem for simplices |url=https://www.sciencedirect.com/science/article/pii/S0166864115005994 |journal=Topology and Its Applications |language=en |volume=202 |pages=40–46 |doi=10.1016/j.topol.2015.12.066 |issn=0166-8641}}</ref> presents a similar generalization to triangles, or more generally, ''n''-dimensional [[Simplex|simplices]]. Let ''D<sup>n</sup>'' be an ''n''-dimensional simplex with ''n''+1 vertices denoted by ''v''<sub>0</sub>,...,''v<sub>n</sub>''. Let ''F''=(''f''<sub>1</sub>,...,''f<sub>n</sub>'') be a continuous function from ''D<sup>n</sup>'' to ''R<sup>n</sup>'', that never equals 0 on the boundary of ''D<sup>n</sup>''. Suppose ''F'' satisfies the following conditions:
In fact, connectedness is a [[topological property]] and <math>(*)</math> generalizes to [[topological space]]s: ''If <math>X</math> and <math>Y</math> are topological spaces, <math>f \colon X \to Y</math> is a continuous map, and <math>X</math> is a [[connected space]], then <math>f(X)</math> is connected.'' The preservation of connectedness under continuous maps can be thought of as a generalization of the intermediate value theorem, a property of real valued functions of a real variable, to continuous functions in general spaces.▼
* For all ''i'' in 1,...,''n'', the sign of ''f<sub>i</sub>''(''v<sub>i</sub>'') is opposite to the sign of ''f<sub>i</sub>''(''x'') for all points ''x'' on the face opposite to ''v<sub>i</sub>'';
Recall the first version of the intermediate value theorem, stated previously:▼
* The sign-vector of ''f''<sub>1</sub>,...,''f<sub>n</sub>'' on ''v''<sub>0</sub> is not equal to the sign-vector of ''f''<sub>1</sub>,...,''f<sub>n</sub>'' on all points on the face opposite to ''v<sub>0</sub>''.
Then there is a point ''z'' in the [[Interior (topology)|interior]] of ''D<sup>n</sup>'' on which ''F''(''z'')=(0,...,0).
{{math theorem|name=Intermediate value theorem|note=''Version I''|math_statement=Consider a closed interval <math>I=[a,b]</math> in the real numbers <math>\R</math> and a continuous function <math>f\colon I\to\R</math>. Then, if <math> u</math> is a real number such that <math>\min(f(a),f(b))< u < \max(f(a),f(b))</math>, there exists <math>c \in (a,b)</math> such that <math>f(c) = u</math>.}}▼
It is possible to normalize the ''f<sub>i</sub>'' such that ''f<sub>i</sub>''(''v<sub>i</sub>'')>0 for all ''i''; then the conditions become simpler:
The intermediate value theorem is an immediate consequence of these two properties of connectedness:<ref>{{Cite book| url=https://archive.org/details/1979RudinW|title=Principles of Mathematical Analysis| last=Rudin|first=Walter| publisher=McGraw-Hill|year=1976|isbn=978-0-07-054235-8|___location=New York|pages=42, 93}}</ref>▼
*For all ''i'' in 1,...,''n'', ''f<sub>i</sub>''(''v<sub>i</sub>'')>0, and ''f<sub>i</sub>''(''x'')<0 for all points ''x'' on the face opposite to ''v<sub>i</sub>''. In particular, ''f<sub>i</sub>''(''v<sub>0</sub>'')<0.
{{math proof|proof= By <math>(**)</math>, <math>I = [a,b]</math> is a connected set. It follows from <math>(*)</math> that the image, <math>f(I)</math>, is also connected. For convenience, assume that <math>f(a) < f(b)</math>. Then once more invoking <math>(**)</math>, <math>f(a) < u < f(b)</math> implies that <math>u \in f(I)</math>, or <math>f(c) = u</math> for some <math>c\in I</math>. Since <math>u\neq f(a), f(b)</math>, <math>c\in(a,b)</math> must actually hold, and the desired conclusion follows. The same argument applies if <math>f(b) < f(a)</math>, so we are done. [[Q.E.D.]]}}▼
*For all points ''x'' on the face opposite to ''v<sub>0</sub>'', ''f<sub>i</sub>''(''x'')>0 for at least one ''i'' in 1,...,''n.''
The theorem can be proved based on the [[Knaster–Kuratowski–Mazurkiewicz lemma]]. In can be used for approximations of fixed points and zeros.<ref>{{Cite journal |last=Vrahatis |first=Michael N. |date=2020-04-15 |title=Intermediate value theorem for simplices for simplicial approximation of fixed points and zeros |journal=Topology and Its Applications |language=en |volume=275 |pages=107036 |doi=10.1016/j.topol.2019.107036 |issn=0166-8641|doi-access=free }}</ref>
=== General metric and topological spaces ===
The intermediate value theorem generalizes in a natural way: Suppose that {{mvar|X}} is a connected topological space and {{math|(''Y'', <)}} is a [[total order|totally ordered]] set equipped with the [[order topology]], and let {{math|''f'' : ''X'' → ''Y''}} be a continuous map. If {{mvar|a}} and {{mvar|b}} are two points in {{mvar|X}} and {{mvar|u}} is a point in {{mvar|Y}} lying between {{math|''f''(''a'')}} and {{math|''f''(''b'')}} with respect to {{math|<}}, then there exists {{mvar|c}} in {{mvar|X}} such that {{math|1=''f''(''c'') = ''u''}}. The original theorem is recovered by noting that {{math|'''R'''}} is connected and that its natural [[Topological space|topology]] is the order topology.▼
▲The intermediate value theorem is closely linked to the [[topology|topological]] notion of [[Connectedness (topology)|connectedness]] and follows from the basic properties of connected sets in metric spaces and connected subsets of '''R''' in particular:
▲* If <math>X</math> and <math>Y</math> are [[metric space]]s, <math>f \colon X \to Y</math> is a continuous map, and <math>E \subset X</math> is a [[Connected space|connected]] subset, then <math>f(E)</math> is connected. ({{EquationRef|<
▲* A subset <math>E \subset \R</math> is connected [[if and only if]] it satisfies the following property: <math>x,y\in E,\ x < r < y \implies r \in E</math>. ({{EquationRef|<
▲In fact, connectedness is a [[topological property]] and
The [[Brouwer fixed-point theorem]] is a related theorem that, in one dimension, gives a special case of the intermediate value theorem.▼
▲Recall the first version of the intermediate value theorem, stated previously:
▲==Converse is false==
▲{{math theorem|name=Intermediate value theorem|note=''Version I''|math_statement=Consider a closed interval <math>I = [a,b]</math> in the real numbers <math>\R</math> and a continuous function <math>f\colon I\to\R</math>. Then, if <math> u</math> is a real number such that <math>\min(f(a),f(b))< u < \max(f(a),f(b))</math>, there exists <math>c \in (a,b)</math> such that <math>f(c) = u</math>.}}
▲A [[Darboux function]] is a real-valued function {{mvar|f}} that has the "intermediate value property," i.e., that satisfies the conclusion of the intermediate value theorem: for any two values {{mvar|a}} and {{mvar|b}} in the ___domain of {{mvar|f}}, and any {{mvar|y}} between {{math|''f''(''a'')}} and {{math|''f''(''b'')}}, there is some {{mvar|c}} between {{mvar|a}} and {{mvar|b}} with {{math|1=''f''(''c'') = ''y''}}. The intermediate value theorem says that every continuous function is a Darboux function. However, not every Darboux function is continuous; i.e., the converse of the intermediate value theorem is false.
▲The intermediate value theorem is an immediate consequence of these two properties of connectedness:<ref>{{Cite book| url=https://archive.org/details/1979RudinW|title=Principles of Mathematical Analysis| last=Rudin|first=Walter| publisher=McGraw-Hill|year=1976|isbn=978-0-07-054235-8|___location=New York|pages=42, 93}}</ref>
▲As an example, take the function {{math|''f'' : [0, ∞) → [−1, 1]}} defined by {{math|1=''f''(''x'') = sin(1/''x'')}} for {{math|''x'' > 0}} and {{math|1=''f''(0) = 0}}. This function is not continuous at {{math|1=''x'' = 0}} because the [[limit of a function|limit]] of {{math|1=''f''(''x'')}} as {{mvar|x}} tends to 0 does not exist; yet the function has the intermediate value property. Another, more complicated example is given by the [[Conway base 13 function]].
▲{{math proof|proof= By
▲In fact, [[Darboux's theorem (analysis)|Darboux's theorem]] states that all functions that result from the [[derivative|differentiation]] of some other function on some interval have the [[intermediate value property]] (even though they need not be continuous).
▲The intermediate value theorem generalizes in a natural way: Suppose that {{mvar|X}} is a connected topological space and {{math|(''Y'', <)}} is a [[total order|totally ordered]] set equipped with the [[order topology]], and let {{math|''f'' : ''X'' → ''Y''}} be a continuous map. If {{mvar|a}} and {{mvar|b}} are two points in {{mvar|X}} and {{mvar|u}} is a point in {{mvar|Y}} lying between {{math|''f''(''a'')}} and {{math|''f''(''b'')}} with respect to {{math|<}}, then there exists {{mvar|c}} in {{mvar|X}} such that {{math|1=''f''(''c'') = ''u''}}. The original theorem is recovered by noting that {{math|'''R'''}} is connected and that its natural [[Topological space|topology]] is the order topology.
▲Historically, this intermediate value property has been suggested as a definition for continuity of real-valued functions;<ref>{{Cite book|url=https://books.google.com/books?id=lnuhDgAAQBAJ&pg=PA51&q=Historically%2C+this+intermediate+value+property+has+been+suggested+as+a+definition+for+continuity+of+real-valued+functions | title=MVT: A Most Valuable Theorem|last=Smorynski|first=Craig|date=2017-04-07|publisher=Springer| isbn=9783319529561| language=en}}</ref> this definition was not adopted.
▲The [[Brouwer fixed-point theorem]] is a related theorem that, in one dimension, gives a special case of the intermediate value theorem.
==In constructive mathematics==
In [[constructive mathematics]], the intermediate value theorem is not true. Instead, the weakened conclusion one
* Let <math>a</math> and <math>b</math> be real numbers and <math>f:[a,b] \to R</math> be a pointwise continuous function from the [[closed interval]] <math>[a,b]</math> to the real line, and suppose that <math>f(a) < 0</math> and <math>0 < f(b)</math>. Then for every positive number <math>\varepsilon > 0</math> there exists a point <math>x</math> in the unit interval such that <math>\vert f(x) \vert < \varepsilon</math>.<ref>{{cite journal|title=Interpolating Between Choices for the Approximate Intermediate Value Theorem | author=Matthew Frank|journal=Logical Methods in Computer Science|volume=16|issue=3|date=July 14, 2020| doi=10.23638/LMCS-16(3:5)2020|arxiv=1701.02227}}</ref>
Line 107 ⟶ 147:
In general, for any continuous function whose ___domain is some closed convex {{nowrap|<math>n</math>-dimensional}} shape and any point inside the shape (not necessarily its center), there exist two antipodal points with respect to the given point whose functional value is the same.
The theorem also underpins the explanation of why rotating a wobbly table will bring it to stability (subject to certain easily met constraints).<ref>[[Keith Devlin]] (2007) [https://web.archive.org/web/20140228044921/http://www.maa.org/external_archive/devlin/devlin_02_07.html How to stabilize a wobbly table]</ref>
==See also==
*
▲* {{annotated link|Non-atomic measure}}
▲* {{annotated link|Hairy ball theorem}}
▲* {{annotated link|Sperner's lemma}}
==References==
{{Reflist}}
== Further reading ==
* https://mathoverflow.net/questions/253059/approximate-intermediate-value-theorem-in-pure-constructive-mathematics
==External links==
* [http://www.cut-the-knot.org/Generalization/ivt.shtml Intermediate value Theorem - Bolzano Theorem] at [[cut-the-knot]]
* [http://demonstrations.wolfram.com/BolzanosTheorem/ Bolzano's Theorem] by Julio Cesar de la Yncera, [[Wolfram Demonstrations Project]].
|