Revised simplex method: Difference between revisions

Browse history interactively

← Previous edit

Content deleted Content added

VisualWikitext

Revision as of 15:41, 12 March 2014 edit Kxx (talk \| contribs) Extended confirmed users 3,386 edits m →Optimality conditions: use {{math}} for inline math ← Previous edit		Latest revision as of 08:04, 11 February 2025 edit undo Beland (talk \| contribs) Autopatrolled, Administrators 259,086 edits m convert special characters found by Wikipedia:Typo Team/moss (via WP:JWB)
(31 intermediate revisions by 17 users not shown)
Line 1: In [[mathematical optimization]], the '''revised simplex method''' is ana variant of [[George Dantzig]]'s [[simplex method]] for [[linear programming]]. The revised simplex method is mathematically equivalent to the standard simplex method but differs in implementation. Instead of maintaining a tableau which explicitly represents the constraints adjusted to a set of basic variables, it maintains a representation of a [[Basis (linear algebra)\|basis]] of the [[Matrix (mathematics)\|matrix]] representing the constraints. The matrix-oriented approach allows for greater computational efficiency by enabling sparse matrix operations.{{sfn\|Morgan\|1997\|loc=§2}} Line 8: \begin{array}{rl} \text{minimize} & \boldsymbol{c}^{\mathrm{T}} \boldsymbol{x} \\ \text{subject to} & \boldsymbol{Ax} = \boldsymbol{b}~~\text{~~, } \boldsymbol{x} \ge \boldsymbol{0} \end{array} </math> where {{math\|'''''A''''' ∈ ~~'''R'''~~ℝ<sup>''m''×''n''</sup>}}. Without loss of generality, it is assumed that the constraint matrix {{math\|'''''A'''''}} has full row rank and that the problem is feasible, i.e., there is at least one {{math\|'''''x''''' ≥ '''0'''}} such that {{math\|'''''Ax''''' {{=}} '''''b'''''}}. If {{math\|'''''A'''''}} is rank-deficient, either there are redundant constraints, or the problem is infeasible. Both situations can be handled by a presolve step. ==Algorithmic description== Line 19: :<math> \begin{align} \boldsymbol{Ax} & = \boldsymbol{b}~~\text{~~,} \\ \boldsymbol{A}^{\mathrm{T}} \boldsymbol{\lambda} + \boldsymbol{s} & = \boldsymbol{c}~~\text{~~,} \\ \boldsymbol{x} & \ge \boldsymbol{0}~~\text{~~,} \\ \boldsymbol{s} & \ge \boldsymbol{0}~~\text{~~,} \\ \boldsymbol{s}^{\mathrm{T}} \boldsymbol{x} & = 0 \end{align} Line 29: where {{math\|'''''λ'''''}} and {{math\|'''''s'''''}} are the [[Lagrange multiplier]]s associated with the constraints {{math\|'''''Ax''''' {{=}} '''''b'''''}} and {{math\|'''''x''''' ≥ '''0'''}}, respectively.{{sfn\|Nocedal\|Wright\|2006\|p=358\|loc=Eq. 13.4}} The last condition, which is equivalent to {{math\|''s<sub>i</sub>x<sub>i</sub>'' {{=}} 0}} for all {{math\|1 < ''i'' < ''n''}}, is called the ''complementary slackness condition''. By what is sometimes known as the ''fundamental theorem of linear programming'', a vertex {{math\|'''''x'''''}} of the feasible polytope can be identified by bebeing a basis {{math\|'''''B'''''}} of {{math\|'''''A'''''}} chosen from the latter's columns.{{efn\|The same theorem also states that the feasible polytope has at least one vertex and that there is at least one vertex which is optimal.{{sfn\|Nocedal\|Wright\|2006\|p=363\|loc=Theorem 13.2}}}} Since {{math\|'''''A'''''}} has full rank, {{math\|'''''B'''''}} is nonsingular. Without loss of generality, assume that {{math\|'''''A''''' {{=}} ~~[~~['''''B''''' '''''N'''''~~]~~]}}. Then {{math\|'''''x'''''}} is given by :<math> Line 51: \boldsymbol{c_B} \\ \boldsymbol{c_N} \end{bmatrix}~~\text{~~,} \\ \boldsymbol{s} & = \begin{bmatrix} \boldsymbol{s_B} \\ \boldsymbol{s_N} \end{bmatrix}~~\text{~~.} \end{align} </math> Line 64: :<math> \begin{align} \boldsymbol{B}^{\mathrm{T}} \boldsymbol{\lambda} & = \boldsymbol{c_B}~~\text{~~,} \\ \boldsymbol{N}^{\mathrm{T}} \boldsymbol{\lambda} + \boldsymbol{s_N} & = \boldsymbol{c_N}~~\text{~~,} \end{align} </math> Line 73: :<math> \begin{align} \boldsymbol{\lambda} & = (\boldsymbol{B}^{-\mathrm{T}})^{-1} \boldsymbol{c_B}~~\text{~~,} \\ \boldsymbol{s_N} & = \boldsymbol{c_N} - \boldsymbol{N}^{\mathrm{T}} \boldsymbol{\lambda}~~\text{~~.} \end{align} </math> Line 81: ===Pivot operation=== If the KKT conditions are violated, a ''pivot operation'' consisting of introducing a column of <{{math~~>\boldsymbol{~~\|'''''N'''''}}~~</math>~~ into the basis at the expense of an existing column in ~~<math>\boldsymbol~~{{math\|'''''B'''''}}~~</math>~~ is performed. In the absence of [[Degeneracy (mathematics)\|degeneracy]], a pivot operation always results in a strict decrease in {{math\|'''''c'''''<~~math~~sup>~~\boldsymbol{c}^{\mathrm{~~T~~}} \boldsymbol{x}~~</~~math~~sup>'''''x'''''}}. Therefore, if the problem is bounded, the revised simplex method must terminate at an optimal vertex after repeated pivot operations because there are only a finite number of vertices.{{sfn\|Nocedal\|Wright\|2006\|p=370\|loc=Theorem 13.4}} Select an index <{{math>\|''m'' < ''q'' ~~\le~~≤ ''n~~</math>~~''}} such that <{{math\|''s<sub>~~s_q~~q</sub>'' < 0~~</math>~~}} as the ''entering index''. The corresponding column of ~~<math>\boldsymbol~~{{math\|'''''A'''''}}~~</math>~~, ~~<math>\boldsymbol~~{{math\|'''''A~~}_q~~'''<sub>q</~~math~~sub>''}}, will be moved into the basis, and <{{math\|''x<sub>~~x_q~~q</~~math~~sub>''}} will be allowed to increase from zero. It can be shown that :<math>\frac{\partial (\boldsymbol{c}^{\mathrm{T}} \boldsymbol{x})}{\partial x_q} = s_q~~\text{~~,}</math> i.e., every unit increase in <{{math\|''x<sub>~~x_q~~q</~~math~~sub> ~~will~~''}} results in a decrease by <{{math\|−''s<sub>~~-s_q~~q</~~math~~sub>''}} in ~~<math>\boldsymbol~~{~~c}^{\mathrm~~{math\|'''''c'''''<sup>T~~}} \boldsymbol{x}~~</~~math~~sup>'''''x'''''}}.{{sfn\|Nocedal\|Wright\|2006\|p=369\|loc=Eq. 13.24}} Since :<math>\boldsymbol{B x_B} + \boldsymbol{A}_q x_q = \boldsymbol{b}~~\text{~~,}</math> <{{math\|'''''x<sub>~~\boldsymbol{x_B}~~B</~~math~~sub>'''''}} must be correspondingly decreased by <{{math\|Δ'''''x<sub>~~\Delta~~B</sub>''''' ~~\boldsymbol~~{~~x_B}~~ {= ~~\boldsymbol{B~~}~~^{-1~~} ~~\boldsymbol{~~'''''B'''''<sup>−1</sup>'''''A~~}_q x_q~~'''<sub>q</~~math~~sub>x<sub>q</sub>''}} subject to <{{math\|'''''x<sub>~~\boldsymbol{x_B}~~B</sub>''''' -− ~~\Delta~~Δ'''''x<sub>B</sub>''''' ~~\boldsymbol{x_B}~~≥ ~~\ge \boldsymbol{~~'''0'''}}~~</math>~~. Let ~~<math>\boldsymbol~~{{math\|'''''d}''''' ~~= \boldsymbol~~{B{=}~~^{-1~~} ~~\boldsymbol{~~'''''B'''''<sup>−1</sup>'''''A~~}_q~~'''<sub>q</~~math~~sub>''}}. If ~~<math>\boldsymbol~~{{math\|'''''d}''''' ~~\le~~≤ ~~\boldsymbol{~~'''0'''}}~~</math>~~, no matter how much <{{math\|''x<sub>~~x_q~~q</~~math~~sub>''}} is increased, <{{math\|'''''x<sub>~~\boldsymbol{x_B}~~B</sub>''''' -− ~~\Delta \boldsymbol{x_B}~~Δ'''''x<sub>B</~~math~~sub>'''''}} will stay nonnegative. Hence, ~~<math>\boldsymbol~~{~~c}^{\mathrm~~{math\|'''''c'''''<sup>T~~}} \boldsymbol{x}~~</~~math~~sup>'''''x'''''}} can be arbitrarily decreased, and thus the problem is unbounded. Otherwise, select an index <{{math>\|''p'' ~~= \operatorname~~{~~argmin}_~~{~~1 \le~~=}} argmin<sub>1≤''i ~~\le~~ ''≤''m}''</sub> \{~~x_i~~ {(}}''x<sub>i</sub>''/''d<sub>i</sub>'' ~~d_i \mathop~~{\|{!}} ~~d_i~~''d<sub>i</sub>'' > 0\{{)}}}}~~</math>~~ as the ''leaving index''. This choice effectively increases <{{math\|''x<sub>~~x_q~~q</~~math~~sub>''}} from zero until <{{math\|''x<sub>~~x_p~~p</~~math~~sub>''}} is reduced to zero while maintaining feasibility. The pivot operation concludes with replacing ~~<math>\boldsymbol~~{{math\|'''''A~~}_p~~'''<sub>p</~~math~~sub>''}} with ~~<math>\boldsymbol~~{{math\|'''''A~~}_q~~'''<sub>q</~~math~~sub>''}} in the basis. ==Numerical example== {{seealso\|Simplex method#Example}} Consider a linear program where :<math> \begin{align} \boldsymbol{c} & = \begin{bmatrix} -2 & -3 & -4 & 0 & 0 \end{bmatrix}^{\mathrm{T}}, \\ \boldsymbol{A} & = \begin{bmatrix} 3 & 2 & 1 & 1 & 0 \\ 2 & 5 & 3 & 0 & 1 \end{bmatrix}, \\ \boldsymbol{b} & = \begin{bmatrix} 10 \\ 15 \end{bmatrix}. \end{align} </math> Let :<math> \begin{align} \boldsymbol{B} & = \begin{bmatrix} \boldsymbol{A}_4 & \boldsymbol{A}_5 \end{bmatrix}, \\ \boldsymbol{N} & = \begin{bmatrix} \boldsymbol{A}_1 & \boldsymbol{A}_2 & \boldsymbol{A}_3 \end{bmatrix} \end{align} </math> initially, which corresponds to a feasible vertex {{math\|'''''x''''' {{=}} [0 0 0 10 15]<sup>T</sup>}}. At this moment, :<math> \begin{align} \boldsymbol{\lambda} & = \begin{bmatrix} 0 & 0 \end{bmatrix}^{\mathrm{T}}, \\ \boldsymbol{s_N} & = \begin{bmatrix} -2 & -3 & -4 \end{bmatrix}^{\mathrm{T}}. \end{align} </math> Choose {{math\|''q'' {{=}} 3}} as the entering index. Then {{math\|'''''d''''' {{=}} [1 3]<sup>T</sup>}}, which means a unit increase in {{math\|''x''<sub>3</sub>}} results in {{math\|''x''<sub>4</sub>}} and {{math\|''x''<sub>5</sub>}} being decreased by {{math\|1}} and {{math\|3}}, respectively. Therefore, {{math\|''x''<sub>3</sub>}} is increased to {{math\|5}}, at which point {{math\|''x''<sub>5</sub>}} is reduced to zero, and {{math\|''p'' {{=}} 5}} becomes the leaving index. After the pivot operation, :<math> \begin{align} \boldsymbol{B} & = \begin{bmatrix} \boldsymbol{A}_3 & \boldsymbol{A}_4 \end{bmatrix}, \\ \boldsymbol{N} & = \begin{bmatrix} \boldsymbol{A}_1 & \boldsymbol{A}_2 & \boldsymbol{A}_5 \end{bmatrix}. \end{align} </math> Correspondingly, :<math> \begin{align} \boldsymbol{x} & = \begin{bmatrix} 0 & 0 & 5 & 5 & 0 \end{bmatrix}^{\mathrm{T}}, \\ \boldsymbol{\lambda} & = \begin{bmatrix} 0 & -4/3 \end{bmatrix}^{\mathrm{T}}, \\ \boldsymbol{s_N} & = \begin{bmatrix} 2/3 & 11/3 & 4/3 \end{bmatrix}^{\mathrm{T}}. \end{align} </math> A positive {{math\|'''''s<sub>N</sub>'''''}} indicates that {{math\|'''''x'''''}} is now optimal. ==Practical issues== ===Degeneracy=== {{seealso\|Simplex method#Degeneracy: ~~Stalling~~stalling and cycling}} Because the revised simplex method is mathematically equivalent to the simplex method, it also suffers from degeneracy, where a pivot operation does not ~~results~~result in a decrease in {{math\|'''''c'''''<~~math~~sup>~~\boldsymbol{c}^{\mathrm{~~T~~}} \boldsymbol{x}~~</~~math~~sup>'''''x'''''}}, and a chain of pivot operations causes the basis to cycle. A perturbation or lexicographic strategy can be used to prevent cycling and guarantee termination.{{sfn\|Nocedal\|Wright\|2006\|p=381\|loc=§13.5}} ===Basis representation=== Two types of [[System of linear equations\|linear systems]] involving ~~<math>\boldsymbol~~{{math\|'''''B'''''}}~~</math>~~ are present in the revised simplex method: :<math> \begin{align} \boldsymbol{B ~~x_B~~z} & = \boldsymbol{by}~~\text{~~,} \\ \boldsymbol{B}^{\mathrm{T}} \boldsymbol{~~\lambda~~z} & = \boldsymbol{~~c_B~~y}~~\text{~~.} \end{align} </math> Instead of refactorizing ~~<math>\boldsymbol~~{{math\|'''''B'''''}}~~</math>~~, usually an [[LU factorization]] is directly updated after each pivot operation, for which purpose there exist several strategies such as the Forrest−Tomlin and Bartels−Golub ~~method~~methods. However, the amount of data representing the updates as well as numerical errors builds up over time and makes periodic refactorization necessary.{{sfn\|Morgan\|1997\|loc=§2}}{{sfn\|Nocedal\|Wright\|2006\|p=372\|loc=§13.4}} ==Notes and references== Line 129 ⟶ 218: \|year=1997 \|url=http://www.cise.ufl.edu/research/sparse/Morgan/index.htm \|archive-url=https://web.archive.org/web/20110807134509/http://www.cise.ufl.edu/research/sparse/Morgan/index.htm ~~\|ref=harv}}~~ \|archive-date=7 August 2011 }} * {{cite book \|last1=Nocedal Line 148 ⟶ 239: \|___location=New York, NY, USA \|isbn=978-0-387-30303-1 \|url=~~http~~https://www.springer.com/mathematics/book/978-0-387-30303-1 }} ~~\|ref=harv}}~~ {{refend}} Line 157 ⟶ 248: [[Category:Exchange algorithms]] [[Category:Linear programming]] ~~[[Category:Operations research]]~~ ~~[[Category:Optimization algorithms and methods]]~~