{{One source|date=November 2023}}
The '''center-of-gravity method''' is a theoretical algorithm for [[convex optimization]]. It can be seen as a generalization of the [[bisection method]] from one-dimensional functions to multi-dimensional functions.<ref name=":0">{{Cite web |last=Nemirovsky and Ben-Tal |date=2023 |title=Optimization III: Convex Optimization |url=http://www2.isye.gatech.edu/~nemirovs/OPTIIILN2023Spring.pdf}}</ref>{{Rp|___location=Sec.8.2.2}} It is theoretically important as it attains the optimal convergence rate; however, it has little practical value, since each step is computationally very expensive.
== Input ==
Our goal is to solve a [[convex optimization]] problem of the form:<blockquote>'''minimize ''f''(''x'') s.t. ''x'' in ''G'','''</blockquote>where ''f'' is a [[convex function]], and ''G'' is a [[Convex set|convex subset]] of the Euclidean space ''R''<sup>''n''</sup>.
We assume that we have a "subgradient oracle": a routine that can compute a [[subgradient]] of ''f'' at any given point (if ''f'' is differentiable, then the only subgradient is the [[gradient]] <math>\nabla f</math>; but we do not assume that ''f'' is differentiable).
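For illustration, a subgradient oracle for the convex but non-differentiable function ''f''(''x'') = max<sub>''i''</sub> ''x<sub>i</sub>'' might be sketched as follows (a minimal sketch; the function and names are illustrative, not from the source):

```python
import numpy as np

def subgradient_oracle(x):
    """Return a subgradient of f(x) = max_i x_i.

    f is convex but not differentiable at points where the maximum
    is attained by more than one coordinate.  Any indicator vector
    of a maximizing coordinate is a valid subgradient; we return
    the one for the first maximizer.
    """
    g = np.zeros_like(x, dtype=float)
    g[int(np.argmax(x))] = 1.0
    return g
```

At points where ''f'' is differentiable (a unique maximizer), this oracle returns the gradient itself, consistent with the remark above.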
== Method ==
The method is [[Iterative method|iterative]]. At each iteration ''t'', we keep a convex region ''G<sub>t</sub>'', which surely contains the desired minimum. Initially we have ''G''<sub>0</sub> = ''G''. Then, each iteration ''t'' proceeds as follows.
* Let ''x<sub>t</sub>'' be the [[Centroid|center of gravity]] of ''G<sub>t</sub>''.
* Compute a subgradient at ''x<sub>t</sub>'', denoted ''f''<nowiki/>'(''x<sub>t</sub>'').
** By definition of a subgradient, the graph of ''f'' lies above its linear approximation at ''x<sub>t</sub>'', so for all ''x'' in ''G<sub>t</sub>'': ''f''(''x'')−''f''(''x<sub>t</sub>'') ≥ (''x''−''x<sub>t</sub>'')<sup>T</sup>''f''<nowiki/>'(''x<sub>t</sub>'').
* Cut ''G<sub>t</sub>'' through its center of gravity: let ''G''<sub>''t''+1</sub> be the set of all ''x'' in ''G<sub>t</sub>'' with (''x''−''x<sub>t</sub>'')<sup>T</sup>''f''<nowiki/>'(''x<sub>t</sub>'') ≤ 0. By the inequality above, every minimizer of ''f'' over ''G'' is contained in ''G''<sub>''t''+1</sub>.
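In one dimension the center of gravity of an interval is its midpoint, so the method reduces to the bisection method mentioned above. The iteration can be sketched as follows (illustrative names; assumes a subgradient oracle <code>df</code> for a convex function on [''lo'', ''hi'']):

```python
def cg_method_1d(df, lo, hi, iters=60):
    """Center-of-gravity method on an interval [lo, hi].

    In one dimension G_t is an interval and its center of gravity
    is the midpoint, so the method is bisection on the subgradient.
    """
    for _ in range(iters):
        mid = (lo + hi) / 2.0   # x_t = center of gravity of [lo, hi]
        g = df(mid)             # subgradient f'(x_t)
        if g > 0:               # minimizer lies in {x : (x - x_t) * g <= 0}
            hi = mid
        else:
            lo = mid
    return (lo + hi) / 2.0
```

For example, minimizing ''f''(''x'') = (''x''−2)<sup>2</sup> with derivative 2(''x''−2) on [0, 10] converges to ''x'' = 2.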
== Convergence ==
It can be proved that <blockquote><math>Volume(G_{t+1})\leq \left[1-\left(\frac{n}{n+1}\right)^n\right]\cdot Volume(G_t)</math> .</blockquote>Therefore,<blockquote><math>f(x_t) - \min_G f \leq \left[1-\left(\frac{n}{n+1}\right)^n\right]^{t/n} [\max_G f - \min_G f] </math>.</blockquote>In other words, the method has [[linear convergence]] of the residual objective value, with convergence rate <math>\left[1-\left(\frac{n}{n+1}\right)^n\right]^{1/n} \leq (1-1/e)^{1/n} </math>. To get an ε-approximation to the objective value, the number of required steps is at most <math>2.13 n \ln(1/\epsilon) + 1 </math>.<ref name=":0" />{{Rp|___location=Sec.8.2.2}}
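These bounds can be checked numerically. The following sketch (illustrative names) evaluates the per-step volume-retention factor and the step bound from the statement above:

```python
import math

def retained_fraction(n):
    """Upper bound on Volume(G_{t+1}) / Volume(G_t): 1 - (n/(n+1))^n."""
    return 1 - (n / (n + 1)) ** n

def steps_for_eps(n, eps):
    """Upper bound 2.13 * n * ln(1/eps) + 1 on the number of steps."""
    return 2.13 * n * math.log(1 / eps) + 1

# retained_fraction(n) increases with n but stays below 1 - 1/e ~ 0.632,
# matching the bound [1 - (n/(n+1))^n]^(1/n) <= (1 - 1/e)^(1/n).
```

For instance, in dimension ''n'' = 1 each step discards at least half the volume (retained fraction 0.5), and for ''n'' = 5 and ε = 0.01 the bound gives roughly 50 steps.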
== Computational complexity ==
The main problem with the method is that, in each step, we have to compute the center of gravity of a polytope. All methods known so far for this problem require a number of arithmetic operations that is exponential in the dimension ''n''.<ref name=":0" />{{Rp|___location=Sec.8.2.2}} Therefore, the method is not useful in practice when there are 5 or more dimensions.
== See also ==
The [[ellipsoid method]] can be seen as a tractable approximation to the center-of-gravity method.
Instead of maintaining the feasible polytope ''G<sub>t</sub>'', it maintains an ellipsoid that contains it. Computing the center of an ellipsoid is much easier than computing the center of gravity of a general polytope, so each step of the ellipsoid method can be performed in polynomial time.
== References ==
{{Reflist}}