On The Convergence of The Proximal Algorithm For Saddle-Point Problems
A. Moudafi
To cite this article: A. Moudafi (1995) On the convergence of the proximal algorithm for
saddle-point problems, Optimization, 33:3, 191-200, DOI: 10.1080/02331939508844076
A. MOUDAFI
LACO, URA 1586, Université de Limoges, 87060 Limoges Cedex, France
(Received 16 March 1994; in final form 14 September 1994)
The proximal algorithm for saddle-point problems $\min_{x\in X}\max_{y\in Y} L(x,y)$, where $X$, $Y$ are Hilbert spaces and
$L: X\times Y\to\mathbb{R}$ is a proper, closed convex-concave function on $X\times Y$, is considered. Under a minimal
assumption on $L$, $\delta := \inf\sup L = \sup\inf L$, it is proved that the proximal point algorithm (PPA), with
positive parameters $\{\lambda_k\}$, converges in general provided that $\sigma_n := \sum_{k=1}^{n}\lambda_k \to +\infty$.
1. INTRODUCTION
where $\{\lambda_k\}$ is a sequence of positive numbers.
Up to now the PPA was known to converge under the assumption that the generated
sequence is bounded or, equivalently, under the existence of saddle-points (Theorem 5,
Rockafellar [10]). The aim of this work is twofold. The first aim (perhaps the more important) is
to investigate the convergence of the PPA in the case where $L$ has no saddle-points. The
second is to give a direct proof in the case where Argminmax $L \neq \emptyset$; Theorem 5
in [10] is in fact obtained as an application of a result in the more general framework of
maximal monotone inclusions.
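As a small illustration of the saddle-point case, the sketch below runs an exact proximal iteration on the toy function $L(x,y) = xy$ on $\mathbb{R}\times\mathbb{R}$, whose unique saddle-point is $(0,0)$. It assumes the classical proximal step, in which $(x_k,y_k)$ is the saddle-point of $L(x,y) + \frac{1}{2\lambda_k}|x-x_{k-1}|^2 - \frac{1}{2\lambda_k}|y-y_{k-1}|^2$; the toy function, the function name `ppa_bilinear` and the constant step size are illustrative assumptions, not taken from the paper.

```python
import math

def ppa_bilinear(x0, y0, lambdas):
    """Exact proximal iterates for the toy saddle function L(x, y) = x * y.

    Assumed step: (x_k, y_k) is the saddle-point of
    L(x, y) + |x - x_prev|^2 / (2*lam) - |y - y_prev|^2 / (2*lam).
    """
    x, y = x0, y0
    iterates = [(x, y)]
    for lam in lambdas:
        denom = 1.0 + lam * lam
        # Closed-form solution of the optimality system
        #   x_k = x_prev - lam * y_k,   y_k = y_prev + lam * x_k.
        x, y = (x - lam * y) / denom, (y + lam * x) / denom
        iterates.append((x, y))
    return iterates

iterates = ppa_bilinear(1.0, 1.0, [0.5] * 50)
# Distance of each iterate to the saddle-point (0, 0).
norms = [math.hypot(x, y) for x, y in iterates]
```

For this toy function each step shrinks the distance to the saddle-point by the factor $1/\sqrt{1+\lambda_k^2}$, so the iterates stay bounded and converge to $(0,0)$.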
We use the following notation in this paper. A function $L: X\times Y\to\mathbb{R}$ is convex-concave
if it is convex in the $x$ variable and concave in the $y$ variable. For $\varepsilon > 0$ and
$(x_0,y_0)\in X\times Y$, $T_L^{\varepsilon}(x_0,y_0)$ is defined as the set of $(u_0,v_0)$ satisfying
and
Since
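$$2\langle x_{k-1}-x_k,\, u-x_k\rangle = |x_{k-1}-x_k|^2 + |u-x_k|^2 - |u-x_{k-1}|^2 \qquad (7)$$
and according to (6), we obtain
$$2\lambda_k L(u,y_k) \ge 2\lambda_k L(x_k,y_k) + |u-x_k|^2 - |u-x_{k-1}|^2 + |x_{k-1}-x_k|^2 - 2\lambda_k\varepsilon_k.$$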
Similarly, we have
$$2\lambda_k L(x_k,v) \le 2\lambda_k L(x_k,y_k) - |v-y_k|^2 + |v-y_{k-1}|^2 - |y_{k-1}-y_k|^2 + 2\lambda_k\varepsilon_k.$$
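Identity (7) is the usual three-point identity for the norm of a Hilbert space, and both inequalities above rest on it. The following sketch checks it numerically on random vectors of $\mathbb{R}^5$; the helper names and the choice of dimension are illustrative assumptions.

```python
import random

def inner(a, b):
    """Euclidean inner product of two vectors given as lists."""
    return sum(ai * bi for ai, bi in zip(a, b))

def sub(a, b):
    """Componentwise difference a - b."""
    return [ai - bi for ai, bi in zip(a, b)]

def sq_norm(a):
    """Squared Euclidean norm."""
    return inner(a, a)

def identity_gap(x_prev, x_k, u):
    """Absolute difference between the two sides of identity (7)."""
    lhs = 2.0 * inner(sub(x_prev, x_k), sub(u, x_k))
    rhs = sq_norm(sub(x_prev, x_k)) + sq_norm(sub(u, x_k)) - sq_norm(sub(u, x_prev))
    return abs(lhs - rhs)

rng = random.Random(0)  # fixed seed so the check is reproducible
gaps = []
for _ in range(100):
    x_prev, x_k, u = ([rng.uniform(-1.0, 1.0) for _ in range(5)] for _ in range(3))
    gaps.append(identity_gap(x_prev, x_k, u))
max_gap = max(gaps)  # expected to be at rounding-error level
```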
Summing the last inequalities for k = 1 to n, we obtain
and
Now, let us recall the following result which is due to Silverman-Toeplitz and which
will be needed in the proof of the next theorem.
Lemma 2.2 (see for example [3]): Let $\{\lambda_n\}$, $\{a_n\}$ be two sequences of numbers with
$\lambda_n \ge 0$ for all $n$ and such that $\lim_{n\to\infty} a_n$ exists. Then
+
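Assuming the lemma takes the classical Silverman-Toeplitz form for weighted means, namely that $\sigma_n^{-1}\sum_{k=1}^{n}\lambda_k a_k \to \lim_n a_n$ whenever $\sigma_n = \sum_{k=1}^{n}\lambda_k \to +\infty$, the sketch below illustrates why the divergence of $\sigma_n$ matters. The sequences and the helper `weighted_mean` are illustrative choices, not taken from the paper.

```python
def weighted_mean(weights, values):
    """Return sum(w_k * a_k) / sum(w_k)."""
    return sum(w * a for w, a in zip(weights, values)) / sum(weights)

n = 10_000
a = [1.0 + 1.0 / k for k in range(1, n + 1)]      # a_k -> 1

divergent = [1.0] * n                             # sigma_n = n -> +infinity
summable = [0.5 ** k for k in range(1, n + 1)]    # sigma_n -> 1, stays bounded

mean_divergent = weighted_mean(divergent, a)      # close to lim a_k = 1
mean_summable = weighted_mean(summable, a)        # close to 1 + ln 2, not 1
```

With summable weights the early terms of $\{a_k\}$ keep a fixed share of the mean, which is why the weighted mean settles near $1 + \ln 2$ instead of the limit of $\{a_k\}$.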
The next two results give convergence properties of the approximate PPA.
Theorem 2.3: Let $\{(x_k,y_k)\}$ be a sequence generated by (3) and assume
(i) $\lim_{n\to+\infty}\varepsilon_n = 0$,
(ii) $\lim_{n\to+\infty}\sigma_n = +\infty$,
then
We also have
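$$\limsup_{n\to+\infty}\frac{\sum_{k=1}^{n}\lambda_k L(x_k,y_k)}{\sigma_n} \le \inf_{x\in X}\sup_{y\in Y} L(x,y) + \lim_{n\to+\infty}\frac{|u-x_0|^2}{2\sigma_n} + \lim_{n\to+\infty}\frac{\sum_{k=1}^{n}\lambda_k\varepsilon_k}{\sigma_n}.$$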
From conditions (i), (ii) and Lemma 2.2, we obtain the announced result (8). □
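To see what such an averaged statement looks like when no saddle-point exists, consider the toy function $L(x,y) = e^{x} - e^{y}$ on $\mathbb{R}\times\mathbb{R}$, for which $\inf\sup L = \sup\inf L = 0$ but neither the infimum in $x$ nor the supremum in $y$ is attained. The sketch assumes exact proximal steps ($\varepsilon_k = 0$), which decouple here into two scalar equations solved by bisection; the function, the step sizes and all names are illustrative assumptions.

```python
import math

def prox_exp(c, lam, tol=1e-12):
    """Minimizer of exp(x) + (x - c)^2 / (2*lam), found by bisection.

    The optimality condition is exp(x) + (x - c)/lam = 0; its left side is
    increasing in x, negative at c - lam*exp(c) and positive at c.
    """
    lo, hi = c - lam * math.exp(c), c
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if math.exp(mid) + (mid - c) / lam > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

def ergodic_values(x0, y0, lambdas):
    """Run the decoupled proximal steps and record the weighted averages
    sum(lam_k * L(x_k, y_k)) / sigma_n for L(x, y) = exp(x) - exp(y)."""
    x, y = x0, y0
    weighted_sum, sigma = 0.0, 0.0
    averages = []
    for lam in lambdas:
        x = prox_exp(x, lam)  # minimization step in x
        y = prox_exp(y, lam)  # maximizing -exp(y) leads to the same equation
        weighted_sum += lam * (math.exp(x) - math.exp(y))
        sigma += lam
        averages.append(weighted_sum / sigma)
    return x, y, averages

x_n, y_n, averages = ergodic_values(1.0, -1.0, [1.0] * 2000)
```

In this run both $x_n$ and $y_n$ drift to $-\infty$, so the iterates are unbounded, while the weighted averages of $L(x_k,y_k)$ approach $0$.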
Let us now consider the case where Argminmax $L \neq \emptyset$.
Theorem 2.4: Let $\{(x_k,y_k)\}$ be a sequence generated by (3) and suppose Argminmax $L \neq \emptyset$;
then the following estimate holds true:
$$\frac{d^2\big((x_0,y_0),\,\operatorname{Argminmax} L\big)}{2\sigma_n} + \frac{\sum_{k=1}^{n}\lambda_k\varepsilon_k}{\sigma_n} \qquad (9)$$
If in addition,
(i) $\lambda_k \ge \lambda > 0$,
(ii) $\sum_{k=1}^{+\infty}\lambda_k\varepsilon_k < +\infty$,
then
Proof: Formula (9) follows by taking $u = \bar{x}$ in (5) and $v = \bar{y}$ in (4), where $(\bar{x},\bar{y})$ is the
point in Argminmax $L$ closest to $(x_0,y_0)$, and taking into account the fact that when
$(\bar{x},\bar{y})\in$ Argminmax $L$, we have
Let us first show that the sequence $\{(x_k,y_k)\}$ is bounded. Indeed, let $(\bar{x},\bar{y})\in$ Argminmax $L$;
thanks to (3) and (7) we get the relation (10):
yield
and
Passing to the limit in the last two inequalities, and thanks to (14) and the fact that
$\{(x_k,y_k)\}$ is bounded, we obtain
$$\limsup_{k\to+\infty} L(x_k,y_k) \le \delta \quad\text{and}\quad \liminf_{k\to+\infty} L(x_k,y_k) \ge \delta,$$
thus
$$\lim_{k\to+\infty} L(x_k,y_k) = \delta.$$
Now let $(x^*,y^*)$ be a weak cluster point of $\{(x_k,y_k)\}$. There exists a subsequence
$\{(x_\nu,y_\nu)\}$ which converges weakly to $(x^*,y^*)$ and
$$\left(\frac{x_{\nu-1}-x_\nu}{\lambda_\nu},\ \frac{y_{\nu-1}-y_\nu}{\lambda_\nu}\right) \in T_L^{\varepsilon_\nu}(x_\nu,y_\nu).$$
Since $\varepsilon_\nu \to 0$ and the graph of the maximal monotone operator $T_L$ is weakly-strongly
closed, we get at the limit
$$(0,0) \in T_L(x^*,y^*),$$
that is, $(x^*,y^*)$ is a saddle-point of $L$.
It remains to prove that there is no more than one weak cluster point. Our argument
follows that given in (Ref. 10, p. 885) and is presented here for completeness.
Let $(\tilde{x},\tilde{y})$ be another weak cluster point of $\{(x_k,y_k)\}$; we will show that $\tilde{x} = x^*$ and
$\tilde{y} = y^*$. This is a consequence of (13). Indeed,
-
we see that the limit of $\langle (x_k,y_k)-(x^*,y^*),\ (x^*,y^*)-(\tilde{x},\tilde{y})\rangle$ as $k\to\infty$ must exist. This
limit has to be zero because $(x^*,y^*)$ is a weak cluster point of $\{(x_k,y_k)\}$. Hence, at the
limit
The next theorem gives a convergence result for the iterative scheme (15).
Theorem 3.1: Let $\{(x_k,y_k)\}$ be a sequence generated by (15) and assume
then
$$\lim_{n\to+\infty}\frac{\sum_{k=1}^{n}\lambda_k L_k(x_k,y_k)}{\sigma_n} = \delta.$$
and
and
Passing to the limit in the above inequalities and taking into account Lemma 2.2 together with (i), (ii)
and (iii), we obtain
and
3.2. Example
We give an application to penalization in constrained saddle-point problems.
Let $C$ (resp. $D$) be a nonempty closed convex subset of $X$ (resp. of $Y$) and $L_0: X\times Y\to\mathbb{R}$ be
a continuous convex-concave function. It is well known that finding a saddle-point of
$L_0$ on $C\times D$ and finding a saddle-point of $L$ on $X\times Y$ are equivalent problems, where
$L$ is given by
where $\varphi_k: X\to\mathbb{R}^+$ and $\psi_k: Y\to\mathbb{R}^+$ are convex lower semicontinuous penalty functions
such that
This is the case for exterior penalty functions (see for example [2], [5] or [9]). Now, let $M_k$,
$m_k$ be the marginal functions associated with $L_k$; we have
and
Clearly
and
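$$M_k(x) \le f(x) := \begin{cases} M_0(x) & \text{if } x\in C,\\ +\infty & \text{otherwise,}\end{cases}$$
and
$$m_k(y) \ge g(y) := \begin{cases} m_0(y) & \text{if } y\in D,\\ -\infty & \text{otherwise.}\end{cases}$$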
and
$(g^+)^2, \quad 0 < r_k \uparrow +\infty.$
Quadratic exterior penalty
$(h^+)^2, \quad 0 < r_k \uparrow +\infty.$
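A one-dimensional sketch of a quadratic exterior penalty may help fix ideas. It assumes the common form $r_k\,(g^+)^2$ with $g^+ = \max(g, 0)$, applied to the hypothetical problem of minimizing $(x-2)^2$ subject to $g(x) = x - 1 \le 0$; the problem, the parameter schedule and the function names are illustrative assumptions, not taken from the paper.

```python
def penalty(x, r):
    """Quadratic exterior penalty r * max(g(x), 0)^2 for g(x) = x - 1."""
    return r * max(x - 1.0, 0.0) ** 2

def penalized_minimizer(r):
    """Minimizer over the real line of (x - 2)^2 + penalty(x, r).

    The objective is convex with slope -2 at x = 1, so the minimizer lies in
    the region x > 1, where 2(x - 2) + 2 r (x - 1) = 0 gives x = (2 + r)/(1 + r).
    """
    return (2.0 + r) / (1.0 + r)

penalty_params = [10.0 ** j for j in range(6)]   # r_k increasing to +infinity
minimizers = [penalized_minimizer(r) for r in penalty_params]
residual_penalties = [penalty(x, r) for x, r in zip(minimizers, penalty_params)]
```

The penalized minimizers stay outside the constraint set (hence "exterior") and decrease to the constrained solution $x = 1$ as $r_k \uparrow +\infty$, while the penalty term evaluated at these minimizers eventually tends to $0$.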
Acknowledgments
Thanks are due to Professor Bernard Lemaire for his many insightful comments.
References
[1] Auslender A. (1985) Two general methods for computing saddle points with applications for decomposing convex programming problems, Appl. Math. Optim. 13, 79-95
[2] Auslender A., Crouzeix J. P. and Fedit P. (1987) Penalty proximal methods in convex programming, J. Optimization Theory Applications, 55(1), 1-21
[3] Dunford N. and Schwartz J. (1988) Linear operators, part I, general theory, New York
[4] Güler O. (1991) On the convergence of the proximal algorithm for minimization problems, SIAM J. Control Optimization, 29, 403-419
[5] Lemaire B. (1988) Coupling optimization methods and variational convergence, Trends in Mathematical Optimization
[6] Lemaire B. About the convergence of the proximal method, 6th French-German conference on Optimization
[7] Martinet B. (1972) Algorithmes pour la résolution des problèmes d'optimisation et minimax, Thèse d'Etat, Université de Grenoble
[8] Moudafi A. (1993) Coupling proximal methods and variational convergence, ZOR-Methods and Models of Operations Research, 269-280
[9] Mouallif K. (1989) Convergence variationnelle et méthodes perturbées pour les problèmes d'optimisation et de point selle, Thèse d'Etat, Université de Liège
[10] Rockafellar R. T. (1976) Monotone operators and the proximal point algorithm, SIAM J. Control Optimization, 14, 877-898
[11] Rockafellar R. T. (1970) Monotone operators associated with saddle functions and minimax problems, Nonlinear functional analysis, Part I, F. E. Browder ed., Symposia in Pure Math., vol. 18, Amer. Math. Soc., Providence, 397-407