A RELAXATION METHOD FOR FINDING A COMMON POINT OF CONVEX SETS AND ITS APPLICATION TO OPTIMIZATION PROBLEMS
MATHEMATICS
Submitted 1966-01-01 | SovietRxiv: ru-196601.22781 | Translated from Russian

Abstract Generated abstract

This paper develops a general relaxation method for finding a point common to a family of closed convex sets in a linear topological space, using generalized projections defined by a nonnegative distance-like function. Under stated compactness, convexity, and continuity conditions, it proves convergence properties of the resulting relaxation sequences for cyclic choice of sets and for a maximal-distance control rule, including conditions ensuring uniqueness of the limit. The framework includes ordinary Hilbert-space projection as a special case and extends to divergences generated by strictly convex functions, yielding applications to constrained optimization, entropy-type maximization, and transportation or passenger-flow calculation problems.

Full Text

UDC 519.25/26+519.3:330.115

MATHEMATICS

L. M. BREGMAN

A RELAXATION METHOD FOR FINDING A COMMON POINT OF CONVEX SETS AND ITS APPLICATION TO OPTIMIZATION PROBLEMS

(Presented by Academician L. V. Kantorovich on 26 II 1966)

Let closed convex sets \(A_i\) \((i \in I)\) be given in a linear topological space \(X\). Suppose that \(R=\bigcap_{i\in I} A_i\) is nonempty.

Let \(S\) be some convex set in \(X\) such that \(S \cap R \ne \Lambda\).

Consider a function \(D(x,y)\), defined on \(S \times S\) and possessing the following properties:

(I). \(D(x,y)\ge 0\); \(D(x,y)=0\) if and only if \(x=y\).

(II). For each \(i\in I\) and \(y\in S\) there exists a point \(x=P_i y \in A_i\cap S\) such that \(D(x,y)\le D(z,y)\) for all \(z\in A_i\cap S\). We shall call this point \(x\) the \(D\)-projection of the point \(y\) onto the set \(A_i\).

(III). For each \(i\in I\) and \(y\in S\) the function

\[ G(z)=D(z,y)-D(z,P_i y) \]

is convex on \(A_i\cap S\).

(IV). There exists the derivative \(D_x'(x,y)\) of the function \(D(x,y)\) at \(x=y\), and moreover

\[ D_x'(y,y)=0 \quad \left( \text{i.e. } \lim_{t\to 0}\frac{D(y+tz,y)}{t}=0 \text{ for all } z\in X \right). \]

(V). For each \(z\in R\cap S\) and for each real number \(L\), the set \(T=\{x\mid D(z,x)\le L\}\) is compact.

(VI). If \(D(x_n,y_n)\to 0\), \(y_n\to y^*\in \bar S\), and the set of elements of the sequence \(\{x_n\}\) is compact, then \(x_n\to y^*\).

Lemma 1. Let \(z\in A_i\cap S\). Then for any \(y\in S\) the inequality holds

\[ D(P_i y,y)\le D(z,y)-D(z,P_i y). \]

Proof. According to condition (III), we have

\[ D(\lambda z+(1-\lambda)P_i y,y) - D(\lambda z+(1-\lambda)P_i y,P_i y) \le \]

\[ \le \lambda\bigl(D(z,y)-D(z,P_i y)\bigr) + (1-\lambda)D(P_i y,y) \]

for all \(\lambda\in[0,1]\). Hence for \(\lambda>0\) we obtain

\[ D(z,y)-D(z,P_i y)-D(P_i y,y)\ge \]

\[ \ge \frac{D(\lambda z+(1-\lambda)P_i y,y)-D(P_i y,y)}{\lambda} - \frac{D(\lambda z+(1-\lambda)P_i y,P_i y)}{\lambda} \ge \]

\[ \ge - \frac{D(\lambda z+(1-\lambda)P_i y,P_i y)}{\lambda}. \]

Passing to the limit in this inequality as \(\lambda\to 0\), we obtain

\[ D(z,y)-D(z,P_i y)-D(P_i y,y)\ge 0. \]

The lemma is proved.

Consider the following iterative process: take an arbitrary point \(x_0 \in S\), then choose \(i_1(x_0) \in I\) and find a point \(x_1\)—the \(D\)-projection of the point \(x_0\) onto the set \(A_{i_1(x_0)}\); then choose \(i_2(x_1) \in I\) and find a point \(x_2\)—the \(D\)-projection of the point \(x_1\) onto the set \(A_{i_2(x_1)}\), and so on. The sequence \(\{x_n\}\) obtained as a result of this process will be called a relaxation sequence, and the set of functions \(\{i_1(x), i_2(x), \ldots\}\) will be called a relaxation control (cf. (1)).

Lemma 2. For any relaxation control:

  1. The set of elements of the relaxation sequence \(\{x_n\}\) is compact.
  2. \(D(x_{n+1}, x_n) \to 0\) as \(n \to \infty\).
  3. For any \(z \in R \cap S\) there exists \(\lim\limits_{n \to \infty} D(z, x_n)\).

Proof. Take \(z \in R \cap S\). By Lemma 1,

\[ D(x_{n+1}, x_n) \leq D(z, x_n) - D(z, x_{n+1}). \tag{1} \]

Since \(D(x_{n+1}, x_n) \geq 0\), it follows that \(D(z, x_{n+1}) \leq D(z, x_n)\), and consequently \(\lim D(z, x_n)\) exists; together with (1) this gives \(D(x_{n+1}, x_n) \to 0\). Since the set of elements of the relaxation sequence is contained in the set
\[ T=\{x \mid D(z,x) \leq D(z,x_0)\}, \]
and, according to (V), the set \(T\) is compact, assertion 1 is true.

Let us now consider some relaxation controls.

Theorem 1. Let \(I=\{1,2,\ldots,m\}\), and let the indices be chosen in cyclic order, i.e.
\[ i_1(x)=1,\quad i_2(x)=2,\ldots,\quad i_m(x)=m,\quad i_{m+1}(x)=1,\ldots \]
Then any limit point \(x^*\) of the relaxation sequence \(\{x_n\}\) is a common point of the sets \(A_i\).

Proof. By Lemma 2, from the sequences \(\{x_{nm+i}\} \subset A_i\) one can extract convergent subsequences. Let \(x_{n_k m+i} \to x_i^* \in A_i\). According to Lemma 2,
\[ D(x_{n_k m+2}, x_{n_k m+1}) \to 0. \]
By property (VI), \(x_{n_k m+1} \to x_2^*\). Therefore, \(x_1^* = x_2^*\). Analogously one can show that
\[ x_2^* = x_3^*,\quad x_3^* = x_4^*,\ldots,\quad x_{m-1}^* = x_m^*. \]
Consequently, the limit point
\[ x^* = x_1^* = \cdots = x_m^* \in R. \]

Theorem 2. Suppose that for each \(y \in S\) there exists

\[ \max_{i \in I} \min_{x \in A_i} D(x,y)=D(P_{i(y)}y,y) \]

and let \(i_n(x)=i(x)\). Then any limit point \(x^*\) of the relaxation sequence \(\{x_n\}\) is a common point of the sets \(A_i\).

Proof. Let \(x_{n_k} \to x^*\). Let \(y_{n_k}^i = P_i x_{n_k} \in A_i\). Then

\[ D(y_{n_k}^i, x_{n_k}) \leq \max_{i \in I} D(y_{n_k}^i, x_{n_k}) = D(x_{n_k+1}, x_{n_k}). \]

Consequently, \(D(y_{n_k}^i, x_{n_k}) \to 0\) for all \(i \in I\). Since for any \(z \in R \cap S\)
\[ D(z, y_{n_k}^i) \leq D(z, x_{n_k}) \leq D(z, x_0), \]
then, according to (V), the set \(\{y_{n_k}^i\}\) is compact. Consequently, by property (VI), \(y_{n_k}^i \to x^*\) for all \(i \in I\). Hence it follows that \(x^* \in R\).

Remark. In many cases the relaxation sequence \(\{x_n\}\) has a unique limit point. This is the case, for example, if the following condition is fulfilled:

(A). The set \(S\) is closed, and for any \(z_1,z_2 \in R \cap S\) the function
\[ H(y)=D(z_1,y)-D(z_2,y) \]
is continuous on \(S\).

In fact, let \(x^*\) and \(x^{**}\) be limit points of the relaxation sequence \(\{x_n\}\) and let \(x^*, x^{**} \in R\). Let \(x_{n_k} \to x^*\), \(x_{n_l} \to x^{**}\). By Lemma 2, there exists

\[ \lim H(x_n)=\lim \bigl(D(x^*,x_n)-D(x^{**},x_n)\bigr), \]

\[ \lim H(x_{n_k})=-D(x^{**},x^*)\leq 0, \]

\[ \lim H(x_{n_l})=D(x^*,x^{**})\geq 0. \]

Since \(\lim H(x_{n_k})=\lim H(x_{n_l})=\lim H(x_n)\), it follows that \(D(x^{**},x^*)=0\), and hence \(x^{**}=x^*\).

Let us consider some examples of functions satisfying conditions (I)—(VI).

  1. Let \(X\) be a real Hilbert space. The function
    \[ D(x,y)=(x-y,x-y) \]
    satisfies conditions (I)—(VI) and condition (A) for any system of closed convex sets \(A_i\), if convergence is understood as weak convergence, and compactness as weak compactness. In this case the \(D\)-projection onto a convex set coincides with the usual projection. The corresponding relaxation sequence \(\{x_n\}\) under the conditions of Theorem 1 or 2 will converge weakly to an element \(x^*\in R\). In this case the relaxation method coincides with the method of successive projection \((^2)\).

  2. Let \(f(x)\) be a strictly convex twice differentiable function defined on some convex closed set \(S\subset E^n\). Let \(g(x)\) be the gradient of this function at the point \(x\). Consider the function
    \[ D(x,y)=f(x)-f(y)-(g(y),x-y). \]
    If this function satisfies condition (V), then, as is easy to verify, it satisfies conditions (I)—(IV), (VI) and (A) for any system of closed convex sets \(A_i\).

Suppose the following conditions are fulfilled:

1) The absolute minimum of the function \(f(x)\) is attained at a point \(x_0\) lying in the interior of \(S\).

2) The sets \(A_i\) are hyperplanes, i.e.
\[ A_i=\left\{x\in E^n\ \middle|\ \sum_{j=1}^{n} a_{ij}x_j=b_i\right\}, \]
and the set \(I\) is finite.

3) If \(y\) belongs to the interior of \(S\), then the \(D\)-projection of the point \(y\) onto any set \(A_i\) also belongs to the interior of \(S\).

In this case one can show that the corresponding relaxation sequence with initial approximation \(x_0\) converges to the point \(x^*\), which delivers the minimum of the function \(f(x)\) under the conditions
\[ \sum_{j=1}^{n} a_{ij}x_j=b_i \]
\((i\in I)\).

  1. Let
    \[ D(x,y)=\sum_{j=1}^{n}\bigl(y_j-x_j+x_j(\ln x_j-\ln y_j)\bigr) \]
    be given on the set
    \[ \{x\geq 0,\ y>0\}. \]
    This function satisfies conditions (I)—(VI), but does not satisfy condition (A). Nevertheless, in this case too the relaxation sequence will have a unique limit point.

Indeed, if \(x^*\in R\) is a limit point of the sequence \(\{x_n\}\) and \(x_{n_k}\to x^*\), then \(\lim D(x^*,x_{n_k})=0\), and since \(\lim D(x^*,x_n)\) exists, by property (VI) \(x_n\to x^*\).

If

\[ A_i=\left\{x \,\middle|\, \sum_{j=1}^{n} a_{ij}x_j=b_i\right\},\qquad \left(\sum_{j=1}^{n} a_{ij}^{2}>0\right), \]

then \(x^*\) maximizes

\[ \sum_{j=1}^{n} x_j \ln \frac{p_j}{x_j}\quad (p_j>0) \]

under the conditions

\[ \sum_{j=1}^{n} a_{ij}x_j=b_i,\qquad x_j \geqslant 0, \]

if the vector \(x_0=\{p_j/e\}\) is taken as the initial approximation. The \(D\)-projection \(x\) of the point \(y\) onto the set \(A_i\) is found from the formulas \(x_j=y_j e^{\lambda a_{ij}}\), where \(\lambda\) is the unique root of the equation

\[ \sum_{j=1}^{n} a_{ij}y_j e^{\lambda a_{ij}}=b_i . \]

This equation becomes linear with respect to \(e^\lambda\) if all \(a_{ij}\) are equal to 0 or 1.

The latter occurs, for example, in the following problem: maximize

\[ \sum_{i=1}^{m}\sum_{j=1}^{n} x_{ij}\ln \frac{p_{ij}}{x_{ij}}\quad (p_{ij}\geqslant 0) \]

under the conditions

\[ \sum_{j=1}^{n} x_{ij}=a_i,\qquad \sum_{i=1}^{m} x_{ij}=b_j; \]

\(x_{ij}\geqslant 0\); \(x_{ij}=0\) if \(p_{ij}=0\). Such problems arise in the calculation of passenger flows in cities, and the corresponding relaxation method for this problem coincides with the method of G. V. Sheleikhovsky (see \((^3)\)).

Leningrad State University
named after A. A. Zhdanov

Received
13 II 1966

REFERENCES

\(^1\) D. K. Faddeev, V. N. Faddeeva, Computational Methods of Linear Algebra, Moscow, 1963. \(^2\) L. M. Bregman, DAN, 162, No. 3, 487 (1965). \(^3\) A. G. Dynkin, E. G. Movchan, in: Application of Mathematical Methods and Electronic Computers in Urban Planning, Kiev, 1966.

Submission history

A RELAXATION METHOD FOR FINDING A COMMON POINT OF CONVEX SETS AND ITS APPLICATION TO OPTIMIZATION PROBLEMS