Abstract Generated abstract
This paper generalizes the multidimensional Bochner theorem for positive definite functions to positive definite generalized kernels associated with systems of differential, and more general, operators. Using Hilbert space scales, generalized eigenvector expansions, and spectral measures for self-adjoint commuting operator closures, it establishes integral representations of positive definite kernels as superpositions of elementary kernels satisfying eigenvalue equations. The results are specialized to differential operators acting in separate variables, yielding matrix-valued spectral representations in terms of fundamental solutions of ordinary differential equations. A criterion based on uniqueness for associated evolution equations is used to verify self-adjointness and commutativity, leading to Bochner-type representation theorems that include classical Bochner, Bernstein, and Krein-type cases.
Full Text
MATHEMATICS
Yu. M. BEREZANSKII
ONE GENERALIZATION OF THE MULTIDIMENSIONAL BOCHNER THEOREM
(Presented by Academician S. L. Sobolev, 16 IX 1960)
In this note the \(n\)-dimensional Bochner theorem on positive definite (p.d.) functions is generalized to the case when the role of \(e^{i\lambda_j x_j}\) is played by eigenfunctions of differential (and more general) operators. For \(n=1\) the generalization was obtained by M. G. Krein \((^1)\) and by the author \((^2)\) (in the case of generalized kernels—by Maurin \((^3)\)); for \(n>1\) substantial difficulties arise, connected with the necessity of extending Hermitian commuting operators to self-adjoint commuting ones. The considerations of this article correspond to the case when the closures of the operators are already self-adjoint; they are based on the theory of expansions in generalized eigenvectors (for the literature see \((^4)\)). The note is connected with the work of A. G. Kostyuchenko and B. S. Mityagin \((^5)\), in which similar results were obtained for kernels generating a commutative ring. We do not assume a connection with rings, thanks to which the class of differential operators is substantially enlarged (for example, operators of order higher than the second appear)\(*\).
\(1^\circ\). Let \(H_0\) be a complete Hilbert space with scalar product \((f,g)_0\); let \(H_+\) be a linear set from \(H_0\), dense in it and itself a complete Hilbert space with respect to another scalar product \((u,v)_+\), such that \(\|u\|_0 \leq \|u\|_+\) \((u\in H_+)\). The bilinear form \(B(f,u)=(f,u)_0\) is continuous in \(f\in H_0\) and \(u\in H_+\); therefore it can be written in the form \((f,u)_0=(If,u)_+\), where \(I\) is an operator continuously acting from \(H_0\) into \(H_+\). Introduce in \(H_0\) a new scalar product \((f,g)_-=(If,g)_0\) and, carrying out the completion, obtain the Hilbert space \(H_-\). Thus,
\[ H_- \supset H_0 \supset H_+, \tag{1} \]
where each space of this chain is dense in the one standing to its left, and
\(\|u\|_- \leq \|u\|_0 \leq \|u\|_+\).
It is not difficult to show that a bilinear form \((\alpha,u)_0\) \((\alpha\in H_-,\; u\in H_+)\) is defined, becoming the scalar product in \(H_0\) if \(\alpha\in H_0\). Every linear continuous functional \(l(u)\) on \(H_+\) can be written in the form \(l(u)=(\alpha,u)_0\), where \(\alpha\) is some element of \(H_-\). Thus, \(H_-\) may be interpreted as the space of linear functionals on \(H_+\) (it is clear that it is isometric to \(H_+\)). Elements of the spaces \(H_+\), \(H_0\), and \(H_-\)—spaces with positive, zero, and negative norms—we shall denote respectively by \(u,v,\ldots;\ f,g,\ldots\) and \(\alpha,\beta,\ldots\). The space \(H_-\) may be regarded as the space of “generalized vectors” over the basic space \(H_+\) of “smooth vectors.”
Let the space \(H_{++}\) be in relation to \(H_+\) in the same position as \(H_+\) is in relation to \(H_0\). Then, from \(H_{++}\) and \(H_0\), one can construct the negative space \(H_{--}\). It is easy to see that
\[
H_{--}\supset H_-\supset H_0\supset H_+\supset H_{++}
\]
with the same density relations and inequalities as in (1). We call the space \(H_+\) nuclear with respect to \(H_0\) if \(I\), considered as an operator in \(H_0\), has finite trace.
\(*\) The main constructions of the article were reported by the author at a seminar at the Institute of Mathematics of the Academy of Sciences of the USSR in the summer of 1959 (see also \((^4)\)). What is new is Theorem 3, instead of which a cruder result had previously been used, leading to estimates more restrictive than (6).
\(2^\circ\). Tensor-multiplying the spaces of the chain (1) by themselves, we obtain the chain \(H_{-}\times H_{-}\supset H_0\times H_0\supset H_{+}\times H_{+}\). It can be shown that the space \(H_{-}\times H_{-}\) will be negative with respect to the zero space \(H_0\times H_0\) and positive \(H_{+}\times H_{+}\). The elements of the spaces \(H_{+}\times H_{+}\), \(H_0\times H_0\), and \(H_{-}\times H_{-}\), by analogy with the case of spaces of functions, will be called respectively smooth, ordinary, and generalized kernels.
In all that follows we shall assume that in the space \(H_0\) an involution \(f\to \bar f\) has been introduced, which is also an involution for \(H_{+}\). Now the notions of Hermitian and positive definite kernels are introduced in the following way: to each generalized kernel \(K\) there corresponds the bilinear form \(B_K(u,v)=(K,\bar vu)_{H_0\times H_0}\) (the index \(H_0\times H_0\) will henceforth be omitted), where \(u,v\in H_{+}\), and \(\bar vu\) is the tensor product of the vector \(v\) by \(u\). The kernel \(K\) is called Hermitian (positive definite) if the form \(B_K\) is Hermitian (positive definite). Our main problem is the representation of a positive definite kernel in the form of a linear combination of elementary positive definite kernels.
Let in \(H_0\) a system of operators \(A^1,\ldots,A^p\) be defined with domain \(\mathfrak D\), dense in \(H_0\). In what follows we shall consider only separable positive spaces and such spaces that contain \(\mathfrak D\) as their dense part and contain all \(A^j(\mathfrak D)\) \((j=1,\ldots,p)\). Generalized positive definite kernels \(\varphi_\lambda\in H_{-}\times H_{-}\), \(\|\varphi_\lambda\|_{H_{-}\times H_{-}}\leqslant 1\) \((\lambda=(\lambda_1,\ldots,\lambda_p)\) is a parameter) will be called elementary with respect to the system of operators \(A^1,\ldots,A^p\) if \((\varphi_\lambda,(A^j-\lambda_jE)u\cdot\bar v)=0\), \((\varphi_\lambda,\bar v\cdot(A^j-\lambda_jE)u)=0\) \((j=1,\ldots,p)\) for all \(u\in\mathfrak D\) and \(v\in H_{+}\) (from the Hermitian character of the positive definite kernel it follows that one of these equalities entails the other). On the basis of the results of the article \((^4)\) the following is obtained.
Theorem 1. Let \(K\in H_{-}\times H_{-}\) be a generalized positive definite kernel; denote by \(H_K\) the completion of \(H_{+}\) with respect to the scalar product \(\langle u,v\rangle=(K,\bar vu)\) \((u,v\in H_{+})\). Suppose first that \(p=1\). If the operator \(A^1\) is Hermitian in \(H_K\): \(\langle A^1u,v\rangle=\langle u,A^1v\rangle\) \((u,v\in\mathfrak D)\), then the representation is always valid
\[ K=\int_{-\infty}^{\infty}\varphi_\lambda\,d\rho(\lambda), \tag{2} \]
where \(\varphi_\lambda\in H_{--}\times H_{--}\) is some family of elementary positive definite kernels with respect to \(A^1\), and \(d\rho(\lambda)\) is a nonnegative finite measure on the axis; the integral is understood in the weak sense with respect to \(H_{--}\times H_{--}\). Here \(H_{++}\supset H_{+}\) is any positive space, chosen in such a way that it is nuclear with respect to \(H_{+}\). Conversely, if for the kernel \(K\in H_{-}\times H_{-}\) a representation (2) is valid with some family of elementary kernels \(\varphi_\lambda\in H_{--}\times H_{--}\), then \(A^1\) is Hermitian in \(H_K\). In the representation (2) the expression \(\varphi_\lambda\,d\rho(\lambda)\) is determined uniquely if and only if the closure of the operator \(A^1\) is maximal in \(H_K\).
Let now \(p>1\). If the closures of the operators \(A^1,\ldots,A^p\) are self-adjoint in \(H_K\) and their resolutions of the identity \(E^1(\Delta_1),\ldots,E^p(\Delta_p)\) commute for arbitrary intervals \(\Delta_1,\ldots,\Delta_p\), then the representation (2) is valid with the measure \(d\rho(\lambda)\) in \(p\)-dimensional space and with a family of kernels elementary with respect to \(A^1,\ldots,A^p\). The choice of \(H_{++}\) is the same as above; in the representation (2) the expression \(\varphi_\lambda\,d\rho(\lambda)\) is determined uniquely. Conversely, if for the kernel \(K\in H_{-}\times H_{-}\) a representation (2) is valid with some family of elementary kernels \(\varphi_\lambda\in H_{--}\times H_{--}\), with \(\varphi_\lambda\,d\rho(\lambda)\) determined uniquely, then the closures of the operators \(A^1,\ldots,A^p\) are self-adjoint in \(H_K\) and their resolutions of the identity commute.
\(3^\circ\). Let us apply this theorem to differential operators \(A^j\). The case \(p=1\) has been sufficiently studied \((^1,^2)\); everywhere in what follows we assume \(p>1\).
Let \(G\) be a finite or infinite domain of \(n\)-dimensional space; \(H_0=L_2(G)\), the involution being ordinary passage to the complex conjugate; \(\mathfrak D\) contains the set \(C_0^\infty(G)\) of functions finitely and infinitely differentiable relative to \(G\), and \(A^j u=\mathcal L^j[u]\) \((u\in C_0^\infty(G))\), where \(\mathcal L^j\) are linear differential expressions with infinitely differentiable coefficients \((j=1,\ldots,p)\). Then the definition of an elementary positive-definite kernel shows that \(\varphi_\lambda\), in each of the variables, is a generalized solution of a homogeneous linear differential equation, and therefore in the case, for example, of ellipticity of the expressions \(\mathcal L^j\) (or when they are in ordinary derivatives) is, inside \(G\), an ordinary kernel \(\varphi_\lambda(x,y)\) \((x,y\in G)\), satisfying the equations \(\mathcal L_x^j\varphi_\lambda=\lambda_j\varphi_\lambda,\ \mathcal L_y^j\varphi_\lambda=\lambda_j\varphi_\lambda\) \((j=1,\ldots,p)\). If \(\mathcal L^j\) are expressions in partial derivatives, then these equalities still do not make it possible to express \(\varphi_\lambda\) in terms of a standard system of functions depending only on \(\mathcal L^j\) and not depending on \(K\) (cf. (?)). However, in one important case this can be done:
Theorem 2. Let \(G=G_1\times\cdots\times G_n\), where \(G_j\) is a finite or infinite interval of variation of the variable \(x_j\); \(\mathcal L^j\) is an ordinary differential expression in the variable \(x_j\) of order \(r_j\); \(\chi_1^j(x_j,\mu),\ldots,\chi_{r_j}^j(x_j,\mu)\) is a fixed fundamental system of solutions of \(\mathcal L^j[u]=\mu u\). Put
\[
X_{\mathbf j}(x,\lambda)=\chi_{j_1}^1(x_1,\lambda_1)\cdots \chi_{j_n}^n(x_n,\lambda_n)
\quad
(x=(x_1,\ldots,x_n),\ \lambda=(\lambda_1,\ldots,\lambda_n)).
\]
Here \(\mathbf j=(j_1,\ldots,j_n)\) is a combined index, varying over the set \(N\) of points with coordinates \(j_l=1,\ldots,r_l\) \((l=1,\ldots,n)\).
Let \(K\in H_{-}\times H_{-}\) be a generalized positive-definite kernel; if the closures of the corresponding \(\mathcal L^j\)-operators \(A^1,\ldots,A^n\) are self-adjoint in \(H_K\) and their resolutions of the identity commute, then the representation
\[
K=\int_{-\infty}^{\infty}\sum_{\mathbf j,\mathbf k\in N}
X_{\mathbf j}(x,\lambda)\overline{X_{\mathbf k}(y,\lambda)}\,d\rho_{\mathbf j\mathbf k}(\lambda),
\tag{3}
\]
holds, where the matrix \(\|d\rho_{\mathbf j\mathbf k}(\lambda)\|_{\mathbf j,\mathbf k\in N}\) is positive-definite in the sense that
\[
\sum_{\mathbf j,\mathbf k\in N}
\rho_{\mathbf j\mathbf k}(\Delta)\xi_{\mathbf j}\overline{\xi_{\mathbf k}}\ge 0
\quad
(\rho_{\mathbf j\mathbf k}(\Delta)=\rho_{\mathbf j\mathbf k}(\lambda'')-\rho_{\mathbf j\mathbf k}(\lambda'),\
\Delta=[\lambda',\lambda''))
\tag{4}
\]
for any \(\Delta\) and any set of numbers \(\xi_{\mathbf j}\). The integral in (3) converges in the sense of weak convergence in \(H_{-}\times H_{-}\); \(H_{++}\) is chosen in the same way as in Theorem 1.
\(4^\circ\). The principal difficulty that arises in applying Theorems 1 and 2 is the verification of the self-adjointness of the operators \(A^j\) and the commutativity of their resolutions of the identity. We shall now formulate (in a general form) the relevant results. Suppose that for each \(j=1,\ldots,n\) there is given a chain \(H_-^j\supset H_0^j\supset H_+^j\) of type (1); assume that the spaces \(H_+\) and \(H_0\) considered earlier have respectively the form \(H_+^1\times\cdots\times H_+^n\) and \(H_0^1\times\cdots\times H_0^n\); then \(H_-=H_-^1\times\cdots\times H_-^n\). Let \(B^j\) be an operator in \(H_+^j\) with domain of definition \(\mathfrak D(B^j)\) dense in \(H_+^j\); introduce the operator \(A=E\times\cdots\times E\times B^j\times E\times\cdots\times E\) (\(B^j\) stands in the \(j\)-th place) in \(H_+\) with domain of definition
\[
\mathfrak D(A^j)=H_+^1\times\cdots\times H_+^{j-1}\times \mathfrak D(B^j)\times H_+^{j+1}\times\cdots\times H_+^n,
\]
dense in \(H_+\), and hence also in \(H_0\). Everywhere below we assume that \(A^j\) is Hermitian in \(H_K\).
Theorem 3. Suppose that for each \(j\), for both equations \(du_t/dt\pm iB^{j*}u_t=0\) \((0\le t<\infty)\), considered in the Hilbert space \(H_+^j\), uniqueness of weak solutions holds. Then the closures of the operators \(A^j\) are self-adjoint in \(H_K\) and their resolutions of the identity commute.
\(5^\circ\). The scheme indicated in item \(4^\circ\) can be realized for ordinary differential operators \(B^j\) on the whole axis with constant coefficients, acting for different \(j\) in different variables; the proof of uniqueness of solutions in this case is carried out
on the basis of the method developed in [6]. Then, applying Theorem 2, we obtain integral representations of positive-definite kernels. We state the main result, restricting ourselves, for simplicity of formulation, to the case of ordinary positive-definite kernels.
Theorem 4. Let in the \(n\)-dimensional space \(E_n\) there be given a continuous positive-definite kernel \(K(x,y)\) \((x,y\in E_n)\), satisfying, in the sense of generalized functions, the relations
\[ \mathscr L^j_{x_j}[K(x,y)]=\overline{\mathscr L^j_{y_j}[K(x,y)]}\qquad (j=1,\ldots,n), \tag{5} \]
where \(\mathscr L^1,\ldots,\mathscr L^n\) are differential expressions with constant coefficients in the variables, respectively, \(x_1,\ldots,x_n\), and of orders \(r_1,\ldots,r_n\). If for some \(\varepsilon>0\) the estimate* holds
\[ |K(x_1,\ldots,x_n,y_1,\ldots,y_n)|\leq C\exp\{|x_1|^{r'_1-\varepsilon}+ \]
\[ +|y_1|^{r'_1-\varepsilon}+\cdots+|x_n|^{r'_n-\varepsilon}+|y_n|^{r'_n-\varepsilon}\} \qquad (C>0;\ x,y\in E_n), \tag{6} \]
where \(r'_j\), for \(r_j\geq 2\), is the number conjugate to \(r_j\) \((1/r_j+1/r'_j=1)\), and for \(r_j=1\) is an arbitrary positive number, then the representation
\[ K(x,y)=\int_{-\infty}^{\infty}\sum_{j,k\in N} X_j(x,\lambda)\overline{X_k(y,\lambda)}\,d\rho_{jk}(\lambda) \qquad (x,y\in E_n) \tag{7} \]
holds, with an absolutely convergent integral. Here the functions \(X_j\) and the matrix \(\|\rho_{jk}\|\) are the same as in Theorem 2.
In this paper we shall not dwell in detail on questions of the uniqueness of the representation (7); we note only that a certain uniqueness of it follows from Theorem 1. It is also clear that the converse assertion to Theorem 4 is valid in the known sense.
From Theorem 4 one can obtain a number of \(n\)-dimensional theorems of Bochner type. Thus, if \(k(x)\) \((x\in E_n)\) is a continuous function for which the kernel \(K(x,y)=k(x-y)\) is positive-definite, then (5) is satisfied with \(\mathscr L^j=i\,\partial/\partial x_j\). Writing the representation (7) for \(K\) and putting \(y=0\), we obtain Bochner’s theorem (here condition (6) is satisfied automatically). If the kernel \(K(x,y)=k(x+y)\) is positive-definite and \(k(x)\) grows at infinity no faster than \(\exp\{|x_1|^{r'_1}+\cdots+|x_n|^{r'_n}\}\), where \(r'_1,\ldots,r'_n\) are some positive numbers, then the representation (7) gives the \(n\)-dimensional theorem of S. N. Bernstein (in this case \(\mathscr L^j=\partial/\partial x_j\)). If the positive-definite kernel \(K(x,y)=k(x+y)+k(x-y)\) and \(k(x)\) grows at infinity no faster than \(\exp\{|x_1|^{2-\varepsilon}+\cdots+|x_n|^{2-\varepsilon}\}\) \((\varepsilon>0)\), then one may take \(\mathscr L^j=\partial^2/\partial x_j^2\); the representation (7) then gives an expansion of \(k(x)\) in terms of \(\cos\sqrt{\lambda_j}x_j\) and \(\sin\sqrt{\lambda_k}x_k\). If, in addition, the evenness of \(k(x)\) is known, then in this representation the terms \(\sin\sqrt{\lambda_k}x_k\) vanish, and we obtain a theorem of M. G. Krein type. One could also consider the case when from \(k(x)\) the kernel \(K(x,y)\) is formed by a combination of the indicated operations, for example
\(K(x,y)=k(x_1-y_1,\ x_2+y_2)+k(x_1-y_1,\ x_2-y_2)\). If \(k(x)\) depends analytically on \(x_j\), then the method given for forming the kernel \(K(x,y)\) can be generalized to the case of expressions \(\mathscr L^j\) of order higher than the second.
Institute of Mathematics
Academy of Sciences of the Ukrainian SSR
Received
14.IX.1960
CITED LITERATURE
- M. G. Krein, DAN, 53, No. 1 (1946).
- Yu. M. Berezanskii, Matem. sborn., 47 (89), No. 2 (1959).
- K. Maurin, Bull. Acad. Polon. Sci., Ser. Math., Astr. et Phys., 6, No. 3 (1958).
- Yu. M. Berezanskii, Ukr. matem. zhurn., 11, No. 1 (1959).
- A. G. Kostyuchenko, B. S. Mityagin, DAN, 131, No. 1 (1960).
- I. M. Gelfand, G. E. Shilov, Generalized Functions, vol. 3, Moscow, 1958.
* The growth of the kernel \(K\) must be somewhat less than the growth of \(\exp\{|x_1|^{r'_1}+|y_1|^{r'_1}+\cdots\}\); instead of subtracting \(\varepsilon\), one could have given finer estimates.