Abstract Generated abstract
The paper studies similar tests for a general linear hypothesis in normal linear models when observation variances, or weights, are unknown and may differ. Extending results of Linnik on Scheffé-type tests and Neyman structures beyond the Behrens-Fisher setting, it considers tests defined through independent linear forms and homogeneous functions whose small-ratio behavior forces acceptance of the null hypothesis. The main theorem shows that, under these conditions, similarity can hold only if the variances of the auxiliary linear forms are proportional to a common variance factor independently of the unknown variance vector. This yields a Neyman structure for the associated exponential family, with comments on choosing the forms in subsample problems and on a multivariate extension using matrix analogues of the test statistics.
Full Text
UDC 519.28
MATHEMATICS
I. S. DOBROKHOTOV
ON TESTS OF A GENERAL LINEAR HYPOTHESIS WITH UNKNOWN WEIGHTS OF OBSERVATIONS
(Presented by Academician Yu. V. Linnik on 30 III 1967)
In the book of Yu. V. Linnik \((^1)\), properties are indicated that characterize similar Scheffé-type tests as Neyman structures. Such tests have hitherto been known only in the Behrens—Fisher problem. The conditions found are satisfied, for example, by all tests based on the \(t\)-ratio, in particular by the well-known test of E. Barankin \((^3)\) for one problem on linear regression. In \((^1)\) the question is posed of extending the results obtained to the case of a general linear hypothesis with unknown weights of observations. The present note serves this purpose.
Let \(X_1,\ldots,X_N\), where \(X_i \in N(a_i,\sigma_i^2)\), be a repeated sample, and suppose that
\[ a_i=\sum_1^s \alpha_{ij}\beta_j \]
(\(\beta_j\) are new parameters, and the matrix \(\alpha=\|\alpha_{ij}\|\) is assumed given). Suppose that the \(\sigma_i^2\), generally speaking, are different. The general linear hypothesis \(H_0\) is the assertion
\[ \beta_1=B_1,\ldots,\beta_r=B_r, \]
where \(B_i\) \((i=1,\ldots,r)\) are given constants.
Denote by \(\mathfrak X\) the space generated by stochastically independent linear forms \(l_1,\ldots,l_\mu\) and \(L_1,\ldots,L_\nu\) such that
\[
E\{l_i\mid H_0\}=E\{L_j\}=0^* \quad (i=1,\ldots,\mu;\ j=1,\ldots,\nu),
\]
\[
D\{l_i\}/D\{l_1\}=C_i \quad (i=1,\ldots,\mu),
\]
and \(C_i>0\) does not depend on \(\sigma=(\sigma_1,\ldots,\sigma_N)\), whereas the forms \(L_j\) may have different variances.
Consider two continuous homogeneous functions \(T_1\) and \(T_2\) of degree \(n>0\) with the following properties:
\[
T_1(l_1,\ldots,l_\mu)>0,\quad \text{if }(l_1,\ldots,l_\mu)\ne(0,\ldots,0),
\]
\[
T_2(L_1,\ldots,L_\nu)>0,\quad \text{if }(L_1,\ldots,L_\nu)=(0,\ldots,0),
\]
and
\[ T_2>C>0, \tag{1} \]
if at least one of its arguments is equal to one. Then the following is valid.
Theorem. If a similar test \(\varphi\) (generally speaking, randomized) is defined on the space \(\mathfrak X\) and accepts the null hypothesis at least under the conditions
\[ T_1/T_2\leqslant \varepsilon \tag{2} \]
(\(\varepsilon>0\) sufficiently small) and
\[ \sqrt{Q_1}=\left[\sum_1^\mu \frac{l_i^2}{c_i}\right]^{1/2}<\delta \tag{3} \]
\[ {}^*\ \text{The mathematical expectations of the forms }L_j\text{ are equal to zero for any values }\beta=(\beta_1,\ldots,\beta_s). \]
(\(\delta>0\) is any fixed number), then the variances of the linear forms \(L_1,\ldots,L_\nu\) satisfy the relations
\[ D\{L_j\}/D\{l_1\}=d_j \qquad (j=1,\ldots,\nu), \tag{4} \]
and \(d_j>0\) do not depend on \(\sigma\).
It follows immediately from this that \(\varphi\) is a Neyman structure for the exponential family generated by the forms \(l_1,\ldots,l_\mu,L_1,\ldots,L_\nu\). For \(\mu=1\) and in the case of the Behrens—Fisher problem, we again obtain the result of Yu. V. Linnik \((^1)\).
As in \((^1)\), the proof is by contradiction and is based on consideration of the test-similarity condition
\[ E_\sigma\{\varphi\mid H_0\}=\alpha \quad \text{uniformly in } \sigma . \tag{5} \]
Let us briefly explain the proof. Suppose that (4) holds for \(j=1,\ldots,p<\nu\), and transform (5) as described in \((^1)\). As a result, \(\sigma\) are replaced by new parameters \(\theta\), which we regard as complex. Then we consider a parametric point that is singular for both sides of the transformed relation (5). In a neighborhood of this point the stated relation has the form
\[ \int_{\mathfrak x}\cdots\int \varphi\, \frac{dl_1\cdots dl_\mu\,dL_1\cdots dL_\nu}{Q^{\tau+(\mu+\nu)/2}} \]
\[ = A_1G_1(\tau)\xi^{-\tau}(i\xi)^{-\tau-(\nu-p)/2} \prod_{p+1}^{\nu}\left[d_{jN-1}i\xi(d_{jN}-d_{jN-1})\right]^{1/2}, \tag{6} \]
where
\[ Q=Q_1+\sum_{1}^{p}\frac{L_j^2}{d_j} +\sum_{p+1}^{\nu}L_j^2 \frac{i\xi}{d_{jN-1}i\xi+(d_{jN}-d_{jN-1})} +i\xi^2; \]
\(A_1\) is a positive constant; \(G_1(\tau)\) is a function regular in \(\operatorname{Re}(\tau)>0\), and \(\xi>0\) is a number which we henceforth make arbitrarily small. We are now interested in the behavior of the moduli of both sides of (6) as \(\xi\downarrow 0\). A lower estimate for the modulus of the right-hand side has the form
\[ B\xi^{-2\tau-(\nu-p)/2}, \]
where the symbol \(B\) denotes a bounded quantity. To obtain an upper estimate for the modulus of the left-hand side of (6), we divide the space \(\mathfrak x\) into “layers”:
I. \(\;2^{m-1}\xi\le \sqrt{Q_1}<2^m\xi,\)
II. \(\;\dfrac{1}{2^m}\xi\le \sqrt{Q_1}<\dfrac{1}{2^{m-1}}\xi\) inside (3),
III. \(\;2^{m-1}\delta\le \sqrt{Q_1}<2^m\delta\) in the remaining part of the space \(\mathfrak x\).
The estimate is carried out separately in each “layer.” In doing so, the properties of the functions \(T_1\) and \(T_2\) and the inequalities
\[ |Q|\ge |\operatorname{Re}Q|\ge Q_1,\qquad |Q|\ge |\operatorname{Im}Q|\ge B\xi^2 \]
are used for \(|L_j|<\xi\). In the end we obtain the desired estimate
\[ B\xi^{-2\tau}. \]
To avoid a contradiction in the behavior of both sides of (6), we must have \(\nu-p=0\).
Since the exponential family generated by the forms \(l_1,\ldots,l_\mu,L_1,\ldots,L_\nu\) is one-parametric under \(H_0\), it follows from the Lehmann—Scheffé theory that \(\varphi\) is a Neyman structure with respect to the statistic
\[ \sum_{1}^{\mu}\frac{l_i^2}{\sigma_i}+\sum_{1}^{\nu}\frac{L_j^2}{d_j}. \]
The forms \(l_i\) are naturally to be chosen in such a way that the indicated statistic is not sufficient under the specified alternative \(H_1\); then the test \(\varphi\) is nontrivial.
In many practical cases the sample consists of \(m\) independent subsamples of sizes \(n_k\) \((k=1,\ldots,m)\), drawn respectively from
\[ N(a_k,\sigma_k^2). \]
Then \(N=\sum_1^m n_k\), and \(\mu,\nu\) must satisfy the condition
\[ \mu+\nu \leq \min n_k . \]
The optimal choice of the forms \(l_1,\ldots,l_\mu,L_1,\ldots,L_\nu\) depends on the particular form of the functions \(T_1\) and \(T_2\) and can be carried out, for example, on the basis of the same considerations as in (2) or (3), if tests based on the \(F\)-distribution are constructed.
The result obtained admits a multivariate generalization. In this case \(T_1\) and \(T_2\) are replaced respectively by the matrices \(Q_1\) and \(Q_2\). Instead of inequality (2), acceptance of the null hypothesis is postulated under the condition
\[ \operatorname{sp} Q_1 Q_2^{-1} \leq \varepsilon . \]
Condition (1) takes the form
\[ |Q_2| > c > 0, \]
if all entries of the matrix \(Q_2\) are bounded.
Received
28 III 1967
REFERENCES
\(^{1}\) Yu. V. Linnik, Statistical Problems with Nuisance Parameters, “Nauka,” 1966.
\(^{2}\) H. Scheffe, Ann. Math. Statistics, 14, No. 1, 35 (1943).
\(^{3}\) E. W. Barankin, Proc. Berkeley Symp. on Math. Stat. and Prob., 1949.