10 results on '"62H15, 62H10"'
Search Results
2. Regularized LRT for Large Scale Covariance Matrices: One Sample Problem
- Author
-
Choi, Young-Geun, Ng, Chi Tim, and Lim, Johan
- Subjects
Statistics - Methodology ,62H15, 62H10 - Abstract
The main theme of this paper is a modification of the likelihood ratio test (LRT) for testing high dimensional covariance matrix. Recently, the correct asymptotic distribution of the LRT for a large-dimensional case (the case $p/n$ approaches to a constant $\gamma \in (0,1]$) is specified by researchers. The correct procedure is named as corrected LRT. Despite of its correction, the corrected LRT is a function of sample eigenvalues that are suffered from redundant variability from high dimensionality and, subsequently, still does not have full power in differentiating hypotheses on the covariance matrix. In this paper, motivated by the successes of a linearly shrunken covariance matrix estimator (simply shrinkage estimator) in various applications, we propose a regularized LRT that uses, in defining the LRT, the shrinkage estimator instead of the sample covariance matrix. We compute the asymptotic distribution of the regularized LRT, when the true covariance matrix is the identity matrix and a spiked covariance matrix. The obtained asymptotic results have applications in testing various hypotheses on the covariance matrix. Here, we apply them to testing the identity of the true covariance matrix, which is a long standing problem in the literature, and show that the regularized LRT outperforms the corrected LRT, which is its non-regularized counterpart. In addition, we compare the power of the regularized LRT to those of recent non-likelihood based procedures., Comment: 27 pages, 7 figures
- Published
- 2015
- Full Text
- View/download PDF
3. Substitution principle for CLT of linear spectral statistics of high-dimensional sample covariance matrices with applications to hypothesis testing
- Author
-
Zheng, Shurong, Bai, Z. D., and Yao, Jiangfeng
- Subjects
Statistics - Methodology ,62H15, 62H10 - Abstract
Sample covariance matrices are widely used in multivariate statistical analysis. The central limit theorems (CLT's) for linear spectral statistics of high-dimensional non-centered sample covariance matrices have received considerable attention in random matrix theory and have been applied to many high-dimensional statistical problems. However, known population mean vectors are assumed for non-centered sample covariance matrices, some of which even assume Gaussian-like moment conditions. In fact, there are still another two most frequently used sample covariance matrices: the MLE (by subtracting the sample mean vector from each sample vector) and the unbiased sample covariance matrix (by changing the denominator $n$ as $N=n-1$ in the MLE) without depending on unknown population mean vectors. In this paper, we not only establish new CLT's for non-centered sample covariance matrices without Gaussian-like moment conditions but also characterize the non-negligible differences among the CLT's for the three classes of high-dimensional sample covariance matrices by establishing a {\em substitution principle}: substitute the {\em adjusted} sample size $N=n-1$ for the actual sample size $n$ in the major centering term of the new CLT's so as to obtain the CLT of the unbiased sample covariance matrices. Moreover, it is found that the difference between the CLT's for the MLE and unbiased sample covariance matrix is non-negligible in the major centering term although the two sample covariance matrices only have differences $n$ and $n-1$ on the dominator. The new results are applied to two testing problems for high-dimensional data., Comment: 36 pages, 23 references
- Published
- 2014
4. Testing linear hypotheses in high-dimensional regressions
- Author
-
Bai, Z., Jiang, D., Yao, J., and Zheng, S.
- Subjects
Statistics - Methodology ,Mathematics - Statistics Theory ,62H15, 62H10 - Abstract
For a multivariate linear model, Wilk's likelihood ratio test (LRT) constitutes one of the cornerstone tools. However, the computation of its quantiles under the null or the alternative requires complex analytic approximations and more importantly, these distributional approximations are feasible only for moderate dimension of the dependent variable, say $p\le 20$. On the other hand, assuming that the data dimension $p$ as well as the number $q$ of regression variables are fixed while the sample size $n$ grows, several asymptotic approximations are proposed in the literature for Wilk's $\bLa$ including the widely used chi-square approximation. In this paper, we consider necessary modifications to Wilk's test in a high-dimensional context, specifically assuming a high data dimension $p$ and a large sample size $n$. Based on recent random matrix theory, the correction we propose to Wilk's test is asymptotically Gaussian under the null and simulations demonstrate that the corrected LRT has very satisfactory size and power, surely in the large $p$ and large $n$ context, but also for moderately large data dimensions like $p=30$ or $p=50$. As a byproduct, we give a reason explaining why the standard chi-square approximation fails for high-dimensional data. We also introduce a new procedure for the classical multiple sample significance test in MANOVA which is valid for high-dimensional data., Comment: Accepted 02/2012 for publication in "Statistics". 20 pages, 2 pages and 2 tables
- Published
- 2012
- Full Text
- View/download PDF
5. On Identity Tests for High Dimensional Data Using RMT
- Author
-
Wang, Cheng, Yang, Jing, Miao, Baiqi, and Cao, Longbing
- Subjects
Statistics - Methodology ,Statistics - Applications ,62H15, 62H10 - Abstract
In this work, we redefined two important statistics, the CLRT test (Bai et.al., Ann. Stat. 37 (2009) 3822-3840) and the LW test (Ledoit and Wolf, Ann. Stat. 30 (2002) 1081-1102) on identity tests for high dimensional data using random matrix theories. Compared with existing CLRT and LW tests, the new tests can accommodate data which has unknown means and non-Gaussian distributions. Simulations demonstrate that the new tests have good properties in terms of size and power. What is more, even for Gaussian data, our new tests perform favorably in comparison to existing tests. Finally, we find the CLRT is more sensitive to eigenvalues less than 1 while the LW test has more advantages in relation to detecting eigenvalues larger than 1., Comment: 16 pages, 2 figures, 3 tables, To be published in the Journal of Multivariate Analysis
- Published
- 2012
- Full Text
- View/download PDF
6. A note on a Mar\v{c}enko-Pastur type theorem for time series
- Author
-
Yao, Jianfeng
- Subjects
Mathematics - Statistics Theory ,62H15, 62H10 - Abstract
In this note we develop an extension of the Mar\v{c}enko-Pastur theorem to time series model with temporal correlations. The limiting spectral distribution (LSD) of the sample covariance matrix is characterised by an explicit equation for its Stieltjes transform depending on the spectral density of the time series. A numerical algorithm is then given to compute the density functions of these LSD's.
- Published
- 2011
- Full Text
- View/download PDF
7. Test for the mean matrix in a Growth Curve model for high dimensions.
- Author
-
Srivastava, Muni S. and Singull, Martin
- Subjects
- *
GROWTH curves (Statistics) , *MULTIVARIATE analysis , *LINEAR statistical models , *MAXIMUM likelihood statistics , *COVARIANCE matrices - Abstract
We consider the problem of estimating and testing a general linear hypothesis in a general multivariate linear model, the so-called Growth Curve model, when thep×Nobservation matrix is normally distributed. The maximum likelihood estimator (MLE) for the mean is a weighted estimator with the inverse of the sample covariance matrix which is unstable for largepclose toNand singular forplarger thanN. We modify the MLE to an unweighted estimator and propose new tests which we compare with the previous likelihood ratio test (LRT) based on the weighted estimator, i.e., the MLE. We show that the performance of these new tests based on the unweighted estimator is better than the LRT based on the MLE. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
8. Regularized LRT for large scale covariance matrices: One sample problem
- Author
-
Young-Geun Choi, Chi Tim Ng, and Johan Lim
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,62H15, 62H10 ,Covariance function ,Covariance matrix ,Applied Mathematics ,05 social sciences ,Identity matrix ,Covariance ,01 natural sciences ,Methodology (stat.ME) ,010104 statistics & probability ,Estimation of covariance matrices ,Scatter matrix ,0502 economics and business ,Statistics ,Law of total covariance ,Statistics::Methodology ,Applied mathematics ,0101 mathematics ,Statistics, Probability and Uncertainty ,CMA-ES ,Statistics - Methodology ,050205 econometrics ,Mathematics - Abstract
The main theme of this paper is a modification of the likelihood ratio test (LRT) for testing high dimensional covariance matrix. Recently, the correct asymptotic distribution of the LRT for a large-dimensional case (the case $p/n$ approaches to a constant $\gamma \in (0,1]$) is specified by researchers. The correct procedure is named as corrected LRT. Despite of its correction, the corrected LRT is a function of sample eigenvalues that are suffered from redundant variability from high dimensionality and, subsequently, still does not have full power in differentiating hypotheses on the covariance matrix. In this paper, motivated by the successes of a linearly shrunken covariance matrix estimator (simply shrinkage estimator) in various applications, we propose a regularized LRT that uses, in defining the LRT, the shrinkage estimator instead of the sample covariance matrix. We compute the asymptotic distribution of the regularized LRT, when the true covariance matrix is the identity matrix and a spiked covariance matrix. The obtained asymptotic results have applications in testing various hypotheses on the covariance matrix. Here, we apply them to testing the identity of the true covariance matrix, which is a long standing problem in the literature, and show that the regularized LRT outperforms the corrected LRT, which is its non-regularized counterpart. In addition, we compare the power of the regularized LRT to those of recent non-likelihood based procedures., Comment: 27 pages, 7 figures
- Published
- 2017
9. Testing linear hypotheses in high-dimensional regressions
- Author
-
Jianfeng Yao, Shurong Zheng, Zhidong Bai, and Dan Jiang
- Subjects
FOS: Computer and information sciences ,Statistics and Probability ,Variables ,62H15, 62H10 ,media_common.quotation_subject ,Null (mathematics) ,Mathematics - Statistics Theory ,Context (language use) ,Statistics Theory (math.ST) ,Methodology (stat.ME) ,Dimension (vector space) ,Sample size determination ,Likelihood-ratio test ,FOS: Mathematics ,Applied mathematics ,Statistics, Probability and Uncertainty ,Random matrix ,Statistics - Methodology ,Quantile ,Mathematics ,media_common - Abstract
For a multivariate linear model, Wilk's likelihood ratio test (LRT) constitutes one of the cornerstone tools. However, the computation of its quantiles under the null or the alternative requires complex analytic approximations and more importantly, these distributional approximations are feasible only for moderate dimension of the dependent variable, say $p\le 20$. On the other hand, assuming that the data dimension $p$ as well as the number $q$ of regression variables are fixed while the sample size $n$ grows, several asymptotic approximations are proposed in the literature for Wilk's $\bLa$ including the widely used chi-square approximation. In this paper, we consider necessary modifications to Wilk's test in a high-dimensional context, specifically assuming a high data dimension $p$ and a large sample size $n$. Based on recent random matrix theory, the correction we propose to Wilk's test is asymptotically Gaussian under the null and simulations demonstrate that the corrected LRT has very satisfactory size and power, surely in the large $p$ and large $n$ context, but also for moderately large data dimensions like $p=30$ or $p=50$. As a byproduct, we give a reason explaining why the standard chi-square approximation fails for high-dimensional data. We also introduce a new procedure for the classical multiple sample significance test in MANOVA which is valid for high-dimensional data., Comment: Accepted 02/2012 for publication in "Statistics". 20 pages, 2 pages and 2 tables
- Published
- 2013
10. A note on a Mar\v{c}enko-Pastur type theorem for time series
- Author
-
Jianfeng Yao
- Subjects
Statistics and Probability ,Spectral power distribution ,Series (mathematics) ,62H15, 62H10 ,Mathematical analysis ,Mathematics - Statistics Theory ,Extension (predicate logic) ,Type (model theory) ,Order of integration ,Estimation of covariance matrices ,Statistics, Probability and Uncertainty ,Time series ,Singular spectrum analysis ,Mathematics - Abstract
In this note we develop an extension of the Mar cenko-Pastur the- orem to time series model with temporal correlations. The limiting spectral distribution (LSD) of the sample covariance matrix is characterised by an explicit equation for its Stieltjes transform depending on the spectral den- sity of the time series. A numerical algorithm is then given to compute the density functions of these LSD's.
- Published
- 2011
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.