Abstract
Asymptotic properties of model selection criteria for high-dimensional regression models are studied where the dimension of covariates is much larger than the sample size. Several sufficient conditions for model selection consistency are provided. Non-Gaussian error distributions are considered and it is shown that the maximal number of covariates for model selection consistency depends on the tail behavior of the error distribution. Also, sufficient conditions for model selection consistency are given when the variance of the noise is neither known nor estimated consistently. Results of simulation studies as well as real data analysis are given to illustrate that finite sample performances of consistent model selection criteria can be quite different.
Original language | English |
---|---|
Pages (from-to) | 1037-1057 |
Number of pages | 21 |
Journal | Journal of Machine Learning Research |
Volume | 13 |
State | Published - Apr 2012 |
Keywords
- General information criteria
- High dimension
- Model selection consistency
- Regression