"Sure Independence Screening in the Presence of Data That is Missing at" by Adriano Zanin Zambom and Gregory J. Matthews

Mathematics and Statistics: Faculty Publications and Other Works

Title

Sure Independence Screening in the Presence of Data That is Missing at Random

Authors

Adriano Zanin Zambom, California State University
Gregory J. Matthews, Loyola University ChicagoFollow

Document Type

Article

Publication Date

5-29-2019

Publication Title

Statistical Papers

Publisher Name

Springer

Abstract

Variable selection in ultra-high dimensional data sets is an increasingly prevalent issue with the readily available data arising from, for example, genome-wide associations studies or gene expression data. When the dimension of the feature space is exponentially larger than the sample size, it is desirable to screen out unimportant predictors in order to bring the dimension down to a moderate scale. In this paper we consider the case when observations of the predictors are missing at random. We propose performing screening using the marginal linear correlation coefficient between each predictor and the response variable accounting for the missing data using maximum likelihood estimation. This method is shown to have the sure screening property. Moreover, a novel method of screening that uses additional predictors when estimating the correlation coefficient is proposed. Simulations show that simply performing screening using pairwise complete observations is out-performed by both the proposed methods and is not recommended. Finally, the proposed methods are applied to a gene expression study on prostate cancer.

Comments

Author Posting © Springer-Verlag GmbH Germany, part of Springer Nature, 2019. This is the author's version of the work. It is posted here by permission of Springer-Verlag GmbH Germany, part of Springer Nature for personal use, not for redistribution. The definitive version was published in Statistical Papers, May 2019. https://doi.org/10.1007/s00362-019-01115-w

Recommended Citation

Zanin Zambom, Adriano and Matthews, Gregory J.. Sure Independence Screening in the Presence of Data That is Missing at Random. Statistical Papers, , : , 2019. Retrieved from Loyola eCommons, Mathematics and Statistics: Faculty Publications and Other Works, http://dx.doi.org/10.1007/s00362-019-01115-w

COinS

Mathematics and Statistics: Faculty Publications and Other Works

Title

Sure Independence Screening in the Presence of Data That is Missing at Random

Authors

Document Type

Publication Date

Publication Title

Publisher Name

Abstract

Comments

Recommended Citation

Creative Commons License

Copyright Statement

Included in

Submission Tools

Explore

For Contributors

About eCommons

Mathematics and Statistics: Faculty Publications and Other Works

Title

Sure Independence Screening in the Presence of Data That is Missing at Random

Authors

Document Type

Publication Date

Publication Title

Publisher Name

Abstract

Comments

Recommended Citation

Creative Commons License

Copyright Statement

Included in

Share

Submission Tools

Explore

For Contributors

About eCommons