A novel variational Bayesian method for variable selection in logistic regression models

Chun Xia Zhang, Shuang Xu, Jiang She Zhang

科研成果: 期刊稿件文章同行评审

26 引用 (Scopus)

摘要

With high-dimensional data emerging in various domains, sparse logistic regression models have gained much interest of researchers. Variable selection plays a key role in both improving the prediction accuracy and enhancing the interpretability of built models. Bayesian variable selection approaches enjoy many advantages such as high selection accuracy, easily incorporating many kinds of prior knowledge and so on. Because Bayesian methods generally make inference from the posterior distribution with Markov Chain Monte Carlo (MCMC) techniques, however, they become intractable in high-dimensional situations due to the large searching space. To address this issue, a novel variational Bayesian method for variable selection in high-dimensional logistic regression models is presented. The proposed method is based on the indicator model in which each covariate is equipped with a binary latent variable indicating whether it is important. The Bernoulli-type prior is adopted for the latent indicator variable. As for the specification of the hyperparameter in the Bernoulli prior, we provide two schemes to determine its optimal value so that the novel model can achieve sparsity adaptively. To identify important variables and make predictions, one efficient variational Bayesian approach is employed to make inference from the posterior distribution. The experiments conducted with both synthetic and some publicly available data show that the new method outperforms or is very competitive with some other popular counterparts.

源语言英语
页(从-至)1-19
页数19
期刊Computational Statistics and Data Analysis
133
DOI
出版状态已出版 - 5月 2019
已对外发布

指纹

探究 'A novel variational Bayesian method for variable selection in logistic regression models' 的科研主题。它们共同构成独一无二的指纹。

引用此