Robust sparse regression by modeling noise as a mixture of gaussians

Shuang Xu, Chun Xia Zhang

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Regression analysis has been proven to be a quite effective tool in a large variety of fields. In many regression models, it is often assumed that noise is with a specific distribution. Although the theoretical analysis can be greatly facilitated, the model-fitting performance may be poor since the supposed noise distribution may deviate from real noise to a large extent. Meanwhile, the model is also expected to be robust in consideration of the complexity of real-world data. Without any assumption about noise, we propose in this paper a novel sparse regression method called MoG-Lasso to directly model noise in linear regression models via a mixture of Gaussian distributions (MoG). Meanwhile, the L1 penalty is included as a part of the loss function of MoG-Lasso to enhance its ability to identify a sparse model. As for the parameters in MoG-Lasso, we present an efficient algorithm to estimate them via the EM (expectation maximization) and ADMM (alternating direction method of multipliers) algorithms. With some simulated and real data contaminated by complex noise, the experiments show that the novel model MoG-Lasso performs better than several other popular methods in both ‘p>n’ and ‘p<n’ situations, including Lasso, LAD-Lasso and Huber-Lasso.

Original languageEnglish
Pages (from-to)1738-1755
Number of pages18
JournalJournal of Applied Statistics
Volume46
Issue number10
DOIs
StatePublished - 27 Jul 2019
Externally publishedYes

Keywords

  • lasso
  • mixture of Gaussians
  • penalized regression
  • Robust regression
  • variable selection

Fingerprint

Dive into the research topics of 'Robust sparse regression by modeling noise as a mixture of gaussians'. Together they form a unique fingerprint.

Cite this