TY - JOUR
T1 - Multi-Task Sparse Canonical Correlation Analysis with Application to Multi-Modal Brain Imaging Genetics
AU - Du, Lei
AU - Liu, Kefei
AU - Yao, Xiaohui
AU - Risacher, Shannon L.
AU - Han, Junwei
AU - Saykin, Andrew J.
AU - Guo, Lei
AU - Shen, Li
N1 - Publisher Copyright:
© 2004-2012 IEEE.
PY - 2021/1/1
Y1 - 2021/1/1
N2 - Brain imaging genetics studies the genetic basis of brain structures and functionalities via integrating genotypic data such as single nucleotide polymorphisms (SNPs) and imaging quantitative traits (QTs). In this area, both multi-task learning (MTL) and sparse canonical correlation analysis (SCCA) methods are widely used since they are superior to those independent and pairwise univariate analysis. MTL methods generally incorporate a few of QTs and could not select features from multiple QTs; while SCCA methods typically employ one modality of QTs to study its association with SNPs. Both MTL and SCCA are computational expensive as the number of SNPs increases. In this paper, we propose a novel multi-task SCCA (MTSCCA) method to identify bi-multivariate associations between SNPs and multi-modal imaging QTs. MTSCCA could make use of the complementary information carried by different imaging modalities. MTSCCA enforces sparsity at the group level via the {\mathrm G}_{2,1}G2,1-norm, and jointly selects features across multiple tasks for SNPs and QTs via the \ell _{2,1}ℓ2,1-norm. A fast optimization algorithm is proposed using the grouping information of SNPs. Compared with conventional SCCA methods, MTSCCA obtains better correlation coefficients and canonical weights patterns. In addition, MTSCCA runs very fast and easy-to-implement, indicating its potential power in genome-wide brain-wide imaging genetics.
AB - Brain imaging genetics studies the genetic basis of brain structures and functionalities via integrating genotypic data such as single nucleotide polymorphisms (SNPs) and imaging quantitative traits (QTs). In this area, both multi-task learning (MTL) and sparse canonical correlation analysis (SCCA) methods are widely used since they are superior to those independent and pairwise univariate analysis. MTL methods generally incorporate a few of QTs and could not select features from multiple QTs; while SCCA methods typically employ one modality of QTs to study its association with SNPs. Both MTL and SCCA are computational expensive as the number of SNPs increases. In this paper, we propose a novel multi-task SCCA (MTSCCA) method to identify bi-multivariate associations between SNPs and multi-modal imaging QTs. MTSCCA could make use of the complementary information carried by different imaging modalities. MTSCCA enforces sparsity at the group level via the {\mathrm G}_{2,1}G2,1-norm, and jointly selects features across multiple tasks for SNPs and QTs via the \ell _{2,1}ℓ2,1-norm. A fast optimization algorithm is proposed using the grouping information of SNPs. Compared with conventional SCCA methods, MTSCCA obtains better correlation coefficients and canonical weights patterns. In addition, MTSCCA runs very fast and easy-to-implement, indicating its potential power in genome-wide brain-wide imaging genetics.
KW - Brain imaging genetics
KW - multi-task sparse canonical correlation analysis
KW - sparse canonical correlation analysis
UR - http://www.scopus.com/inward/record.url?scp=85100564077&partnerID=8YFLogxK
U2 - 10.1109/TCBB.2019.2947428
DO - 10.1109/TCBB.2019.2947428
M3 - 文章
C2 - 31634139
AN - SCOPUS:85100564077
SN - 1545-5963
VL - 18
SP - 227
EP - 239
JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics
JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics
IS - 1
M1 - 8869839
ER -