TY - GEN
T1 - Identification of disease-related genetic variants and imaging factors leveraging summary statistics
AU - Xi, Duo
AU - Cui, Dingnan
AU - Zhang, Minjianan
AU - Zhang, Jin
AU - Shang, Muheng
AU - Guo, Lei
AU - Du, Lei
AU - Han, Junwei
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Brain imaging genetics offers insights into the genetic basis of brain structure and function by exploring the relationships between genetic variations and neuroimaging features, with canonical correlation association learning as a vital and effective tool. However, imaging large cohorts affected by specific brain diseases entails significant costs. To tackle this challenge, we introduced a novel bi-multivariate sparse canonical correlation association method based on summary statistics from large GWAS (S-SCCA). S-SCCA leverages effect sizes obtained from these datasets to identify genetic variants associated with complex traits, including those influenced by pleiotropy, while simultaneously identifying imaging factors related to the disease under study. Moreover, we have implemented a rapid optimization strategy to circumvent computational burdens while identifying disease-associated risk factors within genetic variations across the entire chromosome. We assessed S-SCCA against conventional SCCA using a neuroimaging genetic dataset from the Alzheimer's Disease Neuroimaging Initiative. Results showed that S-SCCA demonstrated comparable or superior modeling performance and feature selection capabilities. Furthermore, we applied S-SCCA to two summary statistics datasets from two large GWAS, where original imaging and genetic data were inaccessible. S-SCCA replicated the genetic loci identified by GWAS and additional meaningful variants. Additionally, it revealed bi-multivariate relationships between imaging QTs and SNPs, indicating its powerful modeling capability. These findings highlight the promise of S-SCCA as a practical bi-multivariate learning technique in brain imaging genetics, circumventing the need for sensitive individual-level imaging and genetic data, thereby enhancing its potential for broader applicability and accessibility in biomedical studies.
AB - Brain imaging genetics offers insights into the genetic basis of brain structure and function by exploring the relationships between genetic variations and neuroimaging features, with canonical correlation association learning as a vital and effective tool. However, imaging large cohorts affected by specific brain diseases entails significant costs. To tackle this challenge, we introduced a novel bi-multivariate sparse canonical correlation association method based on summary statistics from large GWAS (S-SCCA). S-SCCA leverages effect sizes obtained from these datasets to identify genetic variants associated with complex traits, including those influenced by pleiotropy, while simultaneously identifying imaging factors related to the disease under study. Moreover, we have implemented a rapid optimization strategy to circumvent computational burdens while identifying disease-associated risk factors within genetic variations across the entire chromosome. We assessed S-SCCA against conventional SCCA using a neuroimaging genetic dataset from the Alzheimer's Disease Neuroimaging Initiative. Results showed that S-SCCA demonstrated comparable or superior modeling performance and feature selection capabilities. Furthermore, we applied S-SCCA to two summary statistics datasets from two large GWAS, where original imaging and genetic data were inaccessible. S-SCCA replicated the genetic loci identified by GWAS and additional meaningful variants. Additionally, it revealed bi-multivariate relationships between imaging QTs and SNPs, indicating its powerful modeling capability. These findings highlight the promise of S-SCCA as a practical bi-multivariate learning technique in brain imaging genetics, circumventing the need for sensitive individual-level imaging and genetic data, thereby enhancing its potential for broader applicability and accessibility in biomedical studies.
KW - Brain imaging genetics
KW - GWAS summary statistics
KW - biomarker identification
KW - genotype-phenotype correlation
UR - http://www.scopus.com/inward/record.url?scp=85217276896&partnerID=8YFLogxK
U2 - 10.1109/BIBM62325.2024.10822054
DO - 10.1109/BIBM62325.2024.10822054
M3 - 会议稿件
AN - SCOPUS:85217276896
T3 - Proceedings - 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024
SP - 1244
EP - 1249
BT - Proceedings - 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024
A2 - Cannataro, Mario
A2 - Zheng, Huiru
A2 - Gao, Lin
A2 - Cheng, Jianlin
A2 - de Miranda, Joao Luis
A2 - Zumpano, Ester
A2 - Hu, Xiaohua
A2 - Cho, Young-Rae
A2 - Park, Taesung
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2024
Y2 - 3 December 2024 through 6 December 2024
ER -