Identifying diagnosis-specific genotype–phenotype associations via joint multitask sparse canonical correlation analysis and classification

dc.contributor.authorDu, Lei
dc.contributor.authorLiu, Fang
dc.contributor.authorLiu, Kefei
dc.contributor.authorYao, Xiaohui
dc.contributor.authorRisacher, Shannon L
dc.contributor.authorHan, Junwei
dc.contributor.authorGuo, Lei
dc.contributor.authorSaykin, Andrew J
dc.contributor.authorShen, Li
dc.contributor.departmentRadiology and Imaging Sciences, School of Medicineen_US
dc.date.accessioned2021-01-21T16:02:18Z
dc.date.available2021-01-21T16:02:18Z
dc.date.issued2020-07-13
dc.description.abstractMotivation Brain imaging genetics studies the complex associations between genotypic data such as single nucleotide polymorphisms (SNPs) and imaging quantitative traits (QTs). The neurodegenerative disorders usually exhibit the diversity and heterogeneity, originating from which different diagnostic groups might carry distinct imaging QTs, SNPs and their interactions. Sparse canonical correlation analysis (SCCA) is widely used to identify bi-multivariate genotype–phenotype associations. However, most existing SCCA methods are unsupervised, leading to an inability to identify diagnosis-specific genotype–phenotype associations. Results In this article, we propose a new joint multitask learning method, named MT–SCCALR, which absorbs the merits of both SCCA and logistic regression. MT–SCCALR learns genotype–phenotype associations of multiple tasks jointly, with each task focusing on identifying one diagnosis-specific genotype–phenotype pattern. Meanwhile, MT–SCCALR cannot only select relevant SNPs and imaging QTs for each diagnostic group alone, but also allows the selection of those shared by multiple diagnostic groups. We derive an efficient optimization algorithm whose convergence to a local optimum is guaranteed. Compared with two state-of-the-art methods, MT–SCCALR yields better or similar canonical correlation coefficients and classification performances. In addition, it owns much better discriminative canonical weight patterns of great interest than competitors. This demonstrates the power and capability of MTSCCAR in identifying diagnostically heterogeneous genotype–phenotype patterns, which would be helpful to understand the pathophysiology of brain disorders.en_US
dc.identifier.citationDu, L., Liu, F., Liu, K., Yao, X., Risacher, S. L., Han, J., Guo, L., Saykin, A. J., Shen, L., & for the Alzheimer’s Disease Neuroimaging Initiative. (2020). Identifying diagnosis-specific genotype–phenotype associations via joint multitask sparse canonical correlation analysis and classification. Bioinformatics, 36(Supplement_1), i371–i379. https://doi.org/10.1093/bioinformatics/btaa434en_US
dc.identifier.issn1367-4803en_US
dc.identifier.urihttps://hdl.handle.net/1805/24903
dc.language.isoen_USen_US
dc.publisherOxforden_US
dc.relation.isversionof10.1093/bioinformatics/btaa434en_US
dc.relation.journalBioinformaticsen_US
dc.rightsAttribution-NonCommercial 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/*
dc.sourcePMCen_US
dc.subjectBrain imaging geneticsen_US
dc.subjectsingle nucleotide polymorphismsen_US
dc.subjectimaging quantitative traitsen_US
dc.subjectSparse canonical correlation analysisen_US
dc.subjectgenotype–phenotype patternsen_US
dc.subjectbrain disordersen_US
dc.subjectpathophysiologen_US
dc.titleIdentifying diagnosis-specific genotype–phenotype associations via joint multitask sparse canonical correlation analysis and classificationen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
btaa434.pdf
Size:
474.65 KB
Format:
Adobe Portable Document Format
Description:
Main article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: