TY - JOUR
T1 - Evaluation of genotype-based gene expression model performance
T2 - a cross-framework and cross-dataset study
AU - Tavares, Vânia
AU - Monteiro, Joana
AU - Vassos, Evangelos
AU - Coleman, Jonathan
AU - Prata, Diana
N1 - info:eu-repo/grantAgreement/FCT/OE/PD%2FBD%2F114460%2F2016/PT#
info:eu-repo/grantAgreement/FCT/3599-PPCDT/DSAIPA%2FDS%2F0065%2F2018/PT#
info:eu-repo/grantAgreement/FCT/3599-PPCDT/DSAIPA%2FDS%2F0065%2F2018/PT#
G0901254
G0802462
R01MH085542
R01MH093725
P50MH066392
P50MH080405
R01MH097276
RO1-MH-075916
P50M096891
P50MH084053S1
R37MH057881
AG02219
AG05138
MH06692
R01MH110921
R01MH109677
R01MH109897
U01MH103392
HHSN271201300031C
FCT-IF/00787/2014
LISBOA-01–0145-FEDER-030907
FP7-PEOPLE-2013-CIG 631952
292/16
PY - 2021/10
Y1 - 2021/10
N2 - Predicting gene expression from genotyped data is valuable for studying inaccessible tissues such as the brain. Herein we present eGenScore, a polygenic/poly-variation method, and compare it with PrediXcan, a method based on regularized linear regression using elastic nets. While both methods have the same purpose of predicting gene expression based on genotype, they carry important methodological differences. We compared the performance of expression quantitative trait loci (eQTL) models to predict gene expression in the frontal cortex, comparing across these frameworks (eGenScore vs. PrediXcan) and training datasets (BrainEAC, which is brain-specific, vs. GTEx, which has data across multiple tissues). In addition to internal five-fold cross-validation, we externally validated the gene expression models using the CommonMind Consortium database. Our results showed that (1) PrediXcan outperforms eGenScore regardless of the training database used; and (2) when using PrediXcan, the performance of the eQTL models in frontal cortex is higher when trained with GTEx than with BrainEAC.
AB - Predicting gene expression from genotyped data is valuable for studying inaccessible tissues such as the brain. Herein we present eGenScore, a polygenic/poly-variation method, and compare it with PrediXcan, a method based on regularized linear regression using elastic nets. While both methods have the same purpose of predicting gene expression based on genotype, they carry important methodological differences. We compared the performance of expression quantitative trait loci (eQTL) models to predict gene expression in the frontal cortex, comparing across these frameworks (eGenScore vs. PrediXcan) and training datasets (BrainEAC, which is brain-specific, vs. GTEx, which has data across multiple tissues). In addition to internal five-fold cross-validation, we externally validated the gene expression models using the CommonMind Consortium database. Our results showed that (1) PrediXcan outperforms eGenScore regardless of the training database used; and (2) when using PrediXcan, the performance of the eQTL models in frontal cortex is higher when trained with GTEx than with BrainEAC.
KW - Expression quantitative trait loci
KW - Gene expression
KW - Genome wide association study
KW - Polygenic score
KW - Transcriptome
UR - http://www.scopus.com/inward/record.url?scp=85116105079&partnerID=8YFLogxK
U2 - 10.3390/genes12101531
DO - 10.3390/genes12101531
M3 - Article
C2 - 34680927
AN - SCOPUS:85116105079
SN - 0920-8569
VL - 12
JO - Genes
JF - Genes
IS - 10
M1 - 1531
ER -