Genome-wide analysis of genetic associations for prediction of polygenic hypercholesterolemia with Bayesian networks
Journal of computational and engineering mathematics, Tome 2 (2015) no. 4, pp. 11-26

Voir la notice de l'article provenant de la source Math-Net.Ru

The genome-wide analysis of genetic associations with lipid metabolism indicators was carried out using the technology of Bayesian networks (BN). It was performed to diagnose polygenic hypercholesterolemia on the basis of genetic data of the Russian population of patients. The data of 1,200 patients was analyzed. 196725 SNPs as well as clinical data, lipid profile indicators — different types of cholesterol — were obtained for each of them. The genome-wide association analysis (GWAS) and the statistical method of Pearson's chi-squared test were used for the initial selection of the most significant parameters. Two of the patient states related to a lipid metabolism were studied. These states are the level of LDL-C (low density lipoprotein) and the level of HDL-C (high density lipoprotein). The Bayesian networks having the simplest topology — naive — were used to predict the level of lipoprotein. The construction of ROC-curves and the calculation of the area under these curves (AUC) were used to assess a quality (reliability) of the prediction. AUC value increased from 0,5 for the initial BN to 0,9 after selecting of significant parameters using the GWAS method or the Pearson one. A further increase in AUC to 0,99 and decrease in the number of prognostic parameters to 150 was performed using Bayesian network optimization with respect to the number of parameters-nodes. Here the optimized function was value of AUC. The ambiguity of obtaining prognostic parameters at various ways of initial reducing the number of network nodes using the methods of GWAS and Pirson is shown. Low values of AUC were obtained for an independent control group of patients, despite very good results on the quality of the predictions, which were obtained on the training set. Further application of the proposed methodology is possible after the substantial reduction of the number of SNPs on the base of the analysis of the respective molecular mechanisms.
Keywords: GWAS; LDL-C; HDL-C; SNP; Bayesian networks.
@article{JCEM_2015_2_4_a1,
     author = {A. V. Sulimov and A. N. Meshkov and I. A. Savkin and E. V. Katkova and D. K. Kutov and Z. B. Hasanova and N. V. Konovalova and V. V. Kukharchuk and V. B. Sulimov},
     title = {Genome-wide analysis of genetic associations for prediction of polygenic hypercholesterolemia with {Bayesian} networks},
     journal = {Journal of computational and engineering mathematics},
     pages = {11--26},
     publisher = {mathdoc},
     volume = {2},
     number = {4},
     year = {2015},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/JCEM_2015_2_4_a1/}
}
TY  - JOUR
AU  - A. V. Sulimov
AU  - A. N. Meshkov
AU  - I. A. Savkin
AU  - E. V. Katkova
AU  - D. K. Kutov
AU  - Z. B. Hasanova
AU  - N. V. Konovalova
AU  - V. V. Kukharchuk
AU  - V. B. Sulimov
TI  - Genome-wide analysis of genetic associations for prediction of polygenic hypercholesterolemia with Bayesian networks
JO  - Journal of computational and engineering mathematics
PY  - 2015
SP  - 11
EP  - 26
VL  - 2
IS  - 4
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/JCEM_2015_2_4_a1/
LA  - en
ID  - JCEM_2015_2_4_a1
ER  - 
%0 Journal Article
%A A. V. Sulimov
%A A. N. Meshkov
%A I. A. Savkin
%A E. V. Katkova
%A D. K. Kutov
%A Z. B. Hasanova
%A N. V. Konovalova
%A V. V. Kukharchuk
%A V. B. Sulimov
%T Genome-wide analysis of genetic associations for prediction of polygenic hypercholesterolemia with Bayesian networks
%J Journal of computational and engineering mathematics
%D 2015
%P 11-26
%V 2
%N 4
%I mathdoc
%U http://geodesic.mathdoc.fr/item/JCEM_2015_2_4_a1/
%G en
%F JCEM_2015_2_4_a1
A. V. Sulimov; A. N. Meshkov; I. A. Savkin; E. V. Katkova; D. K. Kutov; Z. B. Hasanova; N. V. Konovalova; V. V. Kukharchuk; V. B. Sulimov. Genome-wide analysis of genetic associations for prediction of polygenic hypercholesterolemia with Bayesian networks. Journal of computational and engineering mathematics, Tome 2 (2015) no. 4, pp. 11-26. http://geodesic.mathdoc.fr/item/JCEM_2015_2_4_a1/