High-Dimensional Pharmacogenetic Prediction Of A Continuous Trait Using Machine Learning Techniques With Application To Warfarin Dose Prediction In African Americans

Cosgun, Erdal; Limdi, Nita A.; Duarte, Christine W.

dc.contributor.author	Cosgun, Erdal
dc.contributor.author	Limdi, Nita A.
dc.contributor.author	Duarte, Christine W.
dc.date.accessioned	2019-12-10T11:25:04Z
dc.date.available	2019-12-10T11:25:04Z
dc.date.issued	2011
dc.identifier.issn	1367-4803
dc.identifier.uri	https://doi.org/10.1093/bioinformatics/btr159
dc.identifier.uri	http://hdl.handle.net/11655/15690
dc.description.abstract	Motivation: With complex traits and diseases having potential genetic contributions of thousands of genetic factors, and with current genotyping arrays consisting of millions of single nucleotide polymorphisms (SNPs), powerful high-dimensional statistical techniques are needed to comprehensively model the genetic variance. Machine learning techniques have many advantages including lack of parametric assumptions, and high power and flexibility. Results: We have applied three machine learning approaches: Random Forest Regression (RFR), Boosted Regression Tree (BRT) and Support Vector Regression (SVR) to the prediction of warfarin maintenance dose in a cohort of African Americans. We have developed a multi-step approach that selects SNPs, builds prediction models with different subsets of selected SNPs along with known associated genetic and environmental variables and tests the discovered models in a cross-validation framework. Preliminary results indicate that our modeling approach gives much higher accuracy than previous models for warfarin dose prediction. A model size of 200 SNPs (in addition to the known genetic and environmental variables) gives the best accuracy. The R-2 between the predicted and actual square root of warfarin dose in this model was on average 66.4% for RFR, 57.8% for SVR and 56.9% for BRT. Thus RFR had the best accuracy, but all three techniques achieved better performance than the current published R-2 of 43% in a sample of mixed ethnicity, and 27% in an African American sample. In summary, machine learning approaches for high-dimensional pharmacogenetic prediction, and for prediction of clinical continuous traits of interest, hold great promise and warrant further research.
dc.language.iso	en
dc.publisher	Oxford Univ Press
dc.relation.isversionof	10.1093/bioinformatics/btr159
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	Biochemistry & Molecular Biology
dc.subject	Biotechnology & Applied Microbiology
dc.subject	Computer Science
dc.subject	Mathematical & Computational Biology
dc.subject	Mathematics
dc.title	High-Dimensional Pharmacogenetic Prediction Of A Continuous Trait Using Machine Learning Techniques With Application To Warfarin Dose Prediction In African Americans
dc.type	info:eu-repo/semantics/article
dc.type	info:eu-repo/semantics/publishedVersion
dc.relation.journal	Bioinformatics
dc.contributor.department	Nöroloji
dc.identifier.volume	27
dc.identifier.issue	10
dc.identifier.startpage	1384
dc.identifier.endpage	1389
dc.description.index	WoS

Bu öğenin dosyaları:

Ad:: licence.txt
Boyut:: 265bytes
Biçim:: Text file

Göster/Aç

Ad:: 3132.pdf
Boyut:: 248.8Kb
Biçim:: PDF

Göster/Aç

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Dahili Tıp Bilimleri Bölümü Makale Koleksiyonu [2912]

Basit öğe kaydını göster