dc.contributor.author | Cosgun, Erdal | |
dc.contributor.author | Limdi, Nita A. | |
dc.contributor.author | Duarte, Christine W. | |
dc.date.accessioned | 2019-12-10T11:25:04Z | |
dc.date.available | 2019-12-10T11:25:04Z | |
dc.date.issued | 2011 | |
dc.identifier.issn | 1367-4803 | |
dc.identifier.uri | https://doi.org/10.1093/bioinformatics/btr159 | |
dc.identifier.uri | http://hdl.handle.net/11655/15690 | |
dc.description.abstract | Motivation: With complex traits and diseases having potential genetic contributions of thousands of genetic factors, and with current genotyping arrays consisting of millions of single nucleotide polymorphisms (SNPs), powerful high-dimensional statistical techniques are needed to comprehensively model the genetic variance. Machine learning techniques have many advantages including lack of parametric assumptions, and high power and flexibility. Results: We have applied three machine learning approaches: Random Forest Regression (RFR), Boosted Regression Tree (BRT) and Support Vector Regression (SVR) to the prediction of warfarin maintenance dose in a cohort of African Americans. We have developed a multi-step approach that selects SNPs, builds prediction models with different subsets of selected SNPs along with known associated genetic and environmental variables and tests the discovered models in a cross-validation framework. Preliminary results indicate that our modeling approach gives much higher accuracy than previous models for warfarin dose prediction. A model size of 200 SNPs (in addition to the known genetic and environmental variables) gives the best accuracy. The R-2 between the predicted and actual square root of warfarin dose in this model was on average 66.4% for RFR, 57.8% for SVR and 56.9% for BRT. Thus RFR had the best accuracy, but all three techniques achieved better performance than the current published R-2 of 43% in a sample of mixed ethnicity, and 27% in an African American sample. In summary, machine learning approaches for high-dimensional pharmacogenetic prediction, and for prediction of clinical continuous traits of interest, hold great promise and warrant further research. | |
dc.language.iso | en | |
dc.publisher | Oxford Univ Press | |
dc.relation.isversionof | 10.1093/bioinformatics/btr159 | |
dc.rights | info:eu-repo/semantics/openAccess | |
dc.subject | Biochemistry & Molecular Biology | |
dc.subject | Biotechnology & Applied Microbiology | |
dc.subject | Computer Science | |
dc.subject | Mathematical & Computational Biology | |
dc.subject | Mathematics | |
dc.title | High-Dimensional Pharmacogenetic Prediction Of A Continuous Trait Using Machine Learning Techniques With Application To Warfarin Dose Prediction In African Americans | |
dc.type | info:eu-repo/semantics/article | |
dc.type | info:eu-repo/semantics/publishedVersion | |
dc.relation.journal | Bioinformatics | |
dc.contributor.department | Nöroloji | |
dc.identifier.volume | 27 | |
dc.identifier.issue | 10 | |
dc.identifier.startpage | 1384 | |
dc.identifier.endpage | 1389 | |
dc.description.index | WoS | |