Use of Machine Learning Approaches for the Drug Development Problem in Bioinformatics

Semerci, Tuğçe

dc.contributor.advisor	Aladağ, Çağdaş Hakan
dc.contributor.author	Semerci, Tuğçe
dc.date.accessioned	2023-12-12T11:51:08Z
dc.date.issued	2023
dc.date.submitted	2023-06-15
dc.identifier.citation	APA	tr_TR
dc.identifier.uri	https://hdl.handle.net/11655/34291
dc.description.abstract	People are at the center of the drug research and development process, which aims to help patients overcome illness and improve their quality of life. Innovative drugs are expected to be effective, reliable treatments that reach patients as quickly as possible. However, discovering a drug and bringing it into medical use is time-consuming and costly. In recent years, advances in information technologies and bioinformatics-based applications have made it possible to move this process to the clinical stage faster and at lower cost. This thesis aims to identify molecules that could be drug candidates for the treatment of Type-2 diabetes, using DPP-4 inhibitors and machine learning approaches. Data obtained from the ChEMBL database were analyzed with 10 machine learning algorithms and an artificial neural network model. Model performance was compared using the Root Mean Square Error (RMSE) criterion. The approaches that produced the best predictions were Random Forest and a single-layer feedforward neural network, and the two methods gave similar predictive results. The Random Forest model was chosen as the optimum model because it performed better according to RMSE, the most common criterion in the literature. The results of this study indicate that the Random Forest approach produces good results in identifying molecules that could be drug candidates for the treatment of Type-2 diabetes.	tr_TR
dc.language.iso	tur	tr_TR
dc.publisher	Fen Bilimleri Enstitüsü	tr_TR
dc.rights	info:eu-repo/semantics/openAccess	tr_TR
dc.subject	Machine learning	tr_TR
dc.subject	QSAR	tr_TR
dc.subject	Drug discovery	tr_TR
dc.subject	Artificial neural networks	tr_TR
dc.subject	Dipeptidyl peptidase-4 inhibitors	tr_TR
dc.subject.lcsh	Statistics	tr_TR
dc.title	Makine Öğrenme Yaklaşımlarının Biyoinformatikte İlaç Geliştirme Probleminde Kullanılması	tr_TR
dc.type	info:eu-repo/semantics/masterThesis	tr_TR
dc.description.ozet	People are the focus of the drug research and development process. The aim is to help patients overcome their illness and to improve their quality of life. In drug development, innovative drugs are intended to be effective, reliable treatments that are made available to patients in the shortest possible time. However, discovering a drug and bringing it into medical use is time-consuming and costly. In recent years, the development of information technologies and bioinformatics-based applications has enabled progress in moving this process to the clinical stage quickly and at lower cost. This thesis aims to identify molecules that could be drug candidates for the treatment of Type-2 diabetes, using DPP-4 inhibitors and machine learning approaches. Data obtained from the ChEMBL database were analyzed with 10 machine learning algorithms and an artificial neural network model. The performance of the models was compared using the Root Mean Square Error (RMSE) criterion. The application showed that the machine learning approaches producing the best predictions were Random Forest and a single-layer feedforward neural network. These two methods were observed to give prediction results close to each other. In evaluating model performance, the Random Forest model was selected as the optimum model because it performed better according to the root mean square error, the most common criterion in the literature. According to the results of this study, the Random Forest approach produces good results in identifying molecules that could be drug candidates for the treatment of Type-2 diabetes.	tr_TR
dc.contributor.department	Statistics	tr_TR
dc.embargo.terms	Open access	tr_TR
dc.embargo.lift	2023-12-12T11:51:08Z
dc.funding	None	tr_TR

Files in this item:

Name: TUĞÇE SEMERCİ.pdf
Size: 1.592Mb
Format: PDF
Description: Master's Thesis

This item appears in the following collection(s).

Department of Statistics Thesis Collection [130]