Selectıve Personalızatıon Usıng Topıcal User Profıle To Improve Search Results

Karimi Mansoub, Samira

dc.contributor.advisor	Çiçekli, İlyas
dc.contributor.author	Karimi Mansoub, Samira
dc.date.accessioned	2021-01-04T11:48:06Z
dc.date.issued	2020-10-02
dc.date.submitted	2020-10-02
dc.identifier.uri	http://hdl.handle.net/11655/23220
dc.description.abstract	Personalization is a technique used in Web search engines to improve the effectiveness of in- formation retrieval systems. In the field of personalized web search has recently been doing a lot of research and applications. In this research, we evaluate the effect of personalization for queries with different characteristics. With this analysis, the question of whether per- sonalization should be applied for all queries in the same way or not is investigated. While personalizing some queries yields significant improvements on user experience by providing a ranking inline with the user preferences, it fails to improve or even degrades the effective- ness for less ambiguous queries. A potential for personalization metric can improve search engines by selectively applying a personalization. Current methods for estimating the potential for personalization such as click entropy and topic entropy are based on the clicked document for query or query history. They have limitations like unavailability of the prior clicked data for new and unseen queries or queries without history. In this thesis, the topic entropy measure is improved by integrating the user distribution to the metric, robust to the sparsity problem. This metric estimates the potential ifor personalization using a topical user profile created on user documents. In this way, we can overcome the cold start problem to estimate the potential for new queries and increase the accuracy of estimates for queries with history. Although in this thesis the main focus is on topic-based user profiles, since there is not more research on keyphrase-based user profiles in the process of personalization, we do a comparison research between keyphrase-based and topic-based profiles. We examine how personalization can be integrated into the state of the art keyphrase extraction models by considering different models of supervised and unsupervised methods. We evaluate topic- based and keyphrase-based user profiles using a re-ranking algorithm to complete the process of personalization using different datasets. In personalization using keyphrase-based profiles, personalized models based on supervised keyphrase extraction approaches obtained more accuracy by 7% than unsupervised approaches however it does not improve compared to topic-based models. In topic-based models, we use a combination of personalization in the level of user-specified and group profiling as part of the ranking process. In the previous ranking methods, more improvement in ranking is for the queries which match the user’s history. To take advantage of ranking for all queries, we present a group personalized topical model(GPTM) that uses groups obtained from clustered similar users on topical profiles. Experiments reveal that the proposed potential prediction method correlates with human query ambiguity judgments and group profiles based ranking method improve the Mean Reciprocal Rank by 8%.	tr_TR
dc.language.iso	en
dc.publisher	Fen Bilimleri Enstitüsü	tr_TR
dc.rights	info:eu-repo/semantics/openAccess	tr_TR
dc.subject	User Search Behavior	tr_TR
dc.subject	Topic-based User Profile	tr_TR
dc.subject	Keyphrase-based user profile	tr_TR
dc.subject	Personalized web search	tr_TR
dc.subject	Latent dirichlet allocation	tr_TR
dc.subject.lcsh	Bilgisayar mühendisliği	tr_TR
dc.title	Selectıve Personalızatıon Usıng Topıcal User Profıle To Improve Search Results	tr_TR
dc.title.alternative	Arama Sonuçlarını iyileştirmek için Konu Kullanıcı Profilini Kullanarak Seçici Kişiselleştirme
dc.type	info:eu-repo/semantics/doctoralThesis	tr_TR
dc.description.ozet	Kişiselleştirme tekniği, bilgi erişim sistemininin etkinliğini iyileştirmek için web arama mo- torlarında kullanılan bir tekniktir. Kişiselleştirilmiş web araması alanında son günlerde sıkça araştırma ve uygulama yapılmaktadır. Bu araştırmada öncelikle farklı karakteristik- lere sahip sorgularda kişiselleştirmenin etkisini ölçüyoruz. Bu analiz ile kişiselleştirmenin tüm sorgulara aynı biçimde uygulanıp uygulanmayacağı araştırılmaktadır. Bazı sorguların kişiselleştirilmesi, kullanıcı tercihleri ile aynı öncelikte bir derecelendirme yapıldığı takdirde kullanıcı deneyiminde önemli iyileştirmeler sağlarken, belirsiz sorguların etkinliğini iyileştirmez ve hatta bazen azalmasına neden olur. Kişiselleştirme metriklerinin potansiyeli, kişiselleştirmeyi seçilimli bir şekilde uygulayarak arama motorlarını iyileştirebilir. Kişiselleştirme potansiyelini tahmin etmek için kullanılan ”tıklama entropisi” ve ”konu en- tropisi” gibi mevcut yöntemler, sorgu ve sorgu geçmişi için tıklanan dokümanlara dayalıdırlar. Bu yöntemlerin, önceki tıklanmış verilerin kullanışsızlığı gibi yeni ve görünmeyen sorgular veya geçmişi olmayan sorgular için bir takım sınırları mevcuttur. iiiBu tezde, konu entropisi ölçümü, kullanıcı dağılımını metriğe entegre ederek seyreklik prob- lemine dayanıklı olacak şekilde iyileştirildi. Bu metrik kullanıcı dökümanları üzerinde oluşturulan konusal kullanıcı profillerini kullanarak kişiselleştirme potansiyelini tahmin eder. Bu yolla yeni sorguların potansiyelini tahmin etmek ve geçmişe sahip sorguların doğruluğunu artırmak için soğuk başlangıç probleminin üstesinden gelebiliriz. Bu tezde her ne kadar ana odak noktası konu tabanlı kullanıcı profilleri olsa da anahtar kelime tabanlı kullanıcı profilleri hakkında çok araştırma olmadığı için, anahtar kelimeye dayalı profiller ve konu tabanlı profiller arasında bir karşılaştırma yapıyoruz. Denetimli ve denetimsiz yöntemlerin farklı modellerini göz önünde bulundurarak, kişiselleştirmenin ustalık derecesinde anahtar kelime çıkartma modellerine nasıl entegre edilebileceğini inceliy- oruz. Farklı veri kümelerini kullanarak kişiselleştirme sürecini tamamlamak için yeniden derecelendirme algoritmaları kullanılarak anahtar kelime ve konu tabanlı kullanıcı profil- lerini değerlendiriyoruz. Anahtar kelime tabanlı kişiselleştirmede, denetimli anahtar ke- lime çıkartma yaklaşımı %7 oranında iyileştirme sağlarken, konu tabanlı kişiselleştirmede bir iyileştirme sağlamamaktadır. Konu tabanlı modellerde, derecelendirme sürecinin bir bölümü olarak, kullanıcı tarafından belirlenen ve grup profili seviyesinde kişiselleştirme kombinasyonu kullanırız. Önceki dere- celendirme yöntemlerinde iyileştirmeler, kullanıcının geçmişi ile eşleşen sorgular içindir. Derecelendirmenin avantajlarını tüm sorgularda kullanmak için, konusal profiller üzerinde kümelenen benzer kullanıcılardan elde edilmiş grupları kullanan grup kişiselleştirmiş konusal modelini (GPTM) tanıtıyoruz. Deneyler, potansiyel tahmin yönteminin insan sorgularında belirsiz kararlar ile ilişkili olduğunu ve grup tabanlı derecelendirmenin ortalama karşılıklı derecelendirmeyi %8 artırdığını göstermektedir.	tr_TR
dc.contributor.department	Bilgisayar Mühendisliği	tr_TR
dc.embargo.terms	Acik erisim	tr_TR
dc.embargo.lift	2021-01-04T11:48:06Z
dc.funding	Yok	tr_TR
dc.subtype	software	tr_TR

Bu öğenin dosyaları:

Ad:: SELECTIVE PERSONALIZATION USING ...
Boyut:: 1.476Mb
Biçim:: PDF

Göster/Aç

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Bilgisayar Mühendisliği Bölümü Tez Koleksiyonu [253]

Basit öğe kaydını göster