The Effect of Named Entities on the Performance of the Story Link Detection Task in Determining Turkish News Similarities
Abstract
This thesis evaluates the performance of the Story Link Detection (SLD) task, part of the Topic Detection and Tracking (TDT) program, using different similarity functions. It also tests how combinations of these functions perform on named entities and identifies the combination that yields the best precision and recall values. To do this, we use the Vector Space Model (VSM), whose performance is well established in TDT studies, as the main method, and we evaluate the impact of named entities on VSM performance. To test the methods, we use the BilCOL-2005 corpus after tagging its named entities, which answer questions such as who, where, and when, with eight labels (who, where, when, organization, money, percentage, date, unknown).
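As a rough illustration of the approach described above, the following minimal sketch compares two stories in a vector space and blends a full-text similarity with a named-entity similarity. It is an assumption-laden sketch, not the thesis's implementation: the TF-IDF weighting, the cosine function, the linear blend with weight `alpha`, and the decision `threshold` are all hypothetical choices made for illustration.

```python
import math
from collections import Counter


def tf_idf_vectors(docs):
    """Build one sparse TF-IDF vector (a dict) per tokenized document.

    Assumption: weight = term frequency * log(1 + N / document frequency).
    """
    n = len(docs)
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))  # count each term once per document
    vectors = []
    for tokens in docs:
        tf = Counter(tokens)
        vectors.append({t: c * math.log(1 + n / df[t]) for t, c in tf.items()})
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v[t] for t, w in u.items() if t in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def combined_similarity(text_sim, entity_sim, alpha=0.5):
    """Hypothetical linear blend of full-text and named-entity similarity."""
    return alpha * text_sim + (1 - alpha) * entity_sim


def is_linked(similarity, threshold=0.2):
    """Declare two stories linked when similarity reaches the threshold.

    The threshold value is an illustrative assumption, not a tuned result.
    """
    return similarity >= threshold
```

In use, one would build one set of vectors from all tokens and another from only the tagged named entities, compute `cosine` on each pair of stories, and pass both scores to `combined_similarity` before applying `is_linked`. Varying `alpha` and `threshold` is one simple way to trade precision against recall.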