Lingo Algoritmasının Kümelerle İlişkili Dokümanların Belirlenmesi ve Küme Etiketlerinin Çıkarılması Aşamalarının İyileştirilmesi
Abstract
Search Results Clustering (SRC) algorithms are developed so that users can reach to the results that they search for easier. A good SRC algorithm is expected to correclty cluster the search results, and also to be able to generate representative, understandable and meaningful cluster labels for the produced clusters.The Lingo algorithm is a popular SRC algorithm that notice both two criterions. It is able to generate successful cluster labels as expected; however, it has some shortcomings about determining the cluster contents. As a consequence of its cluster content assignment strategy, semantically relevant documents that do not contain the terms of the cluster labels could not be assigned to the related clusters. Moreover, the method that is used to select final cluster labels results in clusters containing small number of relevant results.