Basit öğe kaydını göster

dc.contributor.authorCan, Burcu
dc.date.accessioned2019-12-13T07:41:20Z
dc.date.available2019-12-13T07:41:20Z
dc.date.issued2017
dc.identifier.issn1300-0632
dc.identifier.urihttps://doi.org/10.3906/elk-1605-216
dc.identifier.urihttp://hdl.handle.net/11655/18811
dc.description.abstractOne morpheme may have several surface forms that correspond to allomorphs. In English, ed and d are surface forms of the past tense morpheme, and s, es, and ies are surface forms of the plural or present tense morpheme. Turkish has a large number of allomorphs due to its morphophonemic processes. One morpheme can have tens of different surface forms in Turkish. This leads to a sparsity problem in natural language processing tasks in Turkish. Detection of allomorphs has not been studied much because of its difficulty. For example, tu and di are Turkish allomorphs (i.e. past tense morpheme), but all of their letters are different. This paper presents an unsupervised model to extract the allomorphs in Turkish. We are able to obtain an F-measure of 73.71% in the detection of allomorphs, and our model outperforms previous unsupervised models on morpheme clustering.
dc.language.isoen
dc.publisherTubitak Scientific & Technical Research Council Turkey
dc.relation.isversionof10.3906/elk-1605-216
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectComputer Science
dc.subjectEngineering
dc.titleUnsupervised Learning Of Allomorphs In Turkish
dc.typeinfo:eu-repo/semantics/article
dc.typeinfo:eu-repo/semantics/publishedVersion
dc.relation.journalTurkish Journal Of Electrical Engineering And Computer Sciences
dc.contributor.departmentElektrik Elektronik Mühendisliği
dc.identifier.volume25
dc.identifier.issue4
dc.identifier.startpage3253
dc.identifier.endpage3260
dc.description.indexWoS
dc.description.indexScopus


Bu öğenin dosyaları:

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster