Please use this identifier to cite or link to this item: https://research.matf.bg.ac.rs/handle/123456789/2339
DC Field | Value | Language
dc.contributor.author | Kovačević, Jovana | en_US
dc.contributor.author | Graovac, Jelena | en_US
dc.date.accessioned | 2025-08-20T17:05:34Z | -
dc.date.available | 2025-08-20T17:05:34Z | -
dc.date.issued | 2016 | -
dc.identifier.uri | https://research.matf.bg.ac.rs/handle/123456789/2339 | -
dc.description.abstract | The paper presents classification results for a hierarchically organized document corpus in Serbian, obtained using the Support Vector Machine (SVM) method. Two techniques derived from the SVM with structural output have been applied: multiclass flat classification and hierarchical classification. The common representation model of a document and the class (or hierarchy of classes) it belongs to, which is specific to this form of the SVM method, is based on byte n-grams of different lengths. Four tf-idf statistics were used to define the significance of an n-gram for a specific document. The techniques and statistics described have been tested on a hierarchically structured subset of the Ebart corpus of newspaper texts. The results obtained for the two types of classifiers are similar for the corpus as a whole, while the hierarchical classifier performs better for most specific classes with a small number of texts. | en_US
dc.publisher | Beograd : Filološki fakultet | en_US
dc.publisher | Beograd : Univerzitetska biblioteka "Svetozar Marković" | en_US
dc.publisher | Beograd : Zajednica biblioteka univerziteta u Srbiji | en_US
dc.relation.ispartof | Infotheca - Journal for Digital Humanities | en_US
dc.subject | hierarchical text classification | en_US
dc.subject | Support Vector Machine Method | en_US
dc.subject | Ebart corpus | en_US
dc.title | Application of a Structural Support Vector Machine Method to N-gram Based Text Classification in Serbian | en_US
dc.title.alternative | Н-грамски заснована класификација текста на српском језику применом методе структуралних подржавајућих вектора | en_US
dc.type | Article | en_US
dc.identifier.doi | 10.18485/infotheca.2016.16.1_2.1 | -
dc.contributor.affiliation | Informatics and Computer Science | en_US
dc.contributor.affiliation | Informatics and Computer Science | en_US
dc.relation.issn | 1450-9687 | en_US
dc.description.rank | M53 | en_US
dc.relation.firstpage | 5 | en_US
dc.relation.lastpage | 23 | en_US
dc.relation.volume | 16 | en_US
dc.relation.issue | 1-2 | en_US
item.grantfulltext | none | -
item.fulltext | No Fulltext | -
item.openairecristype | http://purl.org/coar/resource_type/c_18cf | -
item.cerifentitytype | Publications | -
item.openairetype | Article | -
crisitem.author.dept | Informatics and Computer Science | -
crisitem.author.dept | Informatics and Computer Science | -
crisitem.author.orcid | 0000-0002-0242-2472 | -
crisitem.author.orcid | 0000-0002-9323-4695 | -
Appears in Collections: Research outputs

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.