Please use this identifier to cite or link to this item: https://research.matf.bg.ac.rs/handle/123456789/2378
Title: Prospective automated hierarchical classification of digitized documents
Authors: Kovačević, Jovana; Graovac, Jelena
Affiliations: Informatics and Computer Science; Informatics and Computer Science
Keywords: hierarchical text classification;structured support vector machine method;n-grams;Ebart
Issue Date: 2016
Rank: M53
Publisher: Beograd : Matematički fakultet
Journal: Pregled Nacionalnog centra za digitalizaciju
Abstract: 
The paper proposes a method for hierarchical classification of digitized documents in the NCD digital library. The classification model implements the Structured Support Vector Machine (SSVM) method, which has shown excellent performance on the Ebart corpus of documents in the Serbian language. We describe the developed model and its results on the Ebart dataset, suggest two types of class hierarchies for the NCD library based on its content, and define a protocol for applying the method to digitized documents.
URI: https://research.matf.bg.ac.rs/handle/123456789/2378
Appears in Collections: Research outputs

