Please use this identifier to cite or link to this item:
https://research.matf.bg.ac.rs/handle/123456789/2378
Title: | Prospective automated hierarchical classification of digitized documents |
Authors: | Kovačević, Jovana; Graovac, Jelena |
Affiliations: | Informatics and Computer Science; Informatics and Computer Science |
Keywords: | hierarchical text classification; structured support vector machine method; n-grams; Ebart |
Issue Date: | 2016 |
Rank: | M53 |
Publisher: | Beograd : Matematički fakultet |
Journal: | Pregled Nacionalnog centra za digitalizaciju |
Abstract: | This paper proposes a method for hierarchical classification of the digitized documents in the NCD digital library. The classification model implements the Structured Support Vector Machine (SSVM) method, which has shown excellent performance on the Ebart corpus of documents in Serbian. We describe the developed model and its results on the Ebart dataset, suggest two types of class hierarchies for the NCD library based on its content, and define a protocol for applying the method to digitized documents. |
URI: | https://research.matf.bg.ac.rs/handle/123456789/2378 |
Appears in Collections: | Research outputs |