Please use this identifier to cite or link to this item:
https://research.matf.bg.ac.rs/handle/123456789/1652
Title: | The Mysterious Letter J | Authors: | Zečević, Andjelka Vujičić Stanković, Staša |
Affiliations: | Informatics and Computer Science | Issue Date: | 2013 | Rank: | M33 | Publisher: | Šumen : Inkoma | Related Publication(s): | Proceedings of the Workshop on Adaptation of Language Resources and Tools for Closely Related Languages and Language Variants | Abstract: | Ekavian and Ijekavian are two different variants of the contemporary standard Serbian language. The difference between them is related to the reflex of the old Slavic vowel jat and it influences both the speaking and writing language norms. The sensibility of existing language identification tools for both variants is of great importance for building representative corpora and development of relevant linguistics resources and tools underlying an automatic text processing. In this paper we present the results obtained after testing the three popular tools for language identification on corpora containing documents from each of the two variants. As it will be reported, the identification of Ijekavian variant is a much more difficult task since the observed tools are not adopted to it at all. |
URI: | https://research.matf.bg.ac.rs/handle/123456789/1652 | ISSN: | 978-954-452-026-7 |
Appears in Collections: | Research outputs |
Show full item record
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.