Please use this identifier to cite or link to this item: https://research.matf.bg.ac.rs/handle/123456789/1652
Title: The Mysterious Letter J
Authors: Zečević, Andjelka
Vujičić Stanković, Staša 
Affiliations: Informatics and Computer Science 
Issue Date: 2013
Rank: M33
Publisher: Šumen : Inkoma
Related Publication(s): Proceedings of the Workshop on Adaptation of Language Resources and Tools for Closely Related Languages and Language Variants
Abstract: 
Ekavian and Ijekavian are two different variants of the contemporary standard Serbian language. The difference between them lies in the reflex of the Old Slavic vowel jat, and it affects both the spoken and written language norms. The sensitivity of existing language identification tools to both variants is of great importance for building representative corpora and for developing the linguistic resources and tools that underlie automatic text processing. In this paper we present the results obtained by testing three popular language identification tools on corpora containing documents in each of the two variants. As will be reported, identifying the Ijekavian variant is a much more difficult task, since the observed tools are not adapted to it at all.
URI: https://research.matf.bg.ac.rs/handle/123456789/1652
ISBN: 978-954-452-026-7
Appears in Collections: Research outputs
