Please use this identifier to cite or link to this item: https://research.matf.bg.ac.rs/handle/123456789/3218
DC FieldValueLanguage
dc.contributor.authorŠošić, Milenaen_US
dc.contributor.authorGraovac, Jelenaen_US
dc.contributor.authorStanković, Rankaen_US
dc.date.accessioned2026-03-18T17:57:34Z-
dc.date.available2026-03-18T17:57:34Z-
dc.date.issued2026-03-01-
dc.identifier.issn1574020X-
dc.identifier.urihttps://research.matf.bg.ac.rs/handle/123456789/3218-
dc.description.abstractThis article introduces a methodology for developing the first emotional affect lexicon for the Serbian language. The proposed methodology involves leveraging a Large Language Model (LLM), specifically the GPT-3-based gpt-3.5-turbo and GPT-4-based gpt-4.1 models, in conjunction with the Serbian WordNet language resource to align the English lexicon with Serbian-specific morphological and linguistic characteristics. The effectiveness of the Serbian emotion lexicon (EmoLex.SR), comprising 13,584 affective words, has been validated through emotion detection experiments using emotion-annotated corpora. The experiments demonstrated outstanding performance compared to the NRC lexicon automatically translated into Serbian, achieving a macro F1 score of 74.4% for sentences written in Serbian. In particular, the lexicon outperforms its automatically translated counterpart in detecting emotional categories across three distinct datasets, with an average improvement by 14.7% in terms of macro F1 score. The development of the EmoLex.SR lexicon and the accompanying annotated parallel corpora, referred to as LLM-Emo.SR, extends the emotion detection capabilities for Serbian language processing. This enables a more accurate interpretation of emotions in Serbian text and enhances Natural Language Processing applications for the Serbian language. Although the methodology for creating the lexicon is demonstrated for Serbian, it can also be successfully applied to other languages. The lexicon is made publicly available to the scientific community for use and further refinement.en_US
dc.language.isoenen_US
dc.publisherSpringeren_US
dc.relation.ispartofLanguage Resources and Evaluationen_US
dc.subjectAffecten_US
dc.subjectEmotionsen_US
dc.subjectLexiconsen_US
dc.subjectSerbianen_US
dc.subjectWordNeten_US
dc.titleBuilding an emotion lexicon for Serbian using curated language resourcesen_US
dc.typeArticleen_US
dc.identifier.doi10.1007/s10579-025-09894-5-
dc.identifier.scopus2-s2.0-105027264089-
dc.identifier.isi001655357900002-
dc.identifier.urlhttps://api.elsevier.com/content/abstract/scopus_id/105027264089-
dc.contributor.affiliationInformatics and Computer Scienceen_US
dc.relation.issn1574-020Xen_US
dc.description.rankM22en_US
dc.relation.firstpageArticle no. 9en_US
dc.relation.volume60en_US
dc.relation.issue1en_US
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.languageiso639-1en-
item.openairetypeArticle-
item.cerifentitytypePublications-
item.grantfulltextnone-
item.fulltextNo Fulltext-
crisitem.author.deptInformatics and Computer Science-
crisitem.author.orcid0000-0002-9323-4695-
Appears in Collections:Research outputs
Show simple item record

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.