Please use this identifier to cite or link to this item:
https://research.matf.bg.ac.rs/handle/123456789/2626| Title: | Language-Independent Sentiment Polarity Detection in Movie Reviews: A Case Study of English and Spanish | Authors: | Graovac, Jelena Pavlović-Lažetić, Gordana |
Affiliations: | Informatics and Computer Science | Issue Date: | 2014 | Rank: | M33 | Publisher: | Ohrid : ICT Inovations | Related Publication(s): | Web proceedings of 6th ICT Inovations International Conference | Conference: | International Conference ICT Inovations (6 ; 2014 ; Ohrid) | Abstract: | We present a novel language-independent technique for determining polarity, positive or negative, of opinions expressed by different individuals. The technique is based on byte-level n-gram frequency statistics method for document representation, and a variant of k nearest neighbors (kNN) (for k = 1) machine learning algorithm for categorization process. The main advantages of the technique are its simplicity and full language and topic independence. For driving experiments we used corpora of movie reviews: Cornell polarity dataset in English and MuchoCine in Spanish. Experimental results (85.6% accuracy for English and 82.49% for Spanish corpora) confirm that the presented technique is comparable with the best ranked previously published techniques, when applied to movie reviews datasets. Still, it use no additional linguistic information nor external resources. |
URI: | https://research.matf.bg.ac.rs/handle/123456789/2626 |
| Appears in Collections: | Research outputs |
Show full item record
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.