Please use this identifier to cite or link to this item: https://research.matf.bg.ac.rs/handle/123456789/2626
Title: Language-Independent Sentiment Polarity Detection in Movie Reviews: A Case Study of English and Spanish
Authors: Graovac, Jelena 
Pavlović-Lažetić, Gordana
Affiliations: Informatics and Computer Science 
Issue Date: 2014
Rank: M33
Publisher: Ohrid : ICT Inovations
Related Publication(s): Web proceedings of 6th ICT Inovations International Conference
Conference: International Conference ICT Inovations (6 ; 2014 ; Ohrid)
Abstract: 
We present a novel language-independent technique for determining polarity, positive or negative, of opinions expressed by different individuals. The technique is based on byte-level n-gram frequency statistics method for document representation, and a variant of k nearest neighbors (kNN) (for k = 1) machine learning algorithm for categorization process. The main advantages of the technique are its simplicity and full language and topic independence. For driving experiments we used corpora of movie reviews: Cornell polarity dataset in English and MuchoCine in Spanish. Experimental results (85.6% accuracy for English and 82.49% for Spanish corpora) confirm that the presented technique is comparable with the best ranked previously published techniques, when applied to movie reviews datasets. Still, it use no additional linguistic information nor external resources.
URI: https://research.matf.bg.ac.rs/handle/123456789/2626
Appears in Collections:Research outputs

Show full item record

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.