Please use this identifier to cite or link to this item: https://research.matf.bg.ac.rs/handle/123456789/1489
DC FieldValueLanguage
dc.contributor.authorŠošić, Milenaen_US
dc.contributor.authorGraovac, Jelenaen_US
dc.date.accessioned2025-02-13T14:09:43Z-
dc.date.available2025-02-13T14:09:43Z-
dc.date.issued2022-09-01-
dc.identifier.issn18200214-
dc.identifier.urihttps://research.matf.bg.ac.rs/handle/123456789/1489-
dc.description.abstractWith the steady increase in the number of Internet users, email remains the most popular and extensively used communication means. Therefore, email management is an important and growing problem for individuals and organiza-tions. In this paper, we deal with the classification of emails into two main cate-gories, Business and Personal. To find the best performing solution for this problem, a comprehensive set of experiments has been conducted with the deep learning al-gorithms: Bidirectional Long-Short Term Memory (BiLSTM) and Attention-based BiLSTM (BiLSTM+Att), together with traditional Machine Learning (ML) algo-rithms: Stochastic Gradient Descent (SGD) optimization applied on Support Vector Machine (SVM) and Extremely Randomized Trees (ERT) ensemble method. The variations of individual email and conversational email thread arc representations have been explored to reach the best classification generalization on the selected task. A special contribution of this paper is the extraction of a large number of ad-ditional lexical, conversational, expressional, emotional, and moral features, which proved very useful for differentiation between personal and official written con-versations. The experiments were performed on the publicly available Enron email benchmark corpora on which we obtained the State-Of-the-Art (SOA) results. As part of the submission, we have made our work publicly available to the scientific community for research purposes.en_US
dc.language.isoenen_US
dc.publisherNovi Sad : ComSIS Consortiumen_US
dc.relation.ispartofComputer Science and Information Systemsen_US
dc.subjectBERT embeddingsen_US
dc.subjectBiLSTMen_US
dc.subjectbusinessen_US
dc.subjectdeep learningen_US
dc.subjectEmail classificationen_US
dc.subjectlexiconsen_US
dc.subjectNLPen_US
dc.subjectpersonalen_US
dc.subjectSGDen_US
dc.subjectTf-Idfen_US
dc.titleEffective Methods for Email Classification: Is it a Business or Personal Email?en_US
dc.typeArticleen_US
dc.identifier.doi10.2298/CSIS220212034S-
dc.identifier.scopus2-s2.0-85140654860-
dc.identifier.isi000877664300008-
dc.identifier.urlhttps://api.elsevier.com/content/abstract/scopus_id/85140654860-
dc.contributor.affiliationInformatics and Computer Scienceen_US
dc.description.rankM23en_US
dc.relation.firstpage1155en_US
dc.relation.lastpage1175en_US
dc.relation.volume19en_US
dc.relation.issue3en_US
item.openairetypeArticle-
item.fulltextNo Fulltext-
item.cerifentitytypePublications-
item.grantfulltextnone-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.languageiso639-1en-
crisitem.author.deptInformatics and Computer Science-
crisitem.author.orcid0000-0002-9323-4695-
Appears in Collections:Research outputs
Show simple item record

SCOPUSTM   
Citations

3
checked on Mar 4, 2025

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.