Please use this identifier to cite or link to this item: https://research.matf.bg.ac.rs/handle/123456789/500
DC Field | Value | Language
dc.contributor.author | Tomović, Andrija | en_US
dc.contributor.author | Janičić, Predrag | en_US
dc.contributor.author | Keselj, Vlado | en_US
dc.date.accessioned | 2022-08-13T10:14:41Z | -
dc.date.available | 2022-08-13T10:14:41Z | -
dc.date.issued | 2006 | -
dc.identifier.issn | 0169-2607 | en
dc.identifier.uri | https://research.matf.bg.ac.rs/handle/123456789/500 | -
dc.description.abstract | In this paper we address the problem of automated classification of isolates, i.e., the problem of determining the family of genomes to which a given genome belongs. Additionally, we address the problem of automated unsupervised hierarchical clustering of isolates according only to their statistical substring properties. For both of these problems we present novel algorithms based on nucleotide n-grams, with no required preprocessing steps such as sequence alignment. Results obtained experimentally are very positive and suggest that the proposed techniques can be successfully used in a variety of related problems. The reported experiments demonstrate better performance than some of the state-of-the-art methods. We report on a new distance measure between n-gram profiles, which shows superior performance compared to many other measures, including commonly used Euclidean distance. | en
dc.language.iso | en | en
dc.relation.ispartof | Computer methods and programs in biomedicine | en_US
dc.subject | Classification | en
dc.subject | Genome sequence | en
dc.subject | Hierarchical clustering | en
dc.subject | n-Gram | en
dc.subject.mesh | Algorithms | en
dc.subject.mesh | Genome, Human | en
dc.subject.mesh | Multigene Family | en
dc.subject.mesh | Sequence Analysis, DNA | en
dc.title | n-gram-based classification and unsupervised hierarchical clustering of genome sequences | en_US
dc.type | Article | en_US
dc.identifier.doi | 10.1016/j.cmpb.2005.11.007 | -
dc.identifier.pmid | 16423423 | -
dc.identifier.scopus | 2-s2.0-31344463462 | -
dc.identifier.url | https://api.elsevier.com/content/abstract/scopus_id/31344463462 | -
dc.contributor.affiliation | Informatics and Computer Science | en_US
dc.relation.firstpage | 137 | en_US
dc.relation.lastpage | 153 | en_US
dc.relation.volume | 81 | en_US
dc.relation.issue | 2 | en_US
item.fulltext | No Fulltext | -
item.languageiso639-1 | en | -
item.openairecristype | http://purl.org/coar/resource_type/c_18cf | -
item.cerifentitytype | Publications | -
item.grantfulltext | none | -
item.openairetype | Article | -
crisitem.author.dept | Informatics and Computer Science | -
crisitem.author.orcid | 0000-0001-8922-4948 | -
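
The abstract describes classifying a genome by comparing nucleotide n-gram profiles without sequence alignment. A minimal sketch of that general idea follows. It is not the authors' algorithm: the abstract does not define their new distance measure, so the sketch uses Euclidean distance, which the abstract names as a common baseline. The function names, the default n = 3, and the `families` mapping (one reference sequence per family) are all hypothetical.

```python
from collections import Counter
from math import sqrt


def ngram_profile(sequence, n=3):
    """Relative frequencies of all overlapping n-grams in a nucleotide string."""
    sequence = sequence.upper()
    counts = Counter(sequence[i:i + n] for i in range(len(sequence) - n + 1))
    total = sum(counts.values())
    if total == 0:  # sequence shorter than n
        return {}
    return {gram: count / total for gram, count in counts.items()}


def euclidean_distance(p, q):
    """Euclidean distance between two profiles; absent n-grams count as 0.

    Baseline only: the paper's own distance measure is not given in the abstract.
    """
    grams = set(p) | set(q)
    return sqrt(sum((p.get(g, 0.0) - q.get(g, 0.0)) ** 2 for g in grams))


def classify(query, families, n=3):
    """Return the family whose reference profile is nearest to the query.

    `families` maps a family name to one reference sequence (an assumption
    made for illustration; real data would have many isolates per family).
    """
    query_profile = ngram_profile(query, n)
    return min(
        families,
        key=lambda name: euclidean_distance(
            query_profile, ngram_profile(families[name], n)
        ),
    )
```

Because profiles are built from substring counts alone, sequences of different lengths can be compared directly, which is what removes the need for an alignment step.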
Appears in Collections: Research outputs

Scopus citations: 82 (checked on Dec 18, 2024)
Page views: 11 (checked on Dec 25, 2024)

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.