Please use this identifier to cite or link to this item: https://research.matf.bg.ac.rs/handle/123456789/337
DC FieldValueLanguage
dc.contributor.authorJelovic, Ana Men_US
dc.contributor.authorMitić, Nenaden_US
dc.contributor.authorEshafah, Samiraen_US
dc.contributor.authorBeljanski, Milos Ven_US
dc.date.accessioned2022-08-09T12:54:11Z-
dc.date.available2022-08-09T12:54:11Z-
dc.date.issued2018-
dc.identifier.issn10665277-
dc.identifier.urihttps://research.matf.bg.ac.rs/handle/123456789/337-
dc.description.abstractDNA repeats have great importance for biological research and a large number of tools for determining repeats have been developed. Herein we define a method for extracting a statistically significant subset of a determined set of repeats. Our aim was to identify a subset of repeats in the input sequences that are not expected to occur with a number of their appearances in a random sequence of the same length. It is expected that results obtained in such manner would reduce the quantity of processed material and could thereby represent a more important biological signal. With DNA, RNA, and protein sequences serving as input material, we also examined the possibility of statistical filtering of repeats in sequences over an arbitrary alphabet. A new method for selecting statistically significant repeats from a set of determined repeats has been defined. The proposed method was tested on a large number of randomly generated sequences. The application of the method on biological sequences revealed that for some viruses, shorter repeats are more statistically significant than longer ones because of their frequent appearance, whereas for bacteria, the majority of identified repeats are statistically significant.en_US
dc.language.isoenen_US
dc.relation.ispartofJournal of computational biology : a journal of computational molecular cell biologyen_US
dc.subjectDNAen_US
dc.subjectRNAen_US
dc.subjectprotein sequencesen_US
dc.subjectrepeatsen_US
dc.subjectstatistically significanten_US
dc.titleFinding Statistically Significant Repeats in Nucleic Acids and Proteinsen_US
dc.typeArticleen_US
dc.identifier.doi10.1089/cmb.2017.0046-
dc.identifier.pmid29272145-
dc.identifier.scopus2-s2.0-85045195060-
dc.identifier.urlhttps://api.elsevier.com/content/abstract/scopus_id/85045195060-
dc.contributor.affiliationInformatics and Computer Scienceen_US
dc.relation.firstpage375en_US
dc.relation.lastpage387en_US
dc.relation.volume25en_US
dc.relation.issue4en_US
item.fulltextNo Fulltext-
item.languageiso639-1en-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.cerifentitytypePublications-
item.grantfulltextnone-
item.openairetypeArticle-
crisitem.author.deptInformatics and Computer Science-
Appears in Collections:Research outputs
Show simple item record

SCOPUSTM   
Citations

5
checked on Dec 18, 2024

Page view(s)

19
checked on Dec 24, 2024

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.