Please use this identifier to cite or link to this item:
                
       https://research.matf.bg.ac.rs/handle/123456789/335| Title: | n-Gram characterization of genomic islands in bacterial genomes | Authors: | Pavlović-Lazetić, Gordana M Mitić, Nenad Beljanski, Milos V | Affiliations: | Informatics and Computer Science | Keywords: | Backbone sequence;Escherichia coli O157:H7 EDL933;Genomic islands;Horizontal gene transfer;n-Grams;Statistical analysis;Zipf-like analysis | Issue Date: | 2009 | Journal: | Computer methods and programs in biomedicine | Abstract: | The paper presents a novel, n-gram-based method for analysis of bacterial genome segments known as genomic islands (GIs). Identification of GIs in bacterial genomes is an important task since many of them represent inserts that may contribute to bacterial evolution and pathogenesis. In order to characterize and distinguish GIs from rest of the genome, binary classification of islands based on n-gram frequency distribution have been performed. It consists of testing the agreement of islands n-gram frequency distributions with the complete genome and backbone sequence. In addition, a statistic based on the maximal order Markov model is used to identify significantly overrepresented and underrepresented n-grams in islands. The results may be used as a basis for Zipf-like analysis suggesting that some of the n-grams are overrepresented in a subset of islands and underrepresented in the backbone, or vice versa, thus complementing the binary classification. The method is applied to strain-specific regions in the Escherichia coli O157:H7 EDL933 genome (O-islands), resulting in two groups of O-islands with different n-gram characteristics. It refines a characterization based on other compositional features such as G+C content and codon usage, and may help in identification of GIs, and also in research and development of adequate drugs targeting virulence genes in them. | URI: | https://research.matf.bg.ac.rs/handle/123456789/335 | ISSN: | 01692607 | DOI: | 10.1016/j.cmpb.2008.10.014 | 
| Appears in Collections: | Research outputs | 
Show full item record
SCOPUSTM   
 Citations
		
		
		
				
		
		
		
			8
		
		
		
				
		
		
		
	
			checked on Oct 24, 2025
		
	Page view(s)
19
			checked on Jan 19, 2025
		
	Google ScholarTM
		
		
   		    Check
	Altmetric
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
