Please use this identifier to cite or link to this item:
https://research.matf.bg.ac.rs/handle/123456789/685
Title: | Graph search and variable neighborhood search for finding constrained longest common subsequences in artificial and real gene sequences | Authors: | Djukanović, Marko Kartelj, Aleksandar Matić, Dragan Grbić, Milana Blum, Christian Raidl, Günther R. |
Affiliations: | Informatics and Computer Science | Keywords: | Beam search;Computational biology;Hybrid Methods;Longest common subsequence;Variable neighborhood search | Issue Date: | 2022 | Journal: | Applied Soft Computing | Abstract: | We consider the constrained longest common subsequence problem with an arbitrary set of input strings as well as an arbitrary set of pattern strings. This problem has applications, for example, in computational biology where it serves as a measure of similarity for sets of molecules with putative structures in common. We contribute in several ways. First, it is formally proven that finding a feasible solution of arbitrary length is, in general, NP-complete. Second, we propose several heuristic approaches: a greedy algorithm, a beam search aiming for feasibility, a variable neighborhood search, and a hybrid of the latter two approaches. An exhaustive experimental study shows the effectivity and differences of the proposed approaches in respect to finding a feasible solution, finding high-quality solutions, and runtime for both, artificial and real-world instance sets. The latter ones are generated from a set of 12681 bacteria 16S rRNA gene sequences and consider 15 primer contigs as pattern strings. |
URI: | https://research.matf.bg.ac.rs/handle/123456789/685 | ISSN: | 15684946 | DOI: | 10.1016/j.asoc.2022.108844 |
Appears in Collections: | Research outputs |
Show full item record
SCOPUSTM
Citations
3
checked on Dec 20, 2024
Page view(s)
8
checked on Dec 25, 2024
Google ScholarTM
Check
Altmetric
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.