Improved approximate common interval

Amihood Amir; Leszek Gasieniec; Riva Shalom

doi:10.1016/j.ipl.2007.03.006

Research output: Contribution to journal › Article › peer-review

12 Scopus citations

Abstract

This paper discusses the approximate common interval (ACI) problem, in which intervals of multiple genome strings are compared by their character sets. Genomes are treated as strings in which repeated symbols may represent paralogous genes, and gene clusters are detected by modeling each gene interval as its set of characters. An algorithm with a specified running time locates all intervals of two strings that share the same character set. The ACI problem for a given number of strings can be solved in time and space bounded in terms of the length of each string. A procedure for extracting all maximal character sets of the input strings is studied, along with the ACI problem for a single input string and for multiple input strings. A graphical representation yields a simple and versatile algorithm for the approximate common interval problem.
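
To make the notion of intervals that "share the same character set" concrete, the following is a minimal brute-force sketch in Python of the exact (not approximate) two-string case. It is not the algorithm from the paper: it simply enumerates every interval, so it is far slower than the bounds the abstract alludes to. The function names `interval_character_sets` and `common_intervals` are hypothetical and chosen only for this illustration.

```python
from collections import defaultdict


def interval_character_sets(s):
    """Map each character set to the inclusive intervals (i, j) of s that realize it."""
    table = defaultdict(list)
    for i in range(len(s)):
        seen = set()
        for j in range(i, len(s)):
            # Extend the interval one symbol to the right; repeats leave the set unchanged.
            seen.add(s[j])
            table[frozenset(seen)].append((i, j))
    return table


def common_intervals(s1, s2):
    """Return {character set: (intervals in s1, intervals in s2)} for sets found in both strings."""
    t1 = interval_character_sets(s1)
    t2 = interval_character_sets(s2)
    return {cs: (t1[cs], t2[cs]) for cs in t1.keys() & t2.keys()}


if __name__ == "__main__":
    # Symbols stand in for genes; a repeated symbol models a paralogous gene.
    for chars, (in_s1, in_s2) in common_intervals("aab", "ba").items():
        print(sorted(chars), in_s1, in_s2)
```

Handling the approximate variant would require relaxing the exact set equality used as the dictionary key here; this sketch leaves that out.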

Original language	English (US)
Pages (from-to)	142-149
Number of pages	8
Journal	Information Processing Letters
Volume	103
Issue number	4
DOIs	https://doi.org/10.1016/j.ipl.2007.03.006
State	Published - Aug 16 2007
Externally published	Yes

Keywords

Computational biology
Design of algorithms
Gene evolution
Hamming distance
Pattern matching

ASJC Scopus subject areas

Theoretical Computer Science
Signal Processing
Information Systems
Computer Science Applications

Cite this

@article{32ef9c27156d4cddbd639b5cea47b4f6,

title = "Improved approximate common interval",

abstract = "This paper discusses the approximate common interval (ACI) problem, in which intervals of multiple genome strings are compared by their character sets. Genomes are treated as strings in which repeated symbols may represent paralogous genes, and gene clusters are detected by modeling each gene interval as its set of characters. An algorithm with a specified running time locates all intervals of two strings that share the same character set. The ACI problem for a given number of strings can be solved in time and space bounded in terms of the length of each string. A procedure for extracting all maximal character sets of the input strings is studied, along with the ACI problem for a single input string and for multiple input strings. A graphical representation yields a simple and versatile algorithm for the approximate common interval problem.",

keywords = "Computational biology, Design of algorithms, Gene evolution, Hamming distance, Pattern matching",

author = "Amihood Amir and Leszek Gasieniec and Riva Shalom",

note = "Funding Information: * Corresponding author at: Department of Computer Science, Bar-Ilan University, Ramat-Gan 52900, Israel. Tel.: +972 3 531 8770. E-mail addresses: amir@cs.biu.ac.il (A. Amir), leszek@csc.liv.ac.uk (L. Gasieniec), gonenr1@cs.biu.ac.il (R. Shalom). 1 Partly supported by NSF grant CCR-01-04494 and ISF grant 35/05. 2 Tel.: +44 151 794 3686. 3 Tel.: +972 3 531 8408.",

year = "2007",

month = aug,

day = "16",

doi = "10.1016/j.ipl.2007.03.006",

language = "English (US)",

volume = "103",

pages = "142--149",

journal = "Information Processing Letters",

issn = "0020-0190",

publisher = "Elsevier",

number = "4",

}

TY - JOUR

T1 - Improved approximate common interval

AU - Amir, Amihood

AU - Gasieniec, Leszek

AU - Shalom, Riva

N1 - Funding Information: * Corresponding author at: Department of Computer Science, Bar-Ilan University, Ramat-Gan 52900, Israel. Tel.: +972 3 531 8770. E-mail addresses: amir@cs.biu.ac.il (A. Amir), leszek@csc.liv.ac.uk (L. Gasieniec), gonenr1@cs.biu.ac.il (R. Shalom). 1 Partly supported by NSF grant CCR-01-04494 and ISF grant 35/05. 2 Tel.: +44 151 794 3686. 3 Tel.: +972 3 531 8408.

PY - 2007/8/16

Y1 - 2007/8/16

AB - This paper discusses the approximate common interval (ACI) problem, in which intervals of multiple genome strings are compared by their character sets. Genomes are treated as strings in which repeated symbols may represent paralogous genes, and gene clusters are detected by modeling each gene interval as its set of characters. An algorithm with a specified running time locates all intervals of two strings that share the same character set. The ACI problem for a given number of strings can be solved in time and space bounded in terms of the length of each string. A procedure for extracting all maximal character sets of the input strings is studied, along with the ACI problem for a single input string and for multiple input strings. A graphical representation yields a simple and versatile algorithm for the approximate common interval problem.

KW - Computational biology

KW - Design of algorithms

KW - Gene evolution

KW - Hamming distance

KW - Pattern matching

UR - http://www.scopus.com/inward/record.url?scp=34249011129&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34249011129&partnerID=8YFLogxK

U2 - 10.1016/j.ipl.2007.03.006

DO - 10.1016/j.ipl.2007.03.006

M3 - Article

AN - SCOPUS:34249011129

SN - 0020-0190

VL - 103

SP - 142

EP - 149

JO - Information Processing Letters

JF - Information Processing Letters

IS - 4

ER -
