A hybrid gene team model and its application to genome analysis

Sun Kim, Jeong-Hyeon Choi, Amit Saple, Jiong Yang

Research output: Contribution to journalArticle

10 Citations (Scopus)

Abstract

It is well-known that functionally related genes occur in a physically clustered form, especially operons in bacteria. By leveraging on this fact, there has recently been an interesting problem formulation known as gene team model, which searches for a set of genes that co-occur in a pair of closely related genomes. However, many gene teams, even experimentally verified operons, frequently scatter within other genomes. Thus, the gene team model should be refined to reflect this observation. In this paper, we generalized the gene team model, that looks for gene clusters in a physically clustered form, to multiple genome cases with relaxed constraints. We propose a novel hybrid pattern model that combines the set and the sequential pattern models. Our model searches for gene clusters with and/or without physical proximity constraint. This model is implemented and tested with 97 genomes (120 replicons). The result was analyzed to show the usefulness of our model. We also compared the result from our hybrid model to those from the traditional gene team model. We also show that predicted gene teams can be used for various genome analysis: operon prediction, phylogenetic analysis of organisms, contextual sequence analysis and genome annotation. Our program is fast enough to provide a service on the web at http://platcom.informatics.indiana.edu/platcom/. Users can select any combination of 97 genomes to predict gene teams.

Original languageEnglish (US)
Pages (from-to)171-196
Number of pages26
JournalJournal of Bioinformatics and Computational Biology
Volume4
Issue number2
DOIs
StatePublished - Apr 1 2006
Externally publishedYes

Fingerprint

Genes
Genome
Operon
Multigene Family
Replicon
Informatics
Sequence Analysis
Research Design
Bacteria

Keywords

  • Cog team
  • Gene cluster
  • Gene team
  • Hybrid model
  • Proximity constraint

ASJC Scopus subject areas

  • Biochemistry
  • Molecular Biology
  • Computer Science Applications

Cite this

A hybrid gene team model and its application to genome analysis. / Kim, Sun; Choi, Jeong-Hyeon; Saple, Amit; Yang, Jiong.

In: Journal of Bioinformatics and Computational Biology, Vol. 4, No. 2, 01.04.2006, p. 171-196.

Research output: Contribution to journalArticle

@article{aab667e484814436ba6267d081197b60,
title = "A hybrid gene team model and its application to genome analysis",
abstract = "It is well-known that functionally related genes occur in a physically clustered form, especially operons in bacteria. By leveraging on this fact, there has recently been an interesting problem formulation known as gene team model, which searches for a set of genes that co-occur in a pair of closely related genomes. However, many gene teams, even experimentally verified operons, frequently scatter within other genomes. Thus, the gene team model should be refined to reflect this observation. In this paper, we generalized the gene team model, that looks for gene clusters in a physically clustered form, to multiple genome cases with relaxed constraints. We propose a novel hybrid pattern model that combines the set and the sequential pattern models. Our model searches for gene clusters with and/or without physical proximity constraint. This model is implemented and tested with 97 genomes (120 replicons). The result was analyzed to show the usefulness of our model. We also compared the result from our hybrid model to those from the traditional gene team model. We also show that predicted gene teams can be used for various genome analysis: operon prediction, phylogenetic analysis of organisms, contextual sequence analysis and genome annotation. Our program is fast enough to provide a service on the web at http://platcom.informatics.indiana.edu/platcom/. Users can select any combination of 97 genomes to predict gene teams.",
keywords = "Cog team, Gene cluster, Gene team, Hybrid model, Proximity constraint",
author = "Sun Kim and Jeong-Hyeon Choi and Amit Saple and Jiong Yang",
year = "2006",
month = "4",
day = "1",
doi = "10.1142/S0219720006001850",
language = "English (US)",
volume = "4",
pages = "171--196",
journal = "Journal of Bioinformatics and Computational Biology",
issn = "0219-7200",
publisher = "World Scientific Publishing Co. Pte Ltd",
number = "2",

}

TY - JOUR

T1 - A hybrid gene team model and its application to genome analysis

AU - Kim, Sun

AU - Choi, Jeong-Hyeon

AU - Saple, Amit

AU - Yang, Jiong

PY - 2006/4/1

Y1 - 2006/4/1

N2 - It is well-known that functionally related genes occur in a physically clustered form, especially operons in bacteria. By leveraging on this fact, there has recently been an interesting problem formulation known as gene team model, which searches for a set of genes that co-occur in a pair of closely related genomes. However, many gene teams, even experimentally verified operons, frequently scatter within other genomes. Thus, the gene team model should be refined to reflect this observation. In this paper, we generalized the gene team model, that looks for gene clusters in a physically clustered form, to multiple genome cases with relaxed constraints. We propose a novel hybrid pattern model that combines the set and the sequential pattern models. Our model searches for gene clusters with and/or without physical proximity constraint. This model is implemented and tested with 97 genomes (120 replicons). The result was analyzed to show the usefulness of our model. We also compared the result from our hybrid model to those from the traditional gene team model. We also show that predicted gene teams can be used for various genome analysis: operon prediction, phylogenetic analysis of organisms, contextual sequence analysis and genome annotation. Our program is fast enough to provide a service on the web at http://platcom.informatics.indiana.edu/platcom/. Users can select any combination of 97 genomes to predict gene teams.

AB - It is well-known that functionally related genes occur in a physically clustered form, especially operons in bacteria. By leveraging on this fact, there has recently been an interesting problem formulation known as gene team model, which searches for a set of genes that co-occur in a pair of closely related genomes. However, many gene teams, even experimentally verified operons, frequently scatter within other genomes. Thus, the gene team model should be refined to reflect this observation. In this paper, we generalized the gene team model, that looks for gene clusters in a physically clustered form, to multiple genome cases with relaxed constraints. We propose a novel hybrid pattern model that combines the set and the sequential pattern models. Our model searches for gene clusters with and/or without physical proximity constraint. This model is implemented and tested with 97 genomes (120 replicons). The result was analyzed to show the usefulness of our model. We also compared the result from our hybrid model to those from the traditional gene team model. We also show that predicted gene teams can be used for various genome analysis: operon prediction, phylogenetic analysis of organisms, contextual sequence analysis and genome annotation. Our program is fast enough to provide a service on the web at http://platcom.informatics.indiana.edu/platcom/. Users can select any combination of 97 genomes to predict gene teams.

KW - Cog team

KW - Gene cluster

KW - Gene team

KW - Hybrid model

KW - Proximity constraint

UR - http://www.scopus.com/inward/record.url?scp=33745726581&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33745726581&partnerID=8YFLogxK

U2 - 10.1142/S0219720006001850

DO - 10.1142/S0219720006001850

M3 - Article

VL - 4

SP - 171

EP - 196

JO - Journal of Bioinformatics and Computational Biology

JF - Journal of Bioinformatics and Computational Biology

SN - 0219-7200

IS - 2

ER -