A run-based procedure to identify time-lagged gene clusters in microarray experiments

Sunil Mathur

Research output: Contribution to journalArticle

Abstract

Motivation: The analysis of gene-expression data obtained from microarray experiments can be useful to identify regulatory relationship between genes. Genes with a common functional role have similar expression patterns across different microarray experiments. These similar expression patterns are perhaps due to co-regulation of genes in the same functional group. Most of the existing methods available for the identification of the regulatory relationships are either made for comparing two genes at a time or methods are not computationally efficient in the identification of the regulatory relationships. The procedures adopted by these methods do not use complete information contained in the data set. In this paper, we propose a statistical procedure, which will use the information contained in the data set to cluster genes that show similar patterns. The proposed procedure compares several genes at a time instead of pair-wise comparisons as done in some of the other procedures. The proposed procedure provides gene clusters based on time-lagged data sets with more details. The proposed method provides a numerical value that would facilitate in comparing different sets of data obtained from different expressions. It also provides the identification of the gene involved and the time point at which the observation is made so that proper medicine can be developed for the gene-specific and time-specific disease. Results: We applied the proposed procedure on the Spellman data set (Mol. Biol. Cell 1998; 9(12): 3273-3297) and compared our procedure with some of the other existing procedures. We found that our procedure is more computationally efficient than Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures. The proposed procedure also provides more details about the clusters than Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures. The proposed procedure is really simple to apply as compared with other available procedures in the literature including Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures.

Original languageEnglish (US)
Pages (from-to)326-337
Number of pages12
JournalStatistics in Medicine
Volume28
Issue number2
DOIs
StatePublished - Jan 30 2009

Fingerprint

Multigene Family
Microarray
Gene
Experiment
Edge Detection
Bioinformatics
Genes
Computational Biology
Pairwise Comparisons
Gene Expression Data
Medicine

Keywords

  • Clustering of genes
  • Gene expression
  • Statistical procedure
  • Time-lagged

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability

Cite this

A run-based procedure to identify time-lagged gene clusters in microarray experiments. / Mathur, Sunil.

In: Statistics in Medicine, Vol. 28, No. 2, 30.01.2009, p. 326-337.

Research output: Contribution to journalArticle

@article{db204013347c4156ba264977ff9cd421,
title = "A run-based procedure to identify time-lagged gene clusters in microarray experiments",
abstract = "Motivation: The analysis of gene-expression data obtained from microarray experiments can be useful to identify regulatory relationship between genes. Genes with a common functional role have similar expression patterns across different microarray experiments. These similar expression patterns are perhaps due to co-regulation of genes in the same functional group. Most of the existing methods available for the identification of the regulatory relationships are either made for comparing two genes at a time or methods are not computationally efficient in the identification of the regulatory relationships. The procedures adopted by these methods do not use complete information contained in the data set. In this paper, we propose a statistical procedure, which will use the information contained in the data set to cluster genes that show similar patterns. The proposed procedure compares several genes at a time instead of pair-wise comparisons as done in some of the other procedures. The proposed procedure provides gene clusters based on time-lagged data sets with more details. The proposed method provides a numerical value that would facilitate in comparing different sets of data obtained from different expressions. It also provides the identification of the gene involved and the time point at which the observation is made so that proper medicine can be developed for the gene-specific and time-specific disease. Results: We applied the proposed procedure on the Spellman data set (Mol. Biol. Cell 1998; 9(12): 3273-3297) and compared our procedure with some of the other existing procedures. We found that our procedure is more computationally efficient than Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures. The proposed procedure also provides more details about the clusters than Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures. The proposed procedure is really simple to apply as compared with other available procedures in the literature including Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures.",
keywords = "Clustering of genes, Gene expression, Statistical procedure, Time-lagged",
author = "Sunil Mathur",
year = "2009",
month = "1",
day = "30",
doi = "10.1002/sim.3467",
language = "English (US)",
volume = "28",
pages = "326--337",
journal = "Statistics in Medicine",
issn = "0277-6715",
publisher = "John Wiley and Sons Ltd",
number = "2",

}

TY - JOUR

T1 - A run-based procedure to identify time-lagged gene clusters in microarray experiments

AU - Mathur, Sunil

PY - 2009/1/30

Y1 - 2009/1/30

N2 - Motivation: The analysis of gene-expression data obtained from microarray experiments can be useful to identify regulatory relationship between genes. Genes with a common functional role have similar expression patterns across different microarray experiments. These similar expression patterns are perhaps due to co-regulation of genes in the same functional group. Most of the existing methods available for the identification of the regulatory relationships are either made for comparing two genes at a time or methods are not computationally efficient in the identification of the regulatory relationships. The procedures adopted by these methods do not use complete information contained in the data set. In this paper, we propose a statistical procedure, which will use the information contained in the data set to cluster genes that show similar patterns. The proposed procedure compares several genes at a time instead of pair-wise comparisons as done in some of the other procedures. The proposed procedure provides gene clusters based on time-lagged data sets with more details. The proposed method provides a numerical value that would facilitate in comparing different sets of data obtained from different expressions. It also provides the identification of the gene involved and the time point at which the observation is made so that proper medicine can be developed for the gene-specific and time-specific disease. Results: We applied the proposed procedure on the Spellman data set (Mol. Biol. Cell 1998; 9(12): 3273-3297) and compared our procedure with some of the other existing procedures. We found that our procedure is more computationally efficient than Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures. The proposed procedure also provides more details about the clusters than Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures. The proposed procedure is really simple to apply as compared with other available procedures in the literature including Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures.

AB - Motivation: The analysis of gene-expression data obtained from microarray experiments can be useful to identify regulatory relationship between genes. Genes with a common functional role have similar expression patterns across different microarray experiments. These similar expression patterns are perhaps due to co-regulation of genes in the same functional group. Most of the existing methods available for the identification of the regulatory relationships are either made for comparing two genes at a time or methods are not computationally efficient in the identification of the regulatory relationships. The procedures adopted by these methods do not use complete information contained in the data set. In this paper, we propose a statistical procedure, which will use the information contained in the data set to cluster genes that show similar patterns. The proposed procedure compares several genes at a time instead of pair-wise comparisons as done in some of the other procedures. The proposed procedure provides gene clusters based on time-lagged data sets with more details. The proposed method provides a numerical value that would facilitate in comparing different sets of data obtained from different expressions. It also provides the identification of the gene involved and the time point at which the observation is made so that proper medicine can be developed for the gene-specific and time-specific disease. Results: We applied the proposed procedure on the Spellman data set (Mol. Biol. Cell 1998; 9(12): 3273-3297) and compared our procedure with some of the other existing procedures. We found that our procedure is more computationally efficient than Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures. The proposed procedure also provides more details about the clusters than Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures. The proposed procedure is really simple to apply as compared with other available procedures in the literature including Ji and Tan (Bioinformatics 2005; 21:509-516), event method and edge detection procedures.

KW - Clustering of genes

KW - Gene expression

KW - Statistical procedure

KW - Time-lagged

UR - http://www.scopus.com/inward/record.url?scp=61749097965&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=61749097965&partnerID=8YFLogxK

U2 - 10.1002/sim.3467

DO - 10.1002/sim.3467

M3 - Article

VL - 28

SP - 326

EP - 337

JO - Statistics in Medicine

JF - Statistics in Medicine

SN - 0277-6715

IS - 2

ER -