A new efficient statistical test for detecting variability in the gene expression data

Sunil Mathur, Samuel Dolo

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

DNA microarray technology allows researchers to monitor the expressions of thousands of genes under different conditions. The detection of differential gene expression under two different conditions is very important in microarray studies. Microarray experiments are multi-step procedures and each step is a potential source of variance. This makes the measurement of variability difficult because approach based on gene-by-gene estimation of variance will have few degrees of freedom. It is highly possible that the assumption of equal variance for all the expression levels may not hold. Also, the assumption of normality of gene expressions may not hold. Thus it is essential to have a statistical procedure which is not based on the normality assumption and also it can detect genes with differential variance efficiently. The detection of differential gene expression variance will allow us to identify experimental variables that affect different biological processes and accuracy of DNA microarray measurements. In this article, a new nonparametric test for scale is developed based on the arctangent of the ratio of two expression levels. Most of the tests available in literature require the assumption of normal distribution, which makes them inapplicable in many situations, and it is also hard to verify the suitability of the normal distribution assumption for the given data set. The proposed test does not require the assumption of the distribution for the underlying population and hence makes it more practical and widely applicable. The asymptotic relative efficiency is calculated under different distributions, which show that the proposed test is very powerful when the assumption of normality breaks down. Monte Carlo simulation studies are performed to compare the power of the proposed test with some of the existing procedures. It is found that the proposed test is more powerful than commonly used tests under almost all the distributions considered in the study. A microarray data is used to illustrate the working of the proposed test. Results indicate that the proposed test is very powerful in detecting the smallest change in differential expression variance with high degree of confidence than some of its competitors.

Original languageEnglish (US)
Pages (from-to)405-419
Number of pages15
JournalStatistical Methods in Medical Research
Volume17
Issue number4
DOIs
StatePublished - Nov 7 2008

Fingerprint

Statistical test
Gene Expression Data
Gene Expression
Normal Distribution
Oligonucleotide Array Sequence Analysis
Differential Expression
Normality
Genes
Gene
Biological Phenomena
DNA Microarray
Microarray
Gaussian distribution
Research Personnel
Demography
Technology
Asymptotic Relative Efficiency
Non-parametric test
Microarray Data
Confidence

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability
  • Health Information Management

Cite this

A new efficient statistical test for detecting variability in the gene expression data. / Mathur, Sunil; Dolo, Samuel.

In: Statistical Methods in Medical Research, Vol. 17, No. 4, 07.11.2008, p. 405-419.

Research output: Contribution to journalArticle

@article{89242eff61884070b69a27b6e93d404e,
title = "A new efficient statistical test for detecting variability in the gene expression data",
abstract = "DNA microarray technology allows researchers to monitor the expressions of thousands of genes under different conditions. The detection of differential gene expression under two different conditions is very important in microarray studies. Microarray experiments are multi-step procedures and each step is a potential source of variance. This makes the measurement of variability difficult because approach based on gene-by-gene estimation of variance will have few degrees of freedom. It is highly possible that the assumption of equal variance for all the expression levels may not hold. Also, the assumption of normality of gene expressions may not hold. Thus it is essential to have a statistical procedure which is not based on the normality assumption and also it can detect genes with differential variance efficiently. The detection of differential gene expression variance will allow us to identify experimental variables that affect different biological processes and accuracy of DNA microarray measurements. In this article, a new nonparametric test for scale is developed based on the arctangent of the ratio of two expression levels. Most of the tests available in literature require the assumption of normal distribution, which makes them inapplicable in many situations, and it is also hard to verify the suitability of the normal distribution assumption for the given data set. The proposed test does not require the assumption of the distribution for the underlying population and hence makes it more practical and widely applicable. The asymptotic relative efficiency is calculated under different distributions, which show that the proposed test is very powerful when the assumption of normality breaks down. Monte Carlo simulation studies are performed to compare the power of the proposed test with some of the existing procedures. It is found that the proposed test is more powerful than commonly used tests under almost all the distributions considered in the study. A microarray data is used to illustrate the working of the proposed test. Results indicate that the proposed test is very powerful in detecting the smallest change in differential expression variance with high degree of confidence than some of its competitors.",
author = "Sunil Mathur and Samuel Dolo",
year = "2008",
month = "11",
day = "7",
doi = "10.1177/0962280206078643",
language = "English (US)",
volume = "17",
pages = "405--419",
journal = "Statistical Methods in Medical Research",
issn = "0962-2802",
publisher = "SAGE Publications Ltd",
number = "4",

}

TY - JOUR

T1 - A new efficient statistical test for detecting variability in the gene expression data

AU - Mathur, Sunil

AU - Dolo, Samuel

PY - 2008/11/7

Y1 - 2008/11/7

N2 - DNA microarray technology allows researchers to monitor the expressions of thousands of genes under different conditions. The detection of differential gene expression under two different conditions is very important in microarray studies. Microarray experiments are multi-step procedures and each step is a potential source of variance. This makes the measurement of variability difficult because approach based on gene-by-gene estimation of variance will have few degrees of freedom. It is highly possible that the assumption of equal variance for all the expression levels may not hold. Also, the assumption of normality of gene expressions may not hold. Thus it is essential to have a statistical procedure which is not based on the normality assumption and also it can detect genes with differential variance efficiently. The detection of differential gene expression variance will allow us to identify experimental variables that affect different biological processes and accuracy of DNA microarray measurements. In this article, a new nonparametric test for scale is developed based on the arctangent of the ratio of two expression levels. Most of the tests available in literature require the assumption of normal distribution, which makes them inapplicable in many situations, and it is also hard to verify the suitability of the normal distribution assumption for the given data set. The proposed test does not require the assumption of the distribution for the underlying population and hence makes it more practical and widely applicable. The asymptotic relative efficiency is calculated under different distributions, which show that the proposed test is very powerful when the assumption of normality breaks down. Monte Carlo simulation studies are performed to compare the power of the proposed test with some of the existing procedures. It is found that the proposed test is more powerful than commonly used tests under almost all the distributions considered in the study. A microarray data is used to illustrate the working of the proposed test. Results indicate that the proposed test is very powerful in detecting the smallest change in differential expression variance with high degree of confidence than some of its competitors.

AB - DNA microarray technology allows researchers to monitor the expressions of thousands of genes under different conditions. The detection of differential gene expression under two different conditions is very important in microarray studies. Microarray experiments are multi-step procedures and each step is a potential source of variance. This makes the measurement of variability difficult because approach based on gene-by-gene estimation of variance will have few degrees of freedom. It is highly possible that the assumption of equal variance for all the expression levels may not hold. Also, the assumption of normality of gene expressions may not hold. Thus it is essential to have a statistical procedure which is not based on the normality assumption and also it can detect genes with differential variance efficiently. The detection of differential gene expression variance will allow us to identify experimental variables that affect different biological processes and accuracy of DNA microarray measurements. In this article, a new nonparametric test for scale is developed based on the arctangent of the ratio of two expression levels. Most of the tests available in literature require the assumption of normal distribution, which makes them inapplicable in many situations, and it is also hard to verify the suitability of the normal distribution assumption for the given data set. The proposed test does not require the assumption of the distribution for the underlying population and hence makes it more practical and widely applicable. The asymptotic relative efficiency is calculated under different distributions, which show that the proposed test is very powerful when the assumption of normality breaks down. Monte Carlo simulation studies are performed to compare the power of the proposed test with some of the existing procedures. It is found that the proposed test is more powerful than commonly used tests under almost all the distributions considered in the study. A microarray data is used to illustrate the working of the proposed test. Results indicate that the proposed test is very powerful in detecting the smallest change in differential expression variance with high degree of confidence than some of its competitors.

UR - http://www.scopus.com/inward/record.url?scp=55249115099&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=55249115099&partnerID=8YFLogxK

U2 - 10.1177/0962280206078643

DO - 10.1177/0962280206078643

M3 - Article

VL - 17

SP - 405

EP - 419

JO - Statistical Methods in Medical Research

JF - Statistical Methods in Medical Research

SN - 0962-2802

IS - 4

ER -