A statistical method for flagging weak spots improves normalization and ratio estimates in microarrays.

M. C. Yang, Q. G. Ruan, J. J. Yang, S. Eckenrode, S. Wu, R. A. McIndoe, J. X. She

Research output: Contribution to journalArticle

95 Citations (Scopus)

Abstract

Over the last few years, there has been a dramatic increase in the use of cDNA microarrays to monitor gene expression changes in biological systems. Data from these experiments are usually transformed into expression ratios between experimental samples and a common reference sample for subsequent data analysis. The accuracy of this critical transformation depends on two major parameters: the signal intensities and the normalization of the experiment vs. reference signal intensities. Here we describe and validate a new model for microarray signal intensity that has one multiplicative variation and one additive background variation. Using replicative experiments and simulated data, we found that the signal intensity is the most critical parameter that influences the performance of normalization, accuracy of ratio estimates, reproducibility, specificity, and sensitivity of microarray experiments. Therefore, we developed a statistical procedure to flag spots with weak signal intensity based on the standard deviation (delta(ij)) of background differences between a spot and the neighboring spots, i.e., a spot is considered as too weak if the signal is weaker than cdelta(ij). Our studies suggest that normalization and ratio estimates were unacceptable when this threshold (c) is small. We further showed that when a reasonable compromise of c (c = 6) is applied, normalization using trimmed mean of log ratios performed slightly better than global intensity and mean of ratios. These studies suggest that decreasing the background noise is critical to improve the quality of microarray experiments.

Original languageEnglish (US)
Pages (from-to)45-53
Number of pages9
JournalPhysiological Genomics
Volume7
Issue number1
StatePublished - Oct 10 2001

Fingerprint

Oligonucleotide Array Sequence Analysis
Noise
Gene Expression
Sensitivity and Specificity

Keywords

  • Functional genomics
  • Gene expression
  • Microarray
  • Normalization
  • Statistics

ASJC Scopus subject areas

  • Genetics
  • Physiology

Cite this

A statistical method for flagging weak spots improves normalization and ratio estimates in microarrays. / Yang, M. C.; Ruan, Q. G.; Yang, J. J.; Eckenrode, S.; Wu, S.; McIndoe, R. A.; She, J. X.

In: Physiological Genomics, Vol. 7, No. 1, 10.10.2001, p. 45-53.

Research output: Contribution to journalArticle

Yang, M. C. ; Ruan, Q. G. ; Yang, J. J. ; Eckenrode, S. ; Wu, S. ; McIndoe, R. A. ; She, J. X. / A statistical method for flagging weak spots improves normalization and ratio estimates in microarrays. In: Physiological Genomics. 2001 ; Vol. 7, No. 1. pp. 45-53.
@article{0a1cf0edf70f4a1e91a93904e17d030f,
title = "A statistical method for flagging weak spots improves normalization and ratio estimates in microarrays.",
abstract = "Over the last few years, there has been a dramatic increase in the use of cDNA microarrays to monitor gene expression changes in biological systems. Data from these experiments are usually transformed into expression ratios between experimental samples and a common reference sample for subsequent data analysis. The accuracy of this critical transformation depends on two major parameters: the signal intensities and the normalization of the experiment vs. reference signal intensities. Here we describe and validate a new model for microarray signal intensity that has one multiplicative variation and one additive background variation. Using replicative experiments and simulated data, we found that the signal intensity is the most critical parameter that influences the performance of normalization, accuracy of ratio estimates, reproducibility, specificity, and sensitivity of microarray experiments. Therefore, we developed a statistical procedure to flag spots with weak signal intensity based on the standard deviation (delta(ij)) of background differences between a spot and the neighboring spots, i.e., a spot is considered as too weak if the signal is weaker than cdelta(ij). Our studies suggest that normalization and ratio estimates were unacceptable when this threshold (c) is small. We further showed that when a reasonable compromise of c (c = 6) is applied, normalization using trimmed mean of log ratios performed slightly better than global intensity and mean of ratios. These studies suggest that decreasing the background noise is critical to improve the quality of microarray experiments.",
keywords = "Functional genomics, Gene expression, Microarray, Normalization, Statistics",
author = "Yang, {M. C.} and Ruan, {Q. G.} and Yang, {J. J.} and S. Eckenrode and S. Wu and McIndoe, {R. A.} and She, {J. X.}",
year = "2001",
month = "10",
day = "10",
language = "English (US)",
volume = "7",
pages = "45--53",
journal = "Physiological Genomics",
issn = "1094-8341",
publisher = "American Physiological Society",
number = "1",

}

TY - JOUR

T1 - A statistical method for flagging weak spots improves normalization and ratio estimates in microarrays.

AU - Yang, M. C.

AU - Ruan, Q. G.

AU - Yang, J. J.

AU - Eckenrode, S.

AU - Wu, S.

AU - McIndoe, R. A.

AU - She, J. X.

PY - 2001/10/10

Y1 - 2001/10/10

N2 - Over the last few years, there has been a dramatic increase in the use of cDNA microarrays to monitor gene expression changes in biological systems. Data from these experiments are usually transformed into expression ratios between experimental samples and a common reference sample for subsequent data analysis. The accuracy of this critical transformation depends on two major parameters: the signal intensities and the normalization of the experiment vs. reference signal intensities. Here we describe and validate a new model for microarray signal intensity that has one multiplicative variation and one additive background variation. Using replicative experiments and simulated data, we found that the signal intensity is the most critical parameter that influences the performance of normalization, accuracy of ratio estimates, reproducibility, specificity, and sensitivity of microarray experiments. Therefore, we developed a statistical procedure to flag spots with weak signal intensity based on the standard deviation (delta(ij)) of background differences between a spot and the neighboring spots, i.e., a spot is considered as too weak if the signal is weaker than cdelta(ij). Our studies suggest that normalization and ratio estimates were unacceptable when this threshold (c) is small. We further showed that when a reasonable compromise of c (c = 6) is applied, normalization using trimmed mean of log ratios performed slightly better than global intensity and mean of ratios. These studies suggest that decreasing the background noise is critical to improve the quality of microarray experiments.

AB - Over the last few years, there has been a dramatic increase in the use of cDNA microarrays to monitor gene expression changes in biological systems. Data from these experiments are usually transformed into expression ratios between experimental samples and a common reference sample for subsequent data analysis. The accuracy of this critical transformation depends on two major parameters: the signal intensities and the normalization of the experiment vs. reference signal intensities. Here we describe and validate a new model for microarray signal intensity that has one multiplicative variation and one additive background variation. Using replicative experiments and simulated data, we found that the signal intensity is the most critical parameter that influences the performance of normalization, accuracy of ratio estimates, reproducibility, specificity, and sensitivity of microarray experiments. Therefore, we developed a statistical procedure to flag spots with weak signal intensity based on the standard deviation (delta(ij)) of background differences between a spot and the neighboring spots, i.e., a spot is considered as too weak if the signal is weaker than cdelta(ij). Our studies suggest that normalization and ratio estimates were unacceptable when this threshold (c) is small. We further showed that when a reasonable compromise of c (c = 6) is applied, normalization using trimmed mean of log ratios performed slightly better than global intensity and mean of ratios. These studies suggest that decreasing the background noise is critical to improve the quality of microarray experiments.

KW - Functional genomics

KW - Gene expression

KW - Microarray

KW - Normalization

KW - Statistics

UR - http://www.scopus.com/inward/record.url?scp=0035841226&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0035841226&partnerID=8YFLogxK

M3 - Article

C2 - 11595791

AN - SCOPUS:0035841226

VL - 7

SP - 45

EP - 53

JO - Physiological Genomics

JF - Physiological Genomics

SN - 1094-8341

IS - 1

ER -