Detecting periodic patterns in unevenly spaced gene expression time series using Lomb-Scargle periodograms

Earl F. Glynn, Jie Chen, Arcady R. Mushegian

Research output: Contribution to journalArticle

129 Citations (Scopus)

Abstract

Motivation: Periodic patterns in time series resulting from biological experiments are of great interest. The commonly used Fast Fourier Transform (FFT) algorithm is applicable only when data are evenly spaced and when no values are missing, which is not always the case in high-throughput measurements. The choice of statistic to evaluate the significance of the periodic patterns for unevenly spaced gene expression time series has not been well substantiated. Methods: The Lo mb-Scargle periodogram approach is used to search time series of gene expression to quantify the periodic behavior of every gene represented on the DNA array. The Lomb-Scargle periodogram analysis provides a direct method to treat missing values and unevenly spaced time points. We propose the combination of a Lomb-Scargle test statistic for periodicity and a multiple hypothesis testing procedure with controlled false discovery rate to detect significant periodic gene expression patterns. Results: We analyzed the Plasmodium falciparumgene expression dataset. In the Quality Control Dataset of 5080 expression patterns, we found 4112 periodic probes. In addition, we identified 243 probes with periodic expression in the Complete Dataset, which could not be examined in the original study by the FFT analysis due to an excessive number of missing values. While most periodic genes had a period of 48 h, some had a period close to 24 h. Our approach should be applicable for detection and quantification of periodic patterns in any unevenly spaced gene expression time-series data.

Original languageEnglish (US)
Pages (from-to)310-316
Number of pages7
JournalBioinformatics
Volume22
Issue number3
DOIs
StatePublished - Feb 1 2006

Fingerprint

Periodogram
Gene expression
Gene Expression
Time series
Fourier Analysis
Fast Fourier transforms
Genes
Statistics
Missing Values
Fast Fourier transform
Plasmodium
Quality control
Periodicity
Oligonucleotide Array Sequence Analysis
DNA
Probe
Quality Control
Throughput
Multiple Hypothesis Testing
Gene

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Cite this

Detecting periodic patterns in unevenly spaced gene expression time series using Lomb-Scargle periodograms. / Glynn, Earl F.; Chen, Jie; Mushegian, Arcady R.

In: Bioinformatics, Vol. 22, No. 3, 01.02.2006, p. 310-316.

Research output: Contribution to journalArticle

@article{e2ca5e3579f44150bfa19bd61306cc33,
title = "Detecting periodic patterns in unevenly spaced gene expression time series using Lomb-Scargle periodograms",
abstract = "Motivation: Periodic patterns in time series resulting from biological experiments are of great interest. The commonly used Fast Fourier Transform (FFT) algorithm is applicable only when data are evenly spaced and when no values are missing, which is not always the case in high-throughput measurements. The choice of statistic to evaluate the significance of the periodic patterns for unevenly spaced gene expression time series has not been well substantiated. Methods: The Lo mb-Scargle periodogram approach is used to search time series of gene expression to quantify the periodic behavior of every gene represented on the DNA array. The Lomb-Scargle periodogram analysis provides a direct method to treat missing values and unevenly spaced time points. We propose the combination of a Lomb-Scargle test statistic for periodicity and a multiple hypothesis testing procedure with controlled false discovery rate to detect significant periodic gene expression patterns. Results: We analyzed the Plasmodium falciparumgene expression dataset. In the Quality Control Dataset of 5080 expression patterns, we found 4112 periodic probes. In addition, we identified 243 probes with periodic expression in the Complete Dataset, which could not be examined in the original study by the FFT analysis due to an excessive number of missing values. While most periodic genes had a period of 48 h, some had a period close to 24 h. Our approach should be applicable for detection and quantification of periodic patterns in any unevenly spaced gene expression time-series data.",
author = "Glynn, {Earl F.} and Jie Chen and Mushegian, {Arcady R.}",
year = "2006",
month = "2",
day = "1",
doi = "10.1093/bioinformatics/bti789",
language = "English (US)",
volume = "22",
pages = "310--316",
journal = "Bioinformatics",
issn = "1367-4803",
publisher = "Oxford University Press",
number = "3",

}

TY - JOUR

T1 - Detecting periodic patterns in unevenly spaced gene expression time series using Lomb-Scargle periodograms

AU - Glynn, Earl F.

AU - Chen, Jie

AU - Mushegian, Arcady R.

PY - 2006/2/1

Y1 - 2006/2/1

N2 - Motivation: Periodic patterns in time series resulting from biological experiments are of great interest. The commonly used Fast Fourier Transform (FFT) algorithm is applicable only when data are evenly spaced and when no values are missing, which is not always the case in high-throughput measurements. The choice of statistic to evaluate the significance of the periodic patterns for unevenly spaced gene expression time series has not been well substantiated. Methods: The Lo mb-Scargle periodogram approach is used to search time series of gene expression to quantify the periodic behavior of every gene represented on the DNA array. The Lomb-Scargle periodogram analysis provides a direct method to treat missing values and unevenly spaced time points. We propose the combination of a Lomb-Scargle test statistic for periodicity and a multiple hypothesis testing procedure with controlled false discovery rate to detect significant periodic gene expression patterns. Results: We analyzed the Plasmodium falciparumgene expression dataset. In the Quality Control Dataset of 5080 expression patterns, we found 4112 periodic probes. In addition, we identified 243 probes with periodic expression in the Complete Dataset, which could not be examined in the original study by the FFT analysis due to an excessive number of missing values. While most periodic genes had a period of 48 h, some had a period close to 24 h. Our approach should be applicable for detection and quantification of periodic patterns in any unevenly spaced gene expression time-series data.

AB - Motivation: Periodic patterns in time series resulting from biological experiments are of great interest. The commonly used Fast Fourier Transform (FFT) algorithm is applicable only when data are evenly spaced and when no values are missing, which is not always the case in high-throughput measurements. The choice of statistic to evaluate the significance of the periodic patterns for unevenly spaced gene expression time series has not been well substantiated. Methods: The Lo mb-Scargle periodogram approach is used to search time series of gene expression to quantify the periodic behavior of every gene represented on the DNA array. The Lomb-Scargle periodogram analysis provides a direct method to treat missing values and unevenly spaced time points. We propose the combination of a Lomb-Scargle test statistic for periodicity and a multiple hypothesis testing procedure with controlled false discovery rate to detect significant periodic gene expression patterns. Results: We analyzed the Plasmodium falciparumgene expression dataset. In the Quality Control Dataset of 5080 expression patterns, we found 4112 periodic probes. In addition, we identified 243 probes with periodic expression in the Complete Dataset, which could not be examined in the original study by the FFT analysis due to an excessive number of missing values. While most periodic genes had a period of 48 h, some had a period close to 24 h. Our approach should be applicable for detection and quantification of periodic patterns in any unevenly spaced gene expression time-series data.

UR - http://www.scopus.com/inward/record.url?scp=32144464541&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=32144464541&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/bti789

DO - 10.1093/bioinformatics/bti789

M3 - Article

C2 - 16303799

AN - SCOPUS:32144464541

VL - 22

SP - 310

EP - 316

JO - Bioinformatics

JF - Bioinformatics

SN - 1367-4803

IS - 3

ER -