A Method to Detect Differentially Methylated Loci With Next-Generation Sequencing

Hongyan Xu, Robert H. Podolsky, Duchwan Ryu, Xiaoling Wang, Shaoyong Su, Huidong Shi, Varghese George

Research output: Contribution to journalArticle

14 Citations (Scopus)

Abstract

Epigenetic changes, especially DNA methylation at CpG loci have important implications in cancer and other complex diseases. With the development of next-generation sequencing (NGS), it is feasible to generate data to interrogate the difference in methylation status for genome-wide loci using case-control design. However, a proper and efficient statistical test is lacking. There are several challenges. First, unlike methylation experiments using microarrays, where there is one measure of methylation for one individual at a particular CpG site, here we have the counts of methylation allele and unmethylation allele for each individual. Second, due to the nature of sample preparation, the measured methylation reflects the methylation status of a mixture of cells involved in sample preparation. Therefore, the underlying distribution of the measured methylation level is unknown, and a robust test is more desirable than parametric approach. Third, currently NGS measures methylation at over 2 million CpG sites. Any statistical tests have to be computationally efficient in order to be applied to the NGS data. Taking these challenges into account, we propose a test for differential methylation based on clustered data analysis by modeling the methylation counts. We performed simulations to show that it is robust under several distributions for the measured methylation levels. It has good power and is computationally efficient. Finally, we apply the test to our NGS data on chronic lymphocytic leukemia. The results indicate that it is a promising and practical test.

Original languageEnglish (US)
Pages (from-to)377-382
Number of pages6
JournalGenetic Epidemiology
Volume37
Issue number4
DOIs
StatePublished - May 1 2013

Fingerprint

Methylation
Alleles
DNA Methylation
B-Cell Chronic Lymphocytic Leukemia
Epigenomics
Genome

Keywords

  • DNA methylation
  • Differential methylation test
  • Next-generation sequencing

ASJC Scopus subject areas

  • Genetics(clinical)
  • Epidemiology

Cite this

A Method to Detect Differentially Methylated Loci With Next-Generation Sequencing. / Xu, Hongyan; Podolsky, Robert H.; Ryu, Duchwan; Wang, Xiaoling; Su, Shaoyong; Shi, Huidong; George, Varghese.

In: Genetic Epidemiology, Vol. 37, No. 4, 01.05.2013, p. 377-382.

Research output: Contribution to journalArticle

@article{d7b4519e2f79416a8d15dbc9dd99e6bd,
title = "A Method to Detect Differentially Methylated Loci With Next-Generation Sequencing",
abstract = "Epigenetic changes, especially DNA methylation at CpG loci have important implications in cancer and other complex diseases. With the development of next-generation sequencing (NGS), it is feasible to generate data to interrogate the difference in methylation status for genome-wide loci using case-control design. However, a proper and efficient statistical test is lacking. There are several challenges. First, unlike methylation experiments using microarrays, where there is one measure of methylation for one individual at a particular CpG site, here we have the counts of methylation allele and unmethylation allele for each individual. Second, due to the nature of sample preparation, the measured methylation reflects the methylation status of a mixture of cells involved in sample preparation. Therefore, the underlying distribution of the measured methylation level is unknown, and a robust test is more desirable than parametric approach. Third, currently NGS measures methylation at over 2 million CpG sites. Any statistical tests have to be computationally efficient in order to be applied to the NGS data. Taking these challenges into account, we propose a test for differential methylation based on clustered data analysis by modeling the methylation counts. We performed simulations to show that it is robust under several distributions for the measured methylation levels. It has good power and is computationally efficient. Finally, we apply the test to our NGS data on chronic lymphocytic leukemia. The results indicate that it is a promising and practical test.",
keywords = "DNA methylation, Differential methylation test, Next-generation sequencing",
author = "Hongyan Xu and Podolsky, {Robert H.} and Duchwan Ryu and Xiaoling Wang and Shaoyong Su and Huidong Shi and Varghese George",
year = "2013",
month = "5",
day = "1",
doi = "10.1002/gepi.21726",
language = "English (US)",
volume = "37",
pages = "377--382",
journal = "Genetic Epidemiology",
issn = "0741-0395",
publisher = "Wiley-Liss Inc.",
number = "4",

}

TY - JOUR

T1 - A Method to Detect Differentially Methylated Loci With Next-Generation Sequencing

AU - Xu, Hongyan

AU - Podolsky, Robert H.

AU - Ryu, Duchwan

AU - Wang, Xiaoling

AU - Su, Shaoyong

AU - Shi, Huidong

AU - George, Varghese

PY - 2013/5/1

Y1 - 2013/5/1

N2 - Epigenetic changes, especially DNA methylation at CpG loci have important implications in cancer and other complex diseases. With the development of next-generation sequencing (NGS), it is feasible to generate data to interrogate the difference in methylation status for genome-wide loci using case-control design. However, a proper and efficient statistical test is lacking. There are several challenges. First, unlike methylation experiments using microarrays, where there is one measure of methylation for one individual at a particular CpG site, here we have the counts of methylation allele and unmethylation allele for each individual. Second, due to the nature of sample preparation, the measured methylation reflects the methylation status of a mixture of cells involved in sample preparation. Therefore, the underlying distribution of the measured methylation level is unknown, and a robust test is more desirable than parametric approach. Third, currently NGS measures methylation at over 2 million CpG sites. Any statistical tests have to be computationally efficient in order to be applied to the NGS data. Taking these challenges into account, we propose a test for differential methylation based on clustered data analysis by modeling the methylation counts. We performed simulations to show that it is robust under several distributions for the measured methylation levels. It has good power and is computationally efficient. Finally, we apply the test to our NGS data on chronic lymphocytic leukemia. The results indicate that it is a promising and practical test.

AB - Epigenetic changes, especially DNA methylation at CpG loci have important implications in cancer and other complex diseases. With the development of next-generation sequencing (NGS), it is feasible to generate data to interrogate the difference in methylation status for genome-wide loci using case-control design. However, a proper and efficient statistical test is lacking. There are several challenges. First, unlike methylation experiments using microarrays, where there is one measure of methylation for one individual at a particular CpG site, here we have the counts of methylation allele and unmethylation allele for each individual. Second, due to the nature of sample preparation, the measured methylation reflects the methylation status of a mixture of cells involved in sample preparation. Therefore, the underlying distribution of the measured methylation level is unknown, and a robust test is more desirable than parametric approach. Third, currently NGS measures methylation at over 2 million CpG sites. Any statistical tests have to be computationally efficient in order to be applied to the NGS data. Taking these challenges into account, we propose a test for differential methylation based on clustered data analysis by modeling the methylation counts. We performed simulations to show that it is robust under several distributions for the measured methylation levels. It has good power and is computationally efficient. Finally, we apply the test to our NGS data on chronic lymphocytic leukemia. The results indicate that it is a promising and practical test.

KW - DNA methylation

KW - Differential methylation test

KW - Next-generation sequencing

UR - http://www.scopus.com/inward/record.url?scp=84876406263&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84876406263&partnerID=8YFLogxK

U2 - 10.1002/gepi.21726

DO - 10.1002/gepi.21726

M3 - Article

C2 - 23554163

AN - SCOPUS:84876406263

VL - 37

SP - 377

EP - 382

JO - Genetic Epidemiology

JF - Genetic Epidemiology

SN - 0741-0395

IS - 4

ER -