Statistical Challenges in Analyzing Methylation and Long-Range Chromosomal Interaction Data

Zhaohui Qin, Ben Li, Karen N. Conneely, Hao Wu, Ming Hu, Deepak Nag Ayyala, Yongseok Park, Victor X. Jin, Fangyuan Zhang, Han Zhang, Li Li, Shili Lin

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

With the rapid development of high-throughput technologies such as array and next generation sequencing, genome-wide, nucleotide-resolution epigenomic data are increasingly available. In recent years, there has been particular interest in data on DNA methylation and 3-dimensional (3D) chromosomal organization, which are believed to hold keys to understand biological mechanisms, such as transcription regulation, that are closely linked to human health and diseases. However, small sample size, complicated correlation structure, substantial noise, biases, and uncertainties, all present difficulties for performing statistical inference. In this review, we present an overview of the new technologies that are frequently utilized in studying DNA methylation and 3D chromosomal organization. We focus on reviewing recent developments in statistical methodologies designed for better interrogating epigenomic data, pointing out statistical challenges facing the field whenever appropriate.

Original languageEnglish (US)
Pages (from-to)284-309
Number of pages26
JournalStatistics in Biosciences
Volume8
Issue number2
DOIs
StatePublished - Oct 1 2016

Fingerprint

Methylation
Long-range Interactions
DNA Methylation
Epigenomics
Technology
Transcription
Sample Size
Uncertainty
Noise
Nucleotides
Genes
Correlation Structure
Throughput
Small Sample Size
Health
Genome
Statistical Inference
Sequencing
High Throughput
Methodology

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry, Genetics and Molecular Biology (miscellaneous)

Cite this

Statistical Challenges in Analyzing Methylation and Long-Range Chromosomal Interaction Data. / Qin, Zhaohui; Li, Ben; Conneely, Karen N.; Wu, Hao; Hu, Ming; Ayyala, Deepak Nag; Park, Yongseok; Jin, Victor X.; Zhang, Fangyuan; Zhang, Han; Li, Li; Lin, Shili.

In: Statistics in Biosciences, Vol. 8, No. 2, 01.10.2016, p. 284-309.

Research output: Contribution to journalArticle

Qin, Z, Li, B, Conneely, KN, Wu, H, Hu, M, Ayyala, DN, Park, Y, Jin, VX, Zhang, F, Zhang, H, Li, L & Lin, S 2016, 'Statistical Challenges in Analyzing Methylation and Long-Range Chromosomal Interaction Data', Statistics in Biosciences, vol. 8, no. 2, pp. 284-309. https://doi.org/10.1007/s12561-016-9145-0
Qin, Zhaohui ; Li, Ben ; Conneely, Karen N. ; Wu, Hao ; Hu, Ming ; Ayyala, Deepak Nag ; Park, Yongseok ; Jin, Victor X. ; Zhang, Fangyuan ; Zhang, Han ; Li, Li ; Lin, Shili. / Statistical Challenges in Analyzing Methylation and Long-Range Chromosomal Interaction Data. In: Statistics in Biosciences. 2016 ; Vol. 8, No. 2. pp. 284-309.
@article{85f6111747c24d6c840ad3bcd86ad3be,
title = "Statistical Challenges in Analyzing Methylation and Long-Range Chromosomal Interaction Data",
abstract = "With the rapid development of high-throughput technologies such as array and next generation sequencing, genome-wide, nucleotide-resolution epigenomic data are increasingly available. In recent years, there has been particular interest in data on DNA methylation and 3-dimensional (3D) chromosomal organization, which are believed to hold keys to understand biological mechanisms, such as transcription regulation, that are closely linked to human health and diseases. However, small sample size, complicated correlation structure, substantial noise, biases, and uncertainties, all present difficulties for performing statistical inference. In this review, we present an overview of the new technologies that are frequently utilized in studying DNA methylation and 3D chromosomal organization. We focus on reviewing recent developments in statistical methodologies designed for better interrogating epigenomic data, pointing out statistical challenges facing the field whenever appropriate.",
author = "Zhaohui Qin and Ben Li and Conneely, {Karen N.} and Hao Wu and Ming Hu and Ayyala, {Deepak Nag} and Yongseok Park and Jin, {Victor X.} and Fangyuan Zhang and Han Zhang and Li Li and Shili Lin",
year = "2016",
month = "10",
day = "1",
doi = "10.1007/s12561-016-9145-0",
language = "English (US)",
volume = "8",
pages = "284--309",
journal = "Statistics in Biosciences",
issn = "1867-1764",
publisher = "Springer New York",
number = "2",

}

TY - JOUR

T1 - Statistical Challenges in Analyzing Methylation and Long-Range Chromosomal Interaction Data

AU - Qin, Zhaohui

AU - Li, Ben

AU - Conneely, Karen N.

AU - Wu, Hao

AU - Hu, Ming

AU - Ayyala, Deepak Nag

AU - Park, Yongseok

AU - Jin, Victor X.

AU - Zhang, Fangyuan

AU - Zhang, Han

AU - Li, Li

AU - Lin, Shili

PY - 2016/10/1

Y1 - 2016/10/1

N2 - With the rapid development of high-throughput technologies such as array and next generation sequencing, genome-wide, nucleotide-resolution epigenomic data are increasingly available. In recent years, there has been particular interest in data on DNA methylation and 3-dimensional (3D) chromosomal organization, which are believed to hold keys to understand biological mechanisms, such as transcription regulation, that are closely linked to human health and diseases. However, small sample size, complicated correlation structure, substantial noise, biases, and uncertainties, all present difficulties for performing statistical inference. In this review, we present an overview of the new technologies that are frequently utilized in studying DNA methylation and 3D chromosomal organization. We focus on reviewing recent developments in statistical methodologies designed for better interrogating epigenomic data, pointing out statistical challenges facing the field whenever appropriate.

AB - With the rapid development of high-throughput technologies such as array and next generation sequencing, genome-wide, nucleotide-resolution epigenomic data are increasingly available. In recent years, there has been particular interest in data on DNA methylation and 3-dimensional (3D) chromosomal organization, which are believed to hold keys to understand biological mechanisms, such as transcription regulation, that are closely linked to human health and diseases. However, small sample size, complicated correlation structure, substantial noise, biases, and uncertainties, all present difficulties for performing statistical inference. In this review, we present an overview of the new technologies that are frequently utilized in studying DNA methylation and 3D chromosomal organization. We focus on reviewing recent developments in statistical methodologies designed for better interrogating epigenomic data, pointing out statistical challenges facing the field whenever appropriate.

UR - http://www.scopus.com/inward/record.url?scp=84960105545&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84960105545&partnerID=8YFLogxK

U2 - 10.1007/s12561-016-9145-0

DO - 10.1007/s12561-016-9145-0

M3 - Article

AN - SCOPUS:84960105545

VL - 8

SP - 284

EP - 309

JO - Statistics in Biosciences

JF - Statistics in Biosciences

SN - 1867-1764

IS - 2

ER -