A statistical change point model approach for the detection of DNA copy number variations in array CGH data

Jie Chen, Yu Ping Wang

Research output: Contribution to journalArticle

36 Scopus citations

Abstract

Array comparative genomic hybridization (aCGH) provides a high-resolution and high-throughput technique for screening of copy number variations (CNVs) within the entire genome. This technique, compared to the conventional CGH, significantly improves the identification of chromosomal abnormalities. However, due to the random noise inherited in the imaging and hybridization process, identifying statistically significant DNA copy number changes in aCGH data is challenging. We propose a novel approach that uses the mean and variance change point model (MVCM) to detect CNVs or breakpoints in aCGH data sets. We derive an approximate p-value for the test statistic and also give the estimate of the locus of the DNA copy number change. We carry out simulation studies to evaluate the accuracy of the estimate and the p-value formulation. These simulation results show that the approach is effective in identifying copy number changes. The approach is also tested on fibroblast cancer cell line data, breast tumor cell line data, and breast cancer cell line aCGH data sets that are publicly available. Changes that have not been identified by the circular binary segmentation (CBS) method but are biologically verified are detected by our approach on these cell lines with higher sensitivity and specificity than CBS.

Original languageEnglish (US)
Article number4695823
Pages (from-to)529-541
Number of pages13
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Volume6
Issue number4
DOIs
StatePublished - Oct 2009

Keywords

  • ACGH microarray data
  • CNVs
  • DNA copy numbers
  • Gene expression
  • Statistical hypothesis testing

ASJC Scopus subject areas

  • Biotechnology
  • Genetics
  • Applied Mathematics

Fingerprint Dive into the research topics of 'A statistical change point model approach for the detection of DNA copy number variations in array CGH data'. Together they form a unique fingerprint.

  • Cite this