Data reduction using a discrete wavelet transform in discriminant analysis of very high dimensionality data

Yinsheng Qu, Bao Ling Adam, Mark Thornquist, John D. Potter, Mary Lou Thompson, Yutaka Yasui, John Davis, Paul F. Schellhammer, Lisa Cazares, Mary Ann Clements, George L. Wright, Ziding Feng

Research output: Contribution to journalArticlepeer-review

63 Scopus citations

Abstract

We present a method of data reduction using a wavelet transform in discriminant analysis when the number of variables is much greater than the number of observations. The method is illustrated with a prostate cancer study, where the sample size is 248, and the number of variables is 48,538 (generated using the ProteinChip technology). Using a discrete wavelet transform, the 48,538 data points are represented by 1271 wavelet coefficients. Information criteria identified 11 of the 1271 wavelet coefficients with the highest discriminatory power. The linear classifier with the 11 wavelet coefficients detected prostate cancer in a separate test set with a sensitivity of 97% and specificity of 100%.

Original languageEnglish (US)
Pages (from-to)143-151
Number of pages9
JournalBiometrics
Volume59
Issue number1
DOIs
StatePublished - Mar 2003
Externally publishedYes

Keywords

  • Area under the ROC curve
  • Divergence
  • Fisher discriminant analysis
  • Kullback-Leibler information
  • Mahalanobis distance
  • Principal components analysis

ASJC Scopus subject areas

  • Statistics and Probability
  • General Biochemistry, Genetics and Molecular Biology
  • General Immunology and Microbiology
  • General Agricultural and Biological Sciences
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Data reduction using a discrete wavelet transform in discriminant analysis of very high dimensionality data'. Together they form a unique fingerprint.

Cite this