Abstract
In data analysis, when data are unattainable, it is common to select a closely related attribute as a proxy. But sometimes substitution of one attribute for another is not sufficient to satisfy the needs of the analysis. In these cases, a classification model based on one dataset can be investigated as a possible proxy for another closely related domain's dataset. If the model's structure is sufficient to classify data from the related domain, the model can be used as a proxy tree. Such a proxy tree also provides an alternative characterization of the related domain. Just as important, if the original model does not successfully classify the related domain data the domains are not as closely related as believed. This paper presents a methodology for evaluating datasets as proxies along with three cases that demonstrate the methodology and the three types of results.
Original language | English (US) |
---|---|
Pages (from-to) | 31-44 |
Number of pages | 14 |
Journal | International Journal of Business Analytics |
Volume | 2 |
Issue number | 2 |
DOIs | |
State | Published - Apr 1 2015 |
Externally published | Yes |
Keywords
- Classification
- Data analysis
- Data mining
- Proxy
- Social science
ASJC Scopus subject areas
- Business and International Management
- Strategy and Management