Classification trees as proxies

Anthony Scime; Nilay Saiya; Gregg R. Murray; Steven J. Jurek

doi:10.4018/IJBAN.2015040103

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

In data analysis, when data are unattainable, it is common to select a closely related attribute as a proxy. But sometimes substitution of one attribute for another is not sufficient to satisfy the needs of the analysis. In these cases, a classification model based on one dataset can be investigated as a possible proxy for another closely related domain's dataset. If the model's structure is sufficient to classify data from the related domain, the model can be used as a proxy tree. Such a proxy tree also provides an alternative characterization of the related domain. Just as important, if the original model does not successfully classify the related domain data the domains are not as closely related as believed. This paper presents a methodology for evaluating datasets as proxies along with three cases that demonstrate the methodology and the three types of results.
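The evaluation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the one-level tree (`train_stump`), the toy records, the attribute names, and the 0.7 acceptance threshold are all hypothetical choices made for illustration. The sketch fits a model on one dataset, applies it unchanged to a related dataset, and treats the accuracy on the related data as the signal for whether the model could serve as a proxy.

```python
from collections import Counter

def train_stump(rows, labels):
    """Fit a one-level classification tree: choose the attribute whose
    per-value majority vote makes the fewest training errors."""
    best = None
    for attr in rows[0]:
        votes = {}
        for row, label in zip(rows, labels):
            votes.setdefault(row[attr], Counter())[label] += 1
        rule = {value: c.most_common(1)[0][0] for value, c in votes.items()}
        errors = sum(rule[row[attr]] != label for row, label in zip(rows, labels))
        if best is None or errors < best[0]:
            best = (errors, attr, rule)
    # Fall back to the overall majority class for unseen attribute values.
    default = Counter(labels).most_common(1)[0][0]
    return best[1], best[2], default

def classify(model, row):
    attr, rule, default = model
    return rule.get(row[attr], default)

def accuracy(model, rows, labels):
    hits = sum(classify(model, r) == y for r, y in zip(rows, labels))
    return hits / len(rows)

# Hypothetical toy data: a source domain and a related domain
# that share the same attributes and class labels.
source_rows = [
    {"income": "high", "region": "north"},
    {"income": "high", "region": "south"},
    {"income": "low", "region": "north"},
    {"income": "low", "region": "south"},
]
source_labels = ["yes", "yes", "no", "no"]

related_rows = [
    {"income": "high", "region": "south"},
    {"income": "low", "region": "north"},
    {"income": "low", "region": "south"},
    {"income": "high", "region": "north"},
]
related_labels = ["yes", "no", "no", "no"]

THRESHOLD = 0.7  # assumed cut-off, not taken from the paper

model = train_stump(source_rows, source_labels)
source_acc = accuracy(model, source_rows, source_labels)
related_acc = accuracy(model, related_rows, related_labels)
is_proxy = related_acc >= THRESHOLD

print(f"split attribute: {model[0]}")
print(f"source accuracy: {source_acc:.2f}, related accuracy: {related_acc:.2f}")
print("usable as proxy" if is_proxy else "domains less related than assumed")
```

In practice the threshold and the tree learner would be chosen to suit the analysis; a low accuracy on the related data would suggest, as the abstract notes, that the two domains are not as closely related as believed.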

Original language	English (US)
Pages (from-to)	31-44
Number of pages	14
Journal	International Journal of Business Analytics
Volume	2
Issue number	2
DOIs	https://doi.org/10.4018/IJBAN.2015040103
State	Published - Apr 1 2015
Externally published	Yes

Keywords

Classification
Data analysis
Data mining
Proxy
Social science

ASJC Scopus subject areas

Business and International Management
Strategy and Management

Cite this

@article{bb3b7ce4e7f541e7a5aa1ba3cc186694,
  title = "Classification trees as proxies",
  abstract = "In data analysis, when data are unattainable, it is common to select a closely related attribute as a proxy. But sometimes substitution of one attribute for another is not sufficient to satisfy the needs of the analysis. In these cases, a classification model based on one dataset can be investigated as a possible proxy for another closely related domain's dataset. If the model's structure is sufficient to classify data from the related domain, the model can be used as a proxy tree. Such a proxy tree also provides an alternative characterization of the related domain. Just as important, if the original model does not successfully classify the related domain data the domains are not as closely related as believed. This paper presents a methodology for evaluating datasets as proxies along with three cases that demonstrate the methodology and the three types of results.",
  keywords = "Classification, Data analysis, Data mining, Proxy, Social science",
  author = "Anthony Scime and Nilay Saiya and Murray, {Gregg R.} and Jurek, {Steven J.}",
  note = "Publisher Copyright: Copyright {\textcopyright} 2015, IGI Global.",
  year = "2015",
  month = apr,
  day = "1",
  doi = "10.4018/IJBAN.2015040103",
  language = "English (US)",
  volume = "2",
  pages = "31--44",
  journal = "International Journal of Business Analytics",
  issn = "2334-4547",
  publisher = "IGI Global Publishing",
  number = "2",
}

TY - JOUR
T1 - Classification trees as proxies
AU - Scime, Anthony
AU - Saiya, Nilay
AU - Murray, Gregg R.
AU - Jurek, Steven J.
PY - 2015/4/1
Y1 - 2015/4/1
N2 - In data analysis, when data are unattainable, it is common to select a closely related attribute as a proxy. But sometimes substitution of one attribute for another is not sufficient to satisfy the needs of the analysis. In these cases, a classification model based on one dataset can be investigated as a possible proxy for another closely related domain's dataset. If the model's structure is sufficient to classify data from the related domain, the model can be used as a proxy tree. Such a proxy tree also provides an alternative characterization of the related domain. Just as important, if the original model does not successfully classify the related domain data the domains are not as closely related as believed. This paper presents a methodology for evaluating datasets as proxies along with three cases that demonstrate the methodology and the three types of results.
AB - In data analysis, when data are unattainable, it is common to select a closely related attribute as a proxy. But sometimes substitution of one attribute for another is not sufficient to satisfy the needs of the analysis. In these cases, a classification model based on one dataset can be investigated as a possible proxy for another closely related domain's dataset. If the model's structure is sufficient to classify data from the related domain, the model can be used as a proxy tree. Such a proxy tree also provides an alternative characterization of the related domain. Just as important, if the original model does not successfully classify the related domain data the domains are not as closely related as believed. This paper presents a methodology for evaluating datasets as proxies along with three cases that demonstrate the methodology and the three types of results.
KW - Classification
KW - Data analysis
KW - Data mining
KW - Proxy
KW - Social science
UR - http://www.scopus.com/inward/record.url?scp=85046757913&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85046757913&partnerID=8YFLogxK
U2 - 10.4018/IJBAN.2015040103
DO - 10.4018/IJBAN.2015040103
M3 - Article
AN - SCOPUS:85046757913
SN - 2334-4547
VL - 2
SP - 31
EP - 44
JO - International Journal of Business Analytics
JF - International Journal of Business Analytics
IS - 2
ER -
