Estimating reliability of workers for cooperative distributed computing

Seda Davtyan; Kishori M. Konwar; Alexander A. Shvartsman

doi:10.1109/ISPDC.2013.22

Estimating reliability of workers for cooperative distributed computing

Seda Davtyan, Kishori M. Konwar, Alexander A. Shvartsman

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

Internet supercomputing is an approach to solving partitionable, computation-intensive problems by harnessing the power of a vast number of interconnected computers. Forthe problem of using network supercomputing to perform a large collection of independent tasks, prior work introduced a decentralized approach and provided randomized synchronousalgorithms that perform all tasks correctly with high probability, while dealing with misbehaving or crash-prone processors. The main weaknesses of existing algorithms is that they assume either that the average probability of a non-crashed processor returningincorrect results is inferior to 1/2, or that the probability of returning incorrect results is known to each processor. Here we present a randomized synchronous distributed algorithm that tightly estimates the probability of each processor returning correct results. Starting with the set P of n processors, let F be the set of processors that crash. Our algorithm estimates the probability p-i of returning a correct result for each processor i \in P - F, making the estimates available to all these processors. The estimation is based on the (e, d)-approximation, where each estimated probability pi of p-i obeys the bound Pr[pi(1 - e) = pi=pi(1+e)]>1 - d, for any constants d>0 and e>0 chosen by the user. An important aspect of this algorithm is that each processor terminates without global coordination. We assess the efficiency of the algorithm in three adversarial models as follows. For the model where the number of non-crashed processors |P - F | is linearly bounded the time complexity T(n) of the algorithm is O(log n), work complexity W(n) is O(n log n), and message complexity M(n) is O(n log2 n). For the model where |P - F | is bounded by a fractional polynomial we have T(n) = O(n1 - a logn loglogn), W(n) = O(nlogn loglogn), and M(n) = O(nlog2n loglogn). For the model where |P - F| is bounded by a poly-logarithm we have T(n) = O(n), W(n) = O(n polylog n), and M(n) = O(n log2 n polylog n). All bounds are shown to hold with high probability.

Original language	English (US)
Title of host publication	Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013
Pages	101-108
Number of pages	8
DOIs	https://doi.org/10.1109/ISPDC.2013.22
State	Published - 2013
Externally published	Yes
Event	2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013 - Bucharest, Romania Duration: Jun 27 2013 → Jun 30 2013

Publication series

Name	Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013

Conference

Conference	2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013
Country/Territory	Romania
City	Bucharest
Period	6/27/13 → 6/30/13

Keywords

Distributed Computing
Internet Supercomputing
Worker Reliability Estimation

ASJC Scopus subject areas

Software

Access to Document

10.1109/ISPDC.2013.22

Cite this

Davtyan, S., Konwar, K. M., & Shvartsman, A. A. (2013). Estimating reliability of workers for cooperative distributed computing. In Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013 (pp. 101-108). Article 6663570 (Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013). https://doi.org/10.1109/ISPDC.2013.22

Estimating reliability of workers for cooperative distributed computing. / Davtyan, Seda; Konwar, Kishori M.; Shvartsman, Alexander A.
Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013. 2013. p. 101-108 6663570 (Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Davtyan, S, Konwar, KM & Shvartsman, AA 2013, Estimating reliability of workers for cooperative distributed computing. in Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013., 6663570, Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013, pp. 101-108, 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013, Bucharest, Romania, 6/27/13. https://doi.org/10.1109/ISPDC.2013.22

Davtyan S, Konwar KM, Shvartsman AA. Estimating reliability of workers for cooperative distributed computing. In Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013. 2013. p. 101-108. 6663570. (Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013). doi: 10.1109/ISPDC.2013.22

@inproceedings{bfcd461b0e3d434890b40c298df92502,

title = "Estimating reliability of workers for cooperative distributed computing",

abstract = "Internet supercomputing is an approach to solving partitionable, computation-intensive problems by harnessing the power of a vast number of interconnected computers. Forthe problem of using network supercomputing to perform a large collection of independent tasks, prior work introduced a decentralized approach and provided randomized synchronousalgorithms that perform all tasks correctly with high probability, while dealing with misbehaving or crash-prone processors. The main weaknesses of existing algorithms is that they assume either that the average probability of a non-crashed processor returningincorrect results is inferior to 1/2, or that the probability of returning incorrect results is known to each processor. Here we present a randomized synchronous distributed algorithm that tightly estimates the probability of each processor returning correct results. Starting with the set P of n processors, let F be the set of processors that crash. Our algorithm estimates the probability p-i of returning a correct result for each processor i \in P - F, making the estimates available to all these processors. The estimation is based on the (e, d)-approximation, where each estimated probability pi of p-i obeys the bound Pr[pi(1 - e) = pi=pi(1+e)]>1 - d, for any constants d>0 and e>0 chosen by the user. An important aspect of this algorithm is that each processor terminates without global coordination. We assess the efficiency of the algorithm in three adversarial models as follows. For the model where the number of non-crashed processors |P - F | is linearly bounded the time complexity T(n) of the algorithm is O(log n), work complexity W(n) is O(n log n), and message complexity M(n) is O(n log2 n). For the model where |P - F | is bounded by a fractional polynomial we have T(n) = O(n1 - a logn loglogn), W(n) = O(nlogn loglogn), and M(n) = O(nlog2n loglogn). For the model where |P - F| is bounded by a poly-logarithm we have T(n) = O(n), W(n) = O(n polylog n), and M(n) = O(n log2 n polylog n). All bounds are shown to hold with high probability.",

keywords = "Distributed Computing, Internet Supercomputing, Worker Reliability Estimation",

author = "Seda Davtyan and Konwar, {Kishori M.} and Shvartsman, {Alexander A.}",

note = "DBLP License: DBLP's bibliographic metadata records provided through http://dblp.org/ are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.; 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013 ; Conference date: 27-06-2013 Through 30-06-2013",

year = "2013",

doi = "10.1109/ISPDC.2013.22",

language = "English (US)",

isbn = "9780769550183",

series = "Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013",

pages = "101--108",

booktitle = "Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013",

}

TY - GEN

T1 - Estimating reliability of workers for cooperative distributed computing

AU - Davtyan, Seda

AU - Konwar, Kishori M.

AU - Shvartsman, Alexander A.

N1 - DBLP License: DBLP's bibliographic metadata records provided through http://dblp.org/ are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.

PY - 2013

Y1 - 2013

N2 - Internet supercomputing is an approach to solving partitionable, computation-intensive problems by harnessing the power of a vast number of interconnected computers. Forthe problem of using network supercomputing to perform a large collection of independent tasks, prior work introduced a decentralized approach and provided randomized synchronousalgorithms that perform all tasks correctly with high probability, while dealing with misbehaving or crash-prone processors. The main weaknesses of existing algorithms is that they assume either that the average probability of a non-crashed processor returningincorrect results is inferior to 1/2, or that the probability of returning incorrect results is known to each processor. Here we present a randomized synchronous distributed algorithm that tightly estimates the probability of each processor returning correct results. Starting with the set P of n processors, let F be the set of processors that crash. Our algorithm estimates the probability p-i of returning a correct result for each processor i \in P - F, making the estimates available to all these processors. The estimation is based on the (e, d)-approximation, where each estimated probability pi of p-i obeys the bound Pr[pi(1 - e) = pi=pi(1+e)]>1 - d, for any constants d>0 and e>0 chosen by the user. An important aspect of this algorithm is that each processor terminates without global coordination. We assess the efficiency of the algorithm in three adversarial models as follows. For the model where the number of non-crashed processors |P - F | is linearly bounded the time complexity T(n) of the algorithm is O(log n), work complexity W(n) is O(n log n), and message complexity M(n) is O(n log2 n). For the model where |P - F | is bounded by a fractional polynomial we have T(n) = O(n1 - a logn loglogn), W(n) = O(nlogn loglogn), and M(n) = O(nlog2n loglogn). For the model where |P - F| is bounded by a poly-logarithm we have T(n) = O(n), W(n) = O(n polylog n), and M(n) = O(n log2 n polylog n). All bounds are shown to hold with high probability.

AB - Internet supercomputing is an approach to solving partitionable, computation-intensive problems by harnessing the power of a vast number of interconnected computers. Forthe problem of using network supercomputing to perform a large collection of independent tasks, prior work introduced a decentralized approach and provided randomized synchronousalgorithms that perform all tasks correctly with high probability, while dealing with misbehaving or crash-prone processors. The main weaknesses of existing algorithms is that they assume either that the average probability of a non-crashed processor returningincorrect results is inferior to 1/2, or that the probability of returning incorrect results is known to each processor. Here we present a randomized synchronous distributed algorithm that tightly estimates the probability of each processor returning correct results. Starting with the set P of n processors, let F be the set of processors that crash. Our algorithm estimates the probability p-i of returning a correct result for each processor i \in P - F, making the estimates available to all these processors. The estimation is based on the (e, d)-approximation, where each estimated probability pi of p-i obeys the bound Pr[pi(1 - e) = pi=pi(1+e)]>1 - d, for any constants d>0 and e>0 chosen by the user. An important aspect of this algorithm is that each processor terminates without global coordination. We assess the efficiency of the algorithm in three adversarial models as follows. For the model where the number of non-crashed processors |P - F | is linearly bounded the time complexity T(n) of the algorithm is O(log n), work complexity W(n) is O(n log n), and message complexity M(n) is O(n log2 n). For the model where |P - F | is bounded by a fractional polynomial we have T(n) = O(n1 - a logn loglogn), W(n) = O(nlogn loglogn), and M(n) = O(nlog2n loglogn). For the model where |P - F| is bounded by a poly-logarithm we have T(n) = O(n), W(n) = O(n polylog n), and M(n) = O(n log2 n polylog n). All bounds are shown to hold with high probability.

KW - Distributed Computing

KW - Internet Supercomputing

KW - Worker Reliability Estimation

UR - http://www.scopus.com/inward/record.url?scp=84892768402&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84892768402&partnerID=8YFLogxK

U2 - 10.1109/ISPDC.2013.22

DO - 10.1109/ISPDC.2013.22

M3 - Conference contribution

AN - SCOPUS:84892768402

SN - 9780769550183

T3 - Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013

SP - 101

EP - 108

BT - Proceedings - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013

T2 - 2013 IEEE 12th International Symposium on Parallel and Distributed Computing, ISPDC 2013

Y2 - 27 June 2013 through 30 June 2013

ER -

Estimating reliability of workers for cooperative distributed computing

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this