Cooperative computing with fragmentable and mergeable groups

Chryssis Georgiou, Alex A. Shvartsman

Research output: Contribution to journalArticlepeer-review

Abstract

This work considers the problem of performing a set of N tasks on a set of P cooperating message-passing processors (P ≤ N). The processors use a group communication service (GCS) to coordinate their activity in the setting where dynamic changes in the underlying network topology cause the processor groups to change over time. GCSs have been recognized as effective building blocks for fault-tolerant applications in such settings. Our results explore the efficiency of fault-tolerant cooperative computation using GCSs. The original investigation of this area by (Dolev et al., Dynamic load balancing with group communication, in: Proc. of the 6th International Colloquium on Structural Information and Communication Complexity, 1999) focused on competitive lower bounds, non-redundant task allocation schemes and work-efficient algorithms in the presence of fragmentation regroupings. In this work we investigate work-efficient and message-efficient algorithms for fragmentation and merge regroupings. We present an algorithm that uses GCSs and implements a coordinator-based strategy. For the analysis of our algorithm we introduce the notion of view-graphs that represent the partially-ordered view evolution history witnessed by the processors. For fragmentations and merges, the work of the algorithm (defined as the worst case total number of task executions counting multiplicities) is not more than min{N · f + N, N · P}, and the message complexity is no worse than 4(N · f + N + P · m), where f and m denote the number of new groups created by fragmentations and merges, respectively. Note that the constants are very small and that, interestingly, while the work efficiency depends on the number of groups f created as the result of fragmentations, work does not depend on the number of groups m created as the result of merges.

Original languageEnglish (US)
Pages (from-to)211-235
Number of pages25
JournalJournal of Discrete Algorithms
Volume1
Issue number2
DOIs
StatePublished - Apr 2003
Externally publishedYes

Keywords

  • Communication
  • Complexity
  • Distributed algorithms
  • Group communication
  • Work

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Discrete Mathematics and Combinatorics
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Cooperative computing with fragmentable and mergeable groups'. Together they form a unique fingerprint.

Cite this