Fast distributed algorithms for connectivity and MST in large graphs

Gopal Pandurangan; Peter Robinson; Michele Scquizzato

doi:10.1145/2935764.2935785

Fast distributed algorithms for connectivity and MST in large graphs

Gopal Pandurangan, Peter Robinson, Michele Scquizzato

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

21 Scopus citations

Abstract

Motivated by the increasing need to understand the algorithmic foundations of distributed large-scale graph computations, we study a number of fundamental graph problems in a message-passing model for distributed computing where k ≥ 2 machines jointly perform computations on graphs with n nodes (typically, n > k). The input graph is assumed to be initially randomly partitioned among the k machines, a common implementation in many real-world systems. Communication is point-to-point, and the goal is to minimize the number of communication rounds of the computation. Our main result is an (almost) optimal distributed randomized algorithm for graph connectivity. Our algorithm runs in Õ(n/k) rounds (Õ notation hides a polylog(n) factor and an additive polylog(n) term). This improves over the best previously known bound of Õ(n/k)[Klauck et al., SODA 2015], and is optimal (up to a polylogarithmic factor) in view of an existing lower bound of Ω(n/k²). Our improved algorithm uses a bunch of techniques, including linear graph sketching, that prove useful in the design of efficient distributed graph algorithms. We then present fast randomized algorithms for computing minimum spanning trees, (approximate) min-cuts, and for many graph verification problems. All these algorithms take Õ(n/k²) rounds, and are optimal up to polylogarithmic factors. We also show an almost matching lower bound of Ω(n/k²) for many graph verification problems using lower bounds in random-partition communication complexity.

Original language	English (US)
Title of host publication	SPAA 2016 - Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures
Publisher	Association for Computing Machinery
Pages	429-438
Number of pages	10
ISBN (Electronic)	9781450342100
DOIs	https://doi.org/10.1145/2935764.2935785
State	Published - Jul 11 2016
Externally published	Yes
Event	28th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2016 - Pacific Grove, United States Duration: Jul 11 2016 → Jul 13 2016

Publication series

Name	Annual ACM Symposium on Parallelism in Algorithms and Architectures
Volume	11-13-July-2016

Conference

Conference	28th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2016
Country/Territory	United States
City	Pacific Grove
Period	7/11/16 → 7/13/16

Keywords

Distributed graph algorithms
Graph connectivity
Graph sketching
Massive graphs
Minimum spanning trees

ASJC Scopus subject areas

Software
Theoretical Computer Science
Hardware and Architecture

Access to Document

10.1145/2935764.2935785

Cite this

Pandurangan, G., Robinson, P., & Scquizzato, M. (2016). Fast distributed algorithms for connectivity and MST in large graphs. In SPAA 2016 - Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures (pp. 429-438). (Annual ACM Symposium on Parallelism in Algorithms and Architectures; Vol. 11-13-July-2016). Association for Computing Machinery. https://doi.org/10.1145/2935764.2935785

Fast distributed algorithms for connectivity and MST in large graphs. / Pandurangan, Gopal; Robinson, Peter; Scquizzato, Michele.
SPAA 2016 - Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures. Association for Computing Machinery, 2016. p. 429-438 (Annual ACM Symposium on Parallelism in Algorithms and Architectures; Vol. 11-13-July-2016).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Pandurangan, G, Robinson, P & Scquizzato, M 2016, Fast distributed algorithms for connectivity and MST in large graphs. in SPAA 2016 - Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures. Annual ACM Symposium on Parallelism in Algorithms and Architectures, vol. 11-13-July-2016, Association for Computing Machinery, pp. 429-438, 28th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2016, Pacific Grove, United States, 7/11/16. https://doi.org/10.1145/2935764.2935785

@inproceedings{6a9474f0dcd8415fadab3fa0cf31170f,

title = "Fast distributed algorithms for connectivity and MST in large graphs",

abstract = "Motivated by the increasing need to understand the algorithmic foundations of distributed large-scale graph computations, we study a number of fundamental graph problems in a message-passing model for distributed computing where k ≥ 2 machines jointly perform computations on graphs with n nodes (typically, n > k). The input graph is assumed to be initially randomly partitioned among the k machines, a common implementation in many real-world systems. Communication is point-to-point, and the goal is to minimize the number of communication rounds of the computation. Our main result is an (almost) optimal distributed randomized algorithm for graph connectivity. Our algorithm runs in {\~O}(n/k) rounds ({\~O} notation hides a polylog(n) factor and an additive polylog(n) term). This improves over the best previously known bound of {\~O}(n/k)[Klauck et al., SODA 2015], and is optimal (up to a polylogarithmic factor) in view of an existing lower bound of Ω(n/k2). Our improved algorithm uses a bunch of techniques, including linear graph sketching, that prove useful in the design of efficient distributed graph algorithms. We then present fast randomized algorithms for computing minimum spanning trees, (approximate) min-cuts, and for many graph verification problems. All these algorithms take {\~O}(n/k2) rounds, and are optimal up to polylogarithmic factors. We also show an almost matching lower bound of Ω(n/k2) for many graph verification problems using lower bounds in random-partition communication complexity.",

keywords = "Distributed graph algorithms, Graph connectivity, Graph sketching, Massive graphs, Minimum spanning trees",

author = "Gopal Pandurangan and Peter Robinson and Michele Scquizzato",

note = "Funding Information: Supported, in part, by US-Israel Binational Science Foundation grant 2008348, NSF grant CCF-1527867, and NSF grant CCF-1540512.; 28th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2016 ; Conference date: 11-07-2016 Through 13-07-2016",

year = "2016",

month = jul,

day = "11",

doi = "10.1145/2935764.2935785",

language = "English (US)",

series = "Annual ACM Symposium on Parallelism in Algorithms and Architectures",

publisher = "Association for Computing Machinery",

pages = "429--438",

booktitle = "SPAA 2016 - Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures",

}

TY - GEN

T1 - Fast distributed algorithms for connectivity and MST in large graphs

AU - Pandurangan, Gopal

AU - Robinson, Peter

AU - Scquizzato, Michele

N1 - Funding Information: Supported, in part, by US-Israel Binational Science Foundation grant 2008348, NSF grant CCF-1527867, and NSF grant CCF-1540512.

PY - 2016/7/11

Y1 - 2016/7/11

N2 - Motivated by the increasing need to understand the algorithmic foundations of distributed large-scale graph computations, we study a number of fundamental graph problems in a message-passing model for distributed computing where k ≥ 2 machines jointly perform computations on graphs with n nodes (typically, n > k). The input graph is assumed to be initially randomly partitioned among the k machines, a common implementation in many real-world systems. Communication is point-to-point, and the goal is to minimize the number of communication rounds of the computation. Our main result is an (almost) optimal distributed randomized algorithm for graph connectivity. Our algorithm runs in Õ(n/k) rounds (Õ notation hides a polylog(n) factor and an additive polylog(n) term). This improves over the best previously known bound of Õ(n/k)[Klauck et al., SODA 2015], and is optimal (up to a polylogarithmic factor) in view of an existing lower bound of Ω(n/k2). Our improved algorithm uses a bunch of techniques, including linear graph sketching, that prove useful in the design of efficient distributed graph algorithms. We then present fast randomized algorithms for computing minimum spanning trees, (approximate) min-cuts, and for many graph verification problems. All these algorithms take Õ(n/k2) rounds, and are optimal up to polylogarithmic factors. We also show an almost matching lower bound of Ω(n/k2) for many graph verification problems using lower bounds in random-partition communication complexity.

AB - Motivated by the increasing need to understand the algorithmic foundations of distributed large-scale graph computations, we study a number of fundamental graph problems in a message-passing model for distributed computing where k ≥ 2 machines jointly perform computations on graphs with n nodes (typically, n > k). The input graph is assumed to be initially randomly partitioned among the k machines, a common implementation in many real-world systems. Communication is point-to-point, and the goal is to minimize the number of communication rounds of the computation. Our main result is an (almost) optimal distributed randomized algorithm for graph connectivity. Our algorithm runs in Õ(n/k) rounds (Õ notation hides a polylog(n) factor and an additive polylog(n) term). This improves over the best previously known bound of Õ(n/k)[Klauck et al., SODA 2015], and is optimal (up to a polylogarithmic factor) in view of an existing lower bound of Ω(n/k2). Our improved algorithm uses a bunch of techniques, including linear graph sketching, that prove useful in the design of efficient distributed graph algorithms. We then present fast randomized algorithms for computing minimum spanning trees, (approximate) min-cuts, and for many graph verification problems. All these algorithms take Õ(n/k2) rounds, and are optimal up to polylogarithmic factors. We also show an almost matching lower bound of Ω(n/k2) for many graph verification problems using lower bounds in random-partition communication complexity.

KW - Distributed graph algorithms

KW - Graph connectivity

KW - Graph sketching

KW - Massive graphs

KW - Minimum spanning trees

UR - http://www.scopus.com/inward/record.url?scp=84979761867&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84979761867&partnerID=8YFLogxK

U2 - 10.1145/2935764.2935785

DO - 10.1145/2935764.2935785

M3 - Conference contribution

AN - SCOPUS:84979761867

T3 - Annual ACM Symposium on Parallelism in Algorithms and Architectures

SP - 429

EP - 438

BT - SPAA 2016 - Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and Architectures

PB - Association for Computing Machinery

T2 - 28th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA 2016

Y2 - 11 July 2016 through 13 July 2016

ER -

Fast distributed algorithms for connectivity and MST in large graphs

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this