Landing CG on EARTH - A Case Study of Fine-Grained Multithreading on an Evolutionary Path

Kevin B. Theobald; Gagan Agrawal; Rishi Kumar; Gerd Heber; Guang R. Gao; Paul Stodghill; Keshav Pingali

doi:10.1109/SC.2000.10011

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Scopus citations

Abstract

We report on our work in developing a fine-grained multithreaded solution for the communication-intensive Conjugate Gradient (CG) problem. In our recent work, we developed a simple yet efficient program for sparse matrix-vector multiply (MVM) on a multithreaded system. This paper presents an effective mechanism for the reduction-broadcast phase, which is integrated with the sparse MVM, resulting in a scalable implementation of the complete CG application. Three major observations from our experiments on the EARTH multithreaded testbed are: (1) Our CG implementation scales well, e.g., absolute speedup is 90 on 120 processors for the NAS CG class B input. (2) Our dataflow-style reduction-broadcast network based on fine-grained multithreading is twice as fast as a serial reduction scheme on the same system. (3) Slowing down the network by a factor of 2 caused no notable degradation of overall CG performance.
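For readers unfamiliar with the structure the abstract refers to, the following is a minimal sequential sketch of CG, not the authors' EARTH implementation. It only illustrates where the sparse MVM and the two per-iteration reductions (dot products) occur, and it uses a pairwise `tree_sum` to suggest the shape of a tree-style reduction as opposed to a serial one. The CSR layout, the function names, and the tolerance are assumptions chosen for illustration.

```python
from typing import List, Tuple

# Assumed CSR layout: (values, column indices, row pointers).
CSR = Tuple[List[float], List[int], List[int]]


def spmv(a: CSR, x: List[float]) -> List[float]:
    """Sparse matrix-vector multiply y = A x for A stored in CSR form."""
    values, cols, row_ptr = a
    return [
        sum(values[k] * x[cols[k]] for k in range(row_ptr[i], row_ptr[i + 1]))
        for i in range(len(row_ptr) - 1)
    ]


def tree_sum(parts: List[float]) -> float:
    """Pairwise reduction: about log2(n) combining levels, not n - 1 serial steps."""
    if not parts:
        return 0.0
    while len(parts) > 1:
        paired = [parts[i] + parts[i + 1] for i in range(0, len(parts) - 1, 2)]
        if len(parts) % 2:
            paired.append(parts[-1])  # odd element passes through to the next level
        parts = paired
    return parts[0]


def dot(u: List[float], v: List[float]) -> float:
    """Dot product; in a parallel CG this is the reduction-broadcast step."""
    return tree_sum([a * b for a, b in zip(u, v)])


def conjugate_gradient(a: CSR, b: List[float], tol: float = 1e-10,
                       max_iter: int = 100) -> List[float]:
    """Solve A x = b for a symmetric positive definite A (illustrative only)."""
    x = [0.0] * len(b)
    r = list(b)          # residual b - A x with x = 0
    p = list(r)          # search direction
    rs_old = dot(r, r)
    for _ in range(max_iter):
        if rs_old ** 0.5 < tol:
            break
        ap = spmv(a, p)                      # sparse MVM
        alpha = rs_old / dot(p, ap)          # first reduction of the iteration
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, ap)]
        rs_new = dot(r, r)                   # second reduction of the iteration
        p = [ri + (rs_new / rs_old) * pi for ri, pi in zip(r, p)]
        rs_old = rs_new
    return x
```

Every iteration needs the scalar results of both dot products before any vector update can proceed, which is why the reduction-broadcast phase sits on the critical path of a parallel CG.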

Original language	English (US)
Title of host publication	SC 2000 - Proceedings of the 2000 ACM/IEEE Conference on Supercomputing
Publisher	Association for Computing Machinery
Pages	4
ISBN (Electronic)	0780398025
DOIs	https://doi.org/10.1109/SC.2000.10011
State	Published - 2000
Externally published	Yes
Event	2000 ACM/IEEE Conference on Supercomputing, SC 2000 - Dallas, United States; Duration: Nov 4 2000 → Nov 10 2000

Publication series

Name	Proceedings of the International Conference on Supercomputing
Volume	2000-November

Conference

Conference	2000 ACM/IEEE Conference on Supercomputing, SC 2000
Country/Territory	United States
City	Dallas
Period	11/4/00 → 11/10/00

ASJC Scopus subject areas

General Computer Science

Access to Document

10.1109/SC.2000.10011

https://dblp.org/rec/conf/sc/TheobaldAKHGSP00

Cite this

Theobald, K. B., Agrawal, G., Kumar, R., Heber, G., Gao, G. R., Stodghill, P., & Pingali, K. (2000). Landing CG on EARTH - A Case Study of Fine-Grained Multithreading on an Evolutionary Path. In SC 2000 - Proceedings of the 2000 ACM/IEEE Conference on Supercomputing (p. 4). (Proceedings of the International Conference on Supercomputing; Vol. 2000-November). Association for Computing Machinery. https://doi.org/10.1109/SC.2000.10011

@inproceedings{e282730ccc2b484490a8ae96772c29e8,
title = "Landing CG on EARTH - A Case Study of Fine-Grained Multithreading on an Evolutionary Path",
author = "Theobald, {Kevin B.} and Gagan Agrawal and Rishi Kumar and Gerd Heber and Gao, {Guang R.} and Paul Stodghill and Keshav Pingali",
note = "DBLP's bibliographic metadata records provided through http://dblp.org/search/publ/api are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.; 2000 ACM/IEEE Conference on Supercomputing, SC 2000; Conference date: 04-11-2000 Through 10-11-2000",
year = "2000",
doi = "10.1109/SC.2000.10011",
language = "English (US)",
series = "Proceedings of the International Conference on Supercomputing",
publisher = "Association for Computing Machinery",
pages = "4",
booktitle = "SC 2000 - Proceedings of the 2000 ACM/IEEE Conference on Supercomputing",
}

TY - GEN
T1 - Landing CG on EARTH - A Case Study of Fine-Grained Multithreading on an Evolutionary Path
T2 - 2000 ACM/IEEE Conference on Supercomputing, SC 2000
AU - Theobald, Kevin B.
AU - Agrawal, Gagan
AU - Kumar, Rishi
AU - Heber, Gerd
AU - Gao, Guang R.
AU - Stodghill, Paul
AU - Pingali, Keshav
N1 - DBLP's bibliographic metadata records provided through http://dblp.org/search/publ/api are distributed under a Creative Commons CC0 1.0 Universal Public Domain Dedication. Although the bibliographic metadata records are provided consistent with CC0 1.0 Dedication, the content described by the metadata records is not. Content may be subject to copyright, rights of privacy, rights of publicity and other restrictions.
PY - 2000
Y1 - 2000
UR - http://www.scopus.com/inward/record.url?scp=77955203874&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77955203874&partnerID=8YFLogxK
U2 - 10.1109/SC.2000.10011
DO - 10.1109/SC.2000.10011
M3 - Conference contribution
T3 - Proceedings of the International Conference on Supercomputing
SP - 4
BT - SC 2000 - Proceedings of the 2000 ACM/IEEE Conference on Supercomputing
PB - Association for Computing Machinery
Y2 - 4 November 2000 through 10 November 2000
ER -