Scheduling concurrent applications on a cluster of CPU-GPU nodes

Vignesh T. Ravi, Michela Becchi, Wei Jiang, Gagan Agrawal, Srimat Chakradhar

Research output: Contribution to journalArticlepeer-review

27 Scopus citations

Abstract

Heterogeneous architectures comprising a multi-core CPU and many-core GPU(s) are increasingly being used within cluster and cloud environments. In this paper, we study the problem of optimizing the overall throughput of a set of applications deployed on a cluster of such heterogeneous nodes. We consider two different scheduling formulations. In the first formulation, we consider jobs that can be executed on either the GPU or the CPU of a single node. In the second formulation, we consider jobs that can be executed on the CPU, GPU, or both, of any number of nodes in the system. We have developed scheduling schemes addressing both of the problems. In our evaluation, we first show that the schemes proposed for first formulation outperform a blind round-robin scheduler and approximate the performances of an ideal scheduler that involves an impractical exhaustive exploration of all possible schedules. Next, we show that the scheme proposed for the second formulation outperforms the best of existing schemes for heterogeneous clusters, TORQUE and MCT, by up to 42%. Additionally, we evaluate the robustness of our proposed scheduling policies under inaccurate inputs to account for real execution scenarios. We show that, with up to 20% of inaccuracy in the input, the degradation in performance is marginal (less than 7%) on the average.

Original languageEnglish (US)
Pages (from-to)2262-2271
Number of pages10
JournalFuture Generation Computer Systems
Volume29
Issue number8
DOIs
StatePublished - 2013
Externally publishedYes
Event2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) - Ottawa, Canada
Duration: May 13 2012May 16 2012

Keywords

  • CPU-GPU systems
  • Scheduling

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Scheduling concurrent applications on a cluster of CPU-GPU nodes'. Together they form a unique fingerprint.

Cite this