Determining a Device Crossover Point in CPU/GPU Systems for Streaming Applications

A4 Konferenspublikationer

Interna författare/redaktörer

Publikationens författare: Sudeep Kanur, Wictor Lund, Leonidas Tsiopoulos, Johan Lilius
Redaktörer: Jose Moura, Dapeng Oliver Wu
Publiceringsår: 2015
Förläggare: IEEE Global Conference on Signal and Information Processing
Moderpublikationens namn: 2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP 2015)
Artikelns första sida, sidnummer: 100
Artikelns sista sida, sidnummer: 106
ISBN: 978-1-4799-7590-7


In streaming dataflow applications such as video conferencing systems, the applications are often subjected to traffic occurring in bursts. As systems consisting of a CPU and a GPU are becoming ubiquitous, efficient utilisation of such platforms for handling bursts of data becomes an interesting problem. For GPUs to be efficient, the chunk size of data to process must be large. The bursty nature of the traffic due to the underlying network connections may result in an unacceptable increase in latency if the large chunk size requirement is held strict. We study these systems in the context of the dataflow programming language Reconfigurable Video Coding-Cal Actor Language (RVC-CAL) with a streaming dataflow perspective. To address the aforementioned issue, a crucial step is to determine a device crossover point defined as the chunk size at which the decision to switch to the other device can be made. This is predicted quantitatively using an analytical model of CPUs and GPUs whose parameters are statically determined and later tuned during runtime. In this paper, we validate this model against experimentally measured values for kernels generated for a streaming dataflow application and show that the crossover point determined by the model lies within the range predicted by the measurements.

Senast uppdaterad 2020-20-09 vid 05:12