Lattice Boltzmann Simulations at Petascale on Multi-GPU Systems with Asynchronous Data Transfer and Strictly Enforced Memory Read Alignment

Fredrik Robertsén, Jan Westerholm, Keijo Mattila

Tutkimustuotos: Artikkeli kirjassa/raportissa/konferenssijulkaisussaKonferenssiartikkeliTieteellinenvertaisarvioitu

8 Sitaatiot (Scopus)

Abstrakti

The lattice Boltzmann method is a well-established numerical approach for complex fluid flow simulations. Recently general-purpose graphics processing units have become accessible as high-performance computing resources at large-scale. We report on implementing a lattice Boltzmann solver for multi-GPU systems that achieves 0.69 PFLOPS performance on 16384 GPUs. In addition to optimizing the data layout on the GPUs and eliminating the halo sites, we make use of the possibility to overlap data transfer between the host CPU and the device GPU with computing on the GPU. We simulate flow in porous media and measure both strong and weak scaling performance with the emphasis being on a large scale simulation using realistic input data.
AlkuperäiskieliEi tiedossa
OtsikkoParallel, Distributed and Network-Based Processing (PDP), 2015 23rd Euromicro International Conference on
ToimittajatDaneshtalab Masoud, Aldinucci Marco, Leppänen Ville, Lilius Johan, Brorsson Mats
KustantajaIEEE
Sivut604–609
ISBN (painettu)978-1-4799-8492-3
DOI - pysyväislinkit
TilaJulkaistu - 2015
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisuussa
TapahtumaEuromicro International Conference on Parallel, Distributed and Network-Based Processing - 23rd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP 2015)
Kesto: 4 maaliskuuta 20156 maaliskuuta 2015

Konferenssi

KonferenssiEuromicro International Conference on Parallel, Distributed and Network-Based Processing
Ajanjakso04/03/1506/03/15

Keywords

  • GPU
  • Lattice Boltzmann methods

Viittausmuodot