Big data causing big (TLB) problems: Taming random memory accesses on the GPU

Tomas Karnagel; Tal Ben-Nun; Matthias Werner; Dirk Habich; Wolfgang Lehner

doi:doi:10.1145/3076113.3076115

Big data causing big (TLB) problems: Taming random memory accesses on the GPU

Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review

Contributors

Tomas Karnagel - , Chair of Databases (Author)
Tal Ben-Nun - , Hebrew University of Jerusalem (Author)
Matthias Werner - , Center for Information Services and High Performance Computing (ZIH) (Author)
Dirk Habich - , Chair of Databases (Author)
Wolfgang Lehner - , Chair of Databases (Author)

Abstract

GPUS are increasingly adopted for large-scale database processing, where data accesses represent the major part of the computation. If the data accesses are irregular, like hash table accesses or random sampling, the GPU performance can suffer. Especially when scaling such accesses beyond 2GB of data, a performance decrease of an order of magnitude is encountered. This paper analyzes the source of the slowdown through extensive micro-benchmarking, attributing the root cause to the Translation Look aside Buffer (TLB). Using the micro-benchmarks, the TLB hierarchy and structure are fully analyzed on two different GPU architectures, identifying never before- published TLB sizes that can be used for efficient large-scale application tuning. Based on the gained knowledge, we propose a TLB-conscious approach to mitigate the slowdown for algorithms with irregular memory access. The proposed approach is applied to two fundamental database operations - random sampling and hash-based grouping - showing that the slowdown can be dramatically reduced, and resulting in a performance increase of up to 13x.

Details

Original language	English
Title of host publication	DAMON '17: Proceedings of the 13th International Workshop on Data Management on New Hardware
Publisher	Association for Computing Machinery (ACM), New York
Number of pages	10
ISBN (print)	978-1-4503-5025-9
Publication status	Published - 14 May 2017
Peer-reviewed	Yes

Publication series

Series	MOD: International Conference on Management of Data (DaMoN)
ISSN	0730-8078

Conference

Title	13th International Workshop on Data Management on New Hardware, DAMON 2017
Duration	14 - 19 May 2017
City	Chicago
Country	United States of America

External IDs

Scopus	85021772296
ORCID	/0000-0001-8107-2775/work/142253579

Research Portal of the TU Dresden

Big data causing big (TLB) problems: Taming random memory accesses on the GPU

Contributors

Abstract

Details

Publication series

Conference

External IDs

Keywords

ASJC Scopus subject areas

Keywords