Compiler-based graph representations for deep learning models of code
Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review
Contributors
Abstract
In natural language processing, novel methods in deep learning, like recurrent neural networks (RNNs) on sequences of words, have been very successful. In contrast to natural languages, programming languages usually have a well-defined structure. With this structure compilers can reason about programs, using graphs such as abstract syntax trees (ASTs) or control-data flow graphs (CDFGs). In this paper, we argue that we should use these graph structures instead of sequences for learning compiler optimization tasks. To this end, we use graph neural networks (GNNs) for learning predictive compiler tasks on two representations based on ASTs and CDFGs. Experiments show that this improves upon the state-of-the-art in the task of heterogeneous OpenCL mapping, while providing orders of magnitude faster inference times, crucial for compiler optimizations. When testing on benchmark suites not included for training, our AST-based model significantly outperforms the state-of-the-art by over 12 percentage points in terms of accuracy. It is the only one to perform clearly better than a random mapping. On the task of predicting thread coarsening factors, we show that all of the methods fail to produce an overall speedup.
Details
Original language | English |
---|---|
Title of host publication | CC 2020 - Proceedings of the 29th International Conference on Compiler Construction |
Editors | Louis-Noel Pouchet, Alexandra Jimborean |
Publisher | Association for Computing Machinery, Inc |
Pages | 201-211 |
Number of pages | 11 |
ISBN (electronic) | 9781450371209 |
Publication status | Published - 22 Feb 2020 |
Peer-reviewed | Yes |
Conference
Title | 29th ACM SIGPLAN International Conference on Compiler Construction, CC 2020 |
---|---|
Duration | 22 - 23 February 2020 |
City | San Diego |
Country | United States of America |
External IDs
ORCID | /0000-0002-5007-445X/work/141545531 |
---|
Keywords
Research priority areas of TU Dresden
ASJC Scopus subject areas
Keywords
- Compilers, Deep Learning, Graphs, LLVM