An online guided tuning approach to run CNN pipelines on edge devices

Research output: Contribution to book/conference proceedings/anthology/reportConference contributionContributedpeer-review

Contributors

Abstract

Modern edge and mobile devices are equipped with powerful computing resources. These are often organized as heterogeneous multi-cores, featuring performance-asymmetric core clusters. This raises the question on how to effectively execute the inference pass of convolutional neural networks (CNN) on such devices. Existing CNN implementations on edge devices leverage offline profiling data to determine a better schedule for CNN applications. This approach requires a time consuming phase of generating a performance profile for each type of representative kernel on various core configurations available on the device, coupled with a search space exploration. We propose an online tuning technique which utilizes compile time hints and online profiling data to generate high throughput CNN pipelines. We explore core heterogeneity and compatible core-layer configurations through an online guided search. Unlike exhaustive search, we adopt an evolutionary approach with a guided starting point in order to find the solution. We show that by pruning and navigating through the complex search space using compile time hints, 79% of the tested configurations turn out to be near-optimal candidates for a throughput maximizing pipeline on NVIDIA Jetson TX2 platform.

Details

Original languageEnglish
Title of host publicationCF '21: Proceedings of the 18th ACM International Conference on Computing Frontiers
PublisherAssociation for Computing Machinery, Inc
Pages45-53
Number of pages9
ISBN (electronic)978-1-4503-8404-9
Publication statusPublished - 11 May 2021
Peer-reviewedYes

Publication series

SeriesCF: Computing Frontiers Conference

Conference

Title18th ACM International Conference on Computing Frontiers 2021, CF 2021
Duration11 - 13 May 2021
CityVirtual, Online
CountryItaly

External IDs

ORCID /0000-0002-5007-445X/work/141545523

Keywords

Research priority areas of TU Dresden

ASJC Scopus subject areas

Keywords

  • CNN pipelines, design space exploration, edge devices, evolutionary algorithm, heterogeneous core clusters, online tuning, task moldability, task parallel runtimes