AD for an Array Language with Nested Parallelism

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningpeer review

2 Citationer (Scopus)
18 Downloads (Pure)

Abstract

We present a technique for applying reverse mode automatic differentiation (AD) on a non-recursive second-order functional array language that supports nested parallelism and is primarily aimed at efficient GPU execution. The key idea is to eliminate the need for a tape by relying on redundant execution to bring into each new scope all program variables that may be needed by the differentiated code. Efficient execution is enabled by the observation that perfectly nested scopes do not introduce re-execution and that such perfect nests can be readily produced by application of known compiler transformations. Our technique differentiates loops and bulk-parallel operators-e.g., map, reduce(-by-index), scan, and scatter-by specific rewrite rules and aggressively optimizes the resulting nested-parallel code. We report an evaluation that compares with established AD solutions and demonstrates competitive performance on ten common benchmarks from recent applied AD literature.

OriginalsprogEngelsk
TitelProceedings of SC 2022 : International Conference for High Performance Computing, Networking, Storage and Analysis
Antal sider15
ForlagIEEE Computer Society Press
Publikationsdato2022
ISBN (Elektronisk)9781665454445
DOI
StatusUdgivet - 2022
Begivenhed2022 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2022 - Dallas, USA
Varighed: 13 nov. 202218 nov. 2022

Konference

Konference2022 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2022
Land/OmrådeUSA
ByDallas
Periode13/11/202218/11/2022
SponsorACM's Special Interest Group on High Performance Computing (SIGHPC), Association for Computing Machinery, IEEE Computer Society, IEEE's Technical Committee on High Performance Computing (TCHPC)
NavnInternational Conference for High Performance Computing, Networking, Storage and Analysis, SC
Vol/bind2022-November
ISSN2167-4329

Bibliografisk note

Funding Information:
This work has been supported by the Independent Research Fund Denmark (DFF) under the grants Deep Probabilistic Programming for Protein Structure Prediction and FUTHARK: Functional Technology for High-performance Architectures, and by the UCPH Data+ grant: High-Performance Land Change Assessment.

Publisher Copyright:
© 2022 IEEE.

Citationsformater