Automatic Differentiation of Programs with Discrete Randomness
Paper i proceeding, 2022

Automatic differentiation (AD), a technique for constructing new programs which compute the derivative of an original program, has become ubiquitous throughout scientific computing and deep learning due to the improved performance afforded by gradient-based optimization. However, AD systems have been restricted to the subset of programs that have a continuous dependence on parameters. Programs that have discrete stochastic behaviors governed by distribution parameters, such as flipping a coin with probability p of being heads, pose a challenge to these systems because the connection between the result (heads vs tails) and the parameters (p) is fundamentally discrete. In this paper we develop a new reparameterization-based methodology that allows for generating programs whose expectation is the derivative of the expectation of the original program. We showcase how this method gives an unbiased and low-variance estimator which is as automated as traditional AD mechanisms. We demonstrate unbiased forward-mode AD of discrete-time Markov chains, agent-based models such as Conway's Game of Life, and unbiased reverse-mode AD of a particle filter. Our code package is available at https://github.com/gaurav-arya/StochasticAD.jl.

differentiable stochastic programming

stochastic methods

gradient based inference

compositionality

discrete randomness

reparameterization trick

automatic differentiation

chain rule

Författare

Gaurav Arya

Massachusetts Institute of Technology (MIT)

Moritz Schauer

Chalmers, Matematiska vetenskaper, Tillämpad matematik och statistik

Göteborgs universitet

Frank Schäfer

Massachusetts Institute of Technology (MIT)

Christopher Vincent Rackauckas

Massachusetts Institute of Technology (MIT)

Advances in Neural Information Processing Systems

10495258 (ISSN)

Vol. 35
9781713871088 (ISBN)

36th Conference on Neural Information Processing Systems, NeurIPS 2022
New Orleans + virtual, USA,

Ämneskategorier

Beräkningsmatematik

Sannolikhetsteori och statistik

Reglerteknik

Matematisk analys

DOI

10.48550/arXiv.2210.08572

Mer information

Senast uppdaterat

2024-01-03