Categorical Foundations of Gradient-Based Learning

2021 
We propose a categorical foundation of gradient-based machine learning algorithms in terms of lenses, parametrised maps, and reverse derivative categories. This foundation provides a powerful explanatory and unifying framework: it encompasses a variety of gradient descent algorithms such as ADAM, AdaGrad, and Nesterov momentum, as well as a variety of loss functions such as MSE and Softmax cross-entropy, shedding new light on their similarities and differences. Our approach also generalises beyond neural networks (modelled in categories of smooth maps), accounting for other structures relevant to gradient-based learning such as boolean circuits. Finally, we also develop a novel implementation of gradient-based learning in Python, informed by the principles introduced by our framework.
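To make the lens idea in the abstract concrete, here is a minimal Python sketch; it is not the paper's actual library, and the Lens class and scale helper are illustrative names chosen here. A lens pairs a forward map with a reverse map, and lens composition reproduces the chain rule that reverse-mode differentiation (and hence gradient-based learning) relies on.

    from dataclasses import dataclass
    from typing import Any, Callable

    @dataclass
    class Lens:
        get: Callable[[Any], Any]          # forward pass: A -> B
        put: Callable[[Any, Any], Any]     # reverse pass: (A, dB) -> dA

        def __rshift__(self, other: "Lens") -> "Lens":
            # Composition: forward passes run left-to-right; reverse
            # passes run right-to-left, which is exactly the chain rule.
            return Lens(
                get=lambda a: other.get(self.get(a)),
                put=lambda a, db: self.put(a, other.put(self.get(a), db)),
            )

    def scale(w: float) -> Lens:
        # The map x -> w*x together with its reverse derivative:
        # a change db on the output pulls back to w*db on the input.
        return Lens(get=lambda x: w * x, put=lambda x, db: w * db)

    f = scale(2.0) >> scale(3.0)
    print(f.get(1.0))       # 6.0: forward pass of x -> 6x
    print(f.put(1.0, 1.0))  # 6.0: reverse derivative of 6x at x = 1

In the paper's framework, optimisers such as ADAM or Nesterov momentum and loss functions such as MSE are themselves modelled as lenses of this shape, which is what allows them to be composed with the model and with each other.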