Neuromorphic Hardware Learns to Learn

Thomas Bohnstingl; Franz Scherr; Christian Pehle; Karlheinz Meier; Wolfgang Maass

doi:10.3389/fnins.2019.00483

Neuromorphic Hardware Learns to Learn

Thomas Bohnstingl^*, Franz Scherr, Christian Pehle, Karlheinz Meier, Wolfgang Maass

^*Korrespondierende/r Autor/-in für diese Arbeit

Institut für Grundlagen der Informationsverarbeitung (7080)

Publikation: Beitrag in einer Fachzeitschrift › Artikel › Begutachtung

Abstract

Hyperparameters and learning algorithms for neuromorphic hardware are usually chosen by hand to suit a particular task. In contrast, networks of neurons in the brain were optimized through extensive evolutionary and developmental processes to work well on a range of computing and learning tasks. Occasionally this process has been emulated through genetic algorithms, but these require themselves hand-design of their details and tend to provide a limited range of improvements. We employ instead other powerful gradient-free optimization tools, such as cross-entropy methods and evolutionary strategies, in order to port the function of biological optimization processes to neuromorphic hardware. As an example, we show these optimization algorithms enable neuromorphic agents to learn very efficiently from rewards. In particular, meta-plasticity, i.e., the optimization of the learning rule which they use, substantially enhances reward-based learning capability of the hardware. In addition, we demonstrate for the first time Learning-to-Learn benefits from such hardware, in particular, the capability to extract abstract knowledge from prior learning experiences that speeds up the learning of new but related tasks. Learning-to-Learn is especially suited for accelerated neuromorphic hardware, since it makes it feasible to carry out the required very large number of network computations.

Originalsprache	englisch
Aufsatznummer	483
Seitenumfang	14
Fachzeitschrift	Frontiers in Neuroscience
Jahrgang	13
DOIs	https://doi.org/10.3389/fnins.2019.00483
Publikationsstatus	Veröffentlicht - 21 Mai 2019

Zugriff auf Dokument

10.3389/fnins.2019.00483Lizenz: CC BY 4.0

Dieses zitieren

@article{b9ebc28ae8324565b3b6056aba23a659,

title = "Neuromorphic Hardware Learns to Learn",

abstract = "Hyperparameters and learning algorithms for neuromorphic hardware are usually chosen by hand to suit a particular task. In contrast, networks of neurons in the brain were optimized through extensive evolutionary and developmental processes to work well on a range of computing and learning tasks. Occasionally this process has been emulated through genetic algorithms, but these require themselves hand-design of their details and tend to provide a limited range of improvements. We employ instead other powerful gradient-free optimization tools, such as cross-entropy methods and evolutionary strategies, in order to port the function of biological optimization processes to neuromorphic hardware. As an example, we show these optimization algorithms enable neuromorphic agents to learn very efficiently from rewards. In particular, meta-plasticity, i.e., the optimization of the learning rule which they use, substantially enhances reward-based learning capability of the hardware. In addition, we demonstrate for the first time Learning-to-Learn benefits from such hardware, in particular, the capability to extract abstract knowledge from prior learning experiences that speeds up the learning of new but related tasks. Learning-to-Learn is especially suited for accelerated neuromorphic hardware, since it makes it feasible to carry out the required very large number of network computations.",

author = "Thomas Bohnstingl and Franz Scherr and Christian Pehle and Karlheinz Meier and Wolfgang Maass",

year = "2019",

month = may,

day = "21",

doi = "10.3389/fnins.2019.00483",

language = "English",

volume = "13",

journal = "Frontiers in Neuroscience",

publisher = "Frontiers Research Foundation",

}

TY - JOUR

T1 - Neuromorphic Hardware Learns to Learn

AU - Bohnstingl, Thomas

AU - Scherr, Franz

AU - Pehle, Christian

AU - Meier, Karlheinz

AU - Maass, Wolfgang

PY - 2019/5/21

Y1 - 2019/5/21

N2 - Hyperparameters and learning algorithms for neuromorphic hardware are usually chosen by hand to suit a particular task. In contrast, networks of neurons in the brain were optimized through extensive evolutionary and developmental processes to work well on a range of computing and learning tasks. Occasionally this process has been emulated through genetic algorithms, but these require themselves hand-design of their details and tend to provide a limited range of improvements. We employ instead other powerful gradient-free optimization tools, such as cross-entropy methods and evolutionary strategies, in order to port the function of biological optimization processes to neuromorphic hardware. As an example, we show these optimization algorithms enable neuromorphic agents to learn very efficiently from rewards. In particular, meta-plasticity, i.e., the optimization of the learning rule which they use, substantially enhances reward-based learning capability of the hardware. In addition, we demonstrate for the first time Learning-to-Learn benefits from such hardware, in particular, the capability to extract abstract knowledge from prior learning experiences that speeds up the learning of new but related tasks. Learning-to-Learn is especially suited for accelerated neuromorphic hardware, since it makes it feasible to carry out the required very large number of network computations.

AB - Hyperparameters and learning algorithms for neuromorphic hardware are usually chosen by hand to suit a particular task. In contrast, networks of neurons in the brain were optimized through extensive evolutionary and developmental processes to work well on a range of computing and learning tasks. Occasionally this process has been emulated through genetic algorithms, but these require themselves hand-design of their details and tend to provide a limited range of improvements. We employ instead other powerful gradient-free optimization tools, such as cross-entropy methods and evolutionary strategies, in order to port the function of biological optimization processes to neuromorphic hardware. As an example, we show these optimization algorithms enable neuromorphic agents to learn very efficiently from rewards. In particular, meta-plasticity, i.e., the optimization of the learning rule which they use, substantially enhances reward-based learning capability of the hardware. In addition, we demonstrate for the first time Learning-to-Learn benefits from such hardware, in particular, the capability to extract abstract knowledge from prior learning experiences that speeds up the learning of new but related tasks. Learning-to-Learn is especially suited for accelerated neuromorphic hardware, since it makes it feasible to carry out the required very large number of network computations.

U2 - 10.3389/fnins.2019.00483

DO - 10.3389/fnins.2019.00483

M3 - Article

VL - 13

JO - Frontiers in Neuroscience

JF - Frontiers in Neuroscience

M1 - 483

ER -