Reinforcement learning or active inference?

Karl J. Friston; Jean Daunizeau; Stefan J. Kiebel

doi:10.1371/journal.pone.0006421

Reinforcement learning or active inference?

Publikation: Beitrag in Fachzeitschrift › Forschungsartikel › Beigetragen › Begutachtung

Beitragende

Karl J. Friston - , University College London (Autor:in)
Jean Daunizeau - , University College London (Autor:in)
Stefan J. Kiebel - , University College London (Autor:in)

Abstract

This paper questions the need for reinforcement learning or control theory when optimising behaviour. We show that it is fairly simple to teach an agent complicated and adaptive behaviours using a free-energy formulation of perception. In this formulation, agents adjust their internal states and sampling of the environment to minimize their free-energy. Such agents learn causal structure in the environment and sample it in an adaptive and self-supervised fashion. This results in behavioural policies that reproduce those optimised by reinforcement learning and dynamic programming. Critically, we do not need to invoke the notion of reward, value or utility. We illustrate these points by solving a benchmark problem in dynamic programming; namely the mountain-car problem, using active perception or inference under the free-energy principle. The ensuing proof-of-concept may be important because the free-energy formulation furnishes a unified account of both action and perception and may speak to a reappraisal of the role of dopamine in the brain.

Details

Originalsprache	Englisch
Aufsatznummer	e6421
Fachzeitschrift	PloS one
Jahrgang	4
Ausgabenummer	7
Publikationsstatus	Veröffentlicht - 29 Juli 2009
Peer-Review-Status	Ja
Extern publiziert	Ja

Externe IDs

PubMed	19641614

Forschungsportal der TU Dresden

Reinforcement learning or active inference?

Beitragende

Abstract

Details

Externe IDs

Schlagworte

ASJC Scopus Sachgebiete

Bibliotheksschlagworte