Active inference and learning
Abstract
This paper offers an active inference account of choice behaviour and learning. It focuses on the distinction between goal-directed and habitual behaviour and how they contextualise each other. We show that habits emerge naturally (and autodidactically) from sequential policy optimisation when agents are equipped with state-action policies. In active inference, behaviour has explorative (epistemic) and exploitative (pragmatic) aspects that are sensitive to ambiguity and risk respectively, where epistemic (ambiguity-resolving) behaviour enables pragmatic (reward-seeking) behaviour and the subsequent emergence of habits. Although goal-directed and habitual policies are usually associated with model-based and model-free schemes, we find the more important distinction is between belief-free and belief-based schemes. The underlying (variational) belief updating provides a comprehensive (if metaphorical) process theory for several phenomena, including the transfer of dopamine responses, reversal learning, habit formation and devaluation. Finally, we show that active inference reduces to a classical (Bellman) scheme, in the absence of ambiguity.
Additional Information
© 2016 The Authors. Published by Elsevier Under a Creative Commons license - Attribution 4.0 International (CC BY 4.0) Received date: 5-3-2016; Revised date: 15-6-2016; Accepted date: 17-6-2016. Available online 29 June 2016. KJF is funded by the Wellcome Trust (Ref: 088130/Z/09/Z). Philipp Schwartenbeck is a recipient of a DOC Fellowship of the Austrian Academy of Sciences at the Centre for Cognitive Neuroscience; University of Salzburg. GP gratefully acknowledges support of HFSP (Young Investigator Grant RGY0088/2014).Attached Files
Published - 1-s2.0-S0149763416301336-main.pdf
Files
Name | Size | Download all |
---|---|---|
md5:56c5bdab17d99b28d61f1746d33a20e9
|
5.5 MB | Preview Download |
Additional details
- PMCID
- PMC5167251
- Eprint ID
- 69039
- Resolver ID
- CaltechAUTHORS:20160715-071522556
- Wellcome Trust
- 088130/Z/09/Z
- Österreichische Akademie der Wissenschaften
- Human Frontier Science Program
- RGY0088/2014
- Created
-
2016-07-15Created from EPrint's datestamp field
- Updated
-
2021-11-11Created from EPrint's last_modified field