Risk, Unexpected Uncertainty, and Estimation Uncertainty: Bayesian Learning in Unstable Settings
- Creators
- Payzan-LeNestour, Elise
-
Bossaerts, Peter
Abstract
Recently, evidence has emerged that humans approach learning using Bayesian updating rather than (model-free) reinforcement algorithms in a six-arm restless bandit problem. Here, we investigate what this implies for human appreciation of uncertainty. In our task, a Bayesian learner distinguishes three equally salient levels of uncertainty. First, the Bayesian perceives irreducible uncertainty or risk: even knowing the payoff probabilities of a given arm, the outcome remains uncertain. Second, there is (parameter) estimation uncertainty or ambiguity: payoff probabilities are unknown and need to be estimated. Third, the outcome probabilities of the arms change: the sudden jumps are referred to as unexpected uncertainty. We document how the three levels of uncertainty evolved during the course of our experiment and how it affected the learning rate. We then zoom in on estimation uncertainty, which has been suggested to be a driving force in exploration, in spite of evidence of widespread aversion to ambiguity. Our data corroborate the latter. We discuss neural evidence that foreshadowed the ability of humans to distinguish between the three levels of uncertainty. Finally, we investigate the boundaries of human capacity to implement Bayesian learning. We repeat the experiment with different instructions, reflecting varying levels of structural uncertainty. Under this fourth notion of uncertainty, choices were no better explained by Bayesian updating than by (model-free) reinforcement learning. Exit questionnaires revealed that participants remained unaware of the presence of unexpected uncertainty and failed to acquire the right model with which to implement Bayesian updating.
Additional Information
© 2011 Payzan-LeNestour, Bossaerts. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Received November 2, 2009; Accepted December 2, 2010; Published January 20, 2011. Editor: Tim Behrens, John Radcliffe Hospital, United Kingdom. Funding: The authors acknowledge financial support from the Swiss Finance Institute and from NCCR FINRISK of the Swiss National Science Foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript application. We are grateful to Chen Feng for programming the board game application. Author Contributions: Conceived and designed the experiments: EP-LN. Performed the experiments: EP-LN. Analyzed the data: EP-LN. Wrote the paper: EPLN PB. Supervised EP-LN: PB.Attached Files
Published - PayzanLeNestour2011p12669Plos_Comput_Biol.pdf
Supplemental Material - Figure_S1.pdf
Supplemental Material - Figure_S2.pdf
Supplemental Material - Figure_S3.pdf
Supplemental Material - Figure_S4.pdf
Supplemental Material - Text_S1.pdf
Files
Name | Size | Download all |
---|---|---|
md5:00f7057d6537796471e7088260cbdadc
|
789.5 kB | Preview Download |
md5:1b2008878651741088eb65be0efc5b9e
|
19.2 kB | Preview Download |
md5:906fe12d5e38940117be54b8f335b593
|
14.5 kB | Preview Download |
md5:57cceab5282a96fa4276dca6041ce0ee
|
19.8 kB | Preview Download |
md5:57cceab5282a96fa4276dca6041ce0ee
|
19.8 kB | Preview Download |
md5:cd37a62da2e15868287014b9015d419a
|
195.3 kB | Preview Download |
Additional details
- PMCID
- PMC3024253
- Eprint ID
- 22571
- Resolver ID
- CaltechAUTHORS:20110301-094909173
- Swiss Finance Institute
- Swiss National Science Foundation (SNSF)
- Created
-
2011-03-03Created from EPrint's datestamp field
- Updated
-
2021-11-09Created from EPrint's last_modified field