Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published September 2013 | public
Journal Article

Nonparametric learning rules from bandit experiments: The eyes have it!

Abstract

How do people learn? We assess, in a model-free manner, subjectsʼ belief dynamics in a two-armed bandit learning experiment. A novel feature of our approach is to supplement the choice and reward data with subjectsʼ eye movements during the experiment to pin down estimates of subjectsʼ beliefs. Estimates show that subjects are more reluctant to "update down" following unsuccessful choices, than "update up" following successful choices. The profits from following the estimated learning and decision rules are smaller (by about 25% of average earnings by subjects in this experiment) than what would be obtained from a fully-rational Bayesian learning model, but comparable to the profits from alternative non-Bayesian learning models, including reinforcement learning and a simple "win-stay" choice heuristic.

Additional Information

© 2013 Elsevier Inc. Received 22 February 2012; Available online 30 May 2013. We are indebted to Antonio Rangel for his encouragement and for the funding and use of facilities in his lab. We thank Dan Ackerberg, Peter Bossaerts, Colin Camerer, Andrew Ching, Mark Dean, Cary Frydman, Ian Krajbich, Pietro Ortoleva, Joseph Tao-yi Wang and participants in presentations at U. Arizona, Caltech, UCLA, U. Washington and Choice Symposium 2010 (Key Largo) for comments and suggestions. Kayaba thanks the Nakajima Foundation for the financial support.

Additional details

Created:
August 22, 2023
Modified:
October 25, 2023