Title: Probability Matching and Reinforcement Learning
Authors: Rivas, Javier
First Published: Mar-2011
Publisher: Dept. of Economics, University of Leicester
Abstract: Probability matching occurs when an action is chosen with a frequency equivalent to the probability of that action being the best choice. This sub-optimal behavior has been reported repeatedly by psychologists and experimental economists. We provide an evolutionary foundation for this phenomenon by showing that learning by reinforcement can lead to probability matching and that, if learning occurs sufficiently slowly, probability matching occurs not only in choice frequencies but also in choice probabilities. Our results are completed by proving that there exists no quasi-linear reinforcement learning specification such that behavior is optimal for all environments where counterfactuals are observed.
Series/Report no.: Papers in Economics;11/20
Type: Report
Appears in Collections: Reports, Dept. of Economics

Files in This Item:
File: dp11-20.pdf
Size: 160.61 kB
Format: Adobe PDF

Items in LRA are protected by copyright, with all rights reserved, unless otherwise indicated.