Learning by (limited) forward looking players

F. Mengel

doi:10.1016/j.jebo.2014.08.001

Corresponding author: F. Mengel

Microeconomics & Public Economics

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

We present a model of adaptive economic agents who are k periods forward looking. Agents in our model are randomly matched to interact in finitely repeated games. They form beliefs by learning from past behavior of others and then best respond to these beliefs looking k periods ahead. We establish almost sure convergence of our stochastic process and characterize absorbing sets. These can be very different from the predictions in both the fully rational model and the adaptive, but myopic case. In particular, we find that non-Nash outcomes can also be sustained whenever they satisfy a “local” efficiency condition. We then characterize stochastically stable states in a class of 2 × 2 games and show that under certain conditions the efficient action in prisoner's dilemma games and coordination games can be singled out as uniquely stochastically stable. We show that our results are consistent with typical patterns observed in experiments on finitely repeated prisoner's dilemma games and in particular can explain what is commonly called the “endgame effect” and the “restart effect”. Finally, if populations are composed of some myopic and some forward looking agents, parameter constellations exist such that either might obtain higher average payoffs.

Original language	English
Pages (from-to)	59-77
Journal	Journal of Economic Behavior & Organization
Volume	108
Early online date	21 Aug 2014
DOIs	https://doi.org/10.1016/j.jebo.2014.08.001
Publication status	Published - Dec 2014

Cite this

@article{30016442d72a4869807952d9569f401e,

title = "Learning by (limited) forward looking players",

abstract = "We present a model of adaptive economic agents who are k periods forward looking. Agents in our model are randomly matched to interact in finitely repeated games. They form beliefs by learning from past behavior of others and then best respond to these beliefs looking k periods ahead. We establish almost sure convergence of our stochastic process and characterize absorbing sets. These can be very different from the predictions in both the fully rational model and the adaptive, but myopic case. In particular we find that also non-nash outcomes can be sustained whenever they satisfy a “local” efficiency condition. We then characterize stochastically stable states in a class of 2 × 2 games and show that under certain conditions the efficient action in prisoner's dilemma games and coordination games can be singled out as uniquely stochastically stable. We show that our results are consistent with typical patterns observed in experiments on finitely repeated prisoner's dilemma games and in particular can explain what is commonly called the “endgame effect” and the “restart effect”. Finally, if populations are composed of some myopic and some forward looking agents, parameter constellations exist such that either might obtain higher average payoffs.",

author = "F. Mengel",

year = "2014",

month = dec,

doi = "10.1016/j.jebo.2014.08.001",

language = "English",

volume = "108",

pages = "59--77",

journal = "Journal of Economic Behavior \& Organization",

issn = "0167-2681",

publisher = "Elsevier Science",

}

TY - JOUR

T1 - Learning by (limited) forward looking players

AU - Mengel, F.

PY - 2014/12

Y1 - 2014/12

N2 - We present a model of adaptive economic agents who are k periods forward looking. Agents in our model are randomly matched to interact in finitely repeated games. They form beliefs by learning from past behavior of others and then best respond to these beliefs looking k periods ahead. We establish almost sure convergence of our stochastic process and characterize absorbing sets. These can be very different from the predictions in both the fully rational model and the adaptive, but myopic case. In particular we find that also non-nash outcomes can be sustained whenever they satisfy a “local” efficiency condition. We then characterize stochastically stable states in a class of 2 × 2 games and show that under certain conditions the efficient action in prisoner's dilemma games and coordination games can be singled out as uniquely stochastically stable. We show that our results are consistent with typical patterns observed in experiments on finitely repeated prisoner's dilemma games and in particular can explain what is commonly called the “endgame effect” and the “restart effect”. Finally, if populations are composed of some myopic and some forward looking agents, parameter constellations exist such that either might obtain higher average payoffs.

AB - We present a model of adaptive economic agents who are k periods forward looking. Agents in our model are randomly matched to interact in finitely repeated games. They form beliefs by learning from past behavior of others and then best respond to these beliefs looking k periods ahead. We establish almost sure convergence of our stochastic process and characterize absorbing sets. These can be very different from the predictions in both the fully rational model and the adaptive, but myopic case. In particular we find that also non-nash outcomes can be sustained whenever they satisfy a “local” efficiency condition. We then characterize stochastically stable states in a class of 2 × 2 games and show that under certain conditions the efficient action in prisoner's dilemma games and coordination games can be singled out as uniquely stochastically stable. We show that our results are consistent with typical patterns observed in experiments on finitely repeated prisoner's dilemma games and in particular can explain what is commonly called the “endgame effect” and the “restart effect”. Finally, if populations are composed of some myopic and some forward looking agents, parameter constellations exist such that either might obtain higher average payoffs.

U2 - 10.1016/j.jebo.2014.08.001

DO - 10.1016/j.jebo.2014.08.001

M3 - Article

SN - 0167-2681

VL - 108

SP - 59

EP - 77

JO - Journal of Economic Behavior & Organization

JF - Journal of Economic Behavior & Organization

ER -