The paper considers a class of decision problems with infinite time horizon that contains Markov decision problems as an important special case. Our interest concerns the case where the decision maker cannot commit himself to his future action choices. We model the decision maker as consisting of multiple selves, where each history of the decision problem corresponds to one self. Each self is assumed to have the same utility function as the decision maker. We introduce the notions of Nash equilibrium, subgame perfect equilibrium, and curb sets for decision problems. An optimal policy at the initial history is a Nash equilibrium but not vice versa. Both subgame perfect equilibria and curb sets are equivalent to subgame optimal policies. The concept of a subgame optimal policy is therefore robust to the absence of commitment technologies.
|Series||GSBE Research Memoranda|
- c61 - "Optimization Techniques; Programming Models; Dynamic Analysis"
- c62 - Existence and Stability Conditions of Equilibrium
- c73 - "Stochastic and Dynamic Games; Evolutionary Games; Repeated Games"