Dynamic programs are optimization problems where the objective to be maximized is lifetime value. As such, one key topic is how to combine a sequence of rewards into a corresponding lifetime value. So far we have considered linear valuation based on summation over expected discounted rewards, using either constant discount rates (Chapter 1–Chapter 5) or state-dependent discounting (Chapter 6). In this chapter, we consider extensions, where lifetime value is computed from a recursion over the reward sequence instead of a discounted sum. This “recursive preference” approach permits far more general specifications of lifetime value, and is becoming increasingly popular in economics, finance, and computer science (see, e.g., Section 6.4).
This chapter focuses purely on valuation (i.e., combining reward sequences into lifetime values), rather than optimization. Later, in Chapter 8, we will show how to maximize lifetime value in settings where recursive preferences are adopted.
Throughout this chapter, the symbol X always represents a finite set.
The most natural way to express lifetime value in recursive preference environments is as a fixed point of a (typically nonlinear) operator. One challenge is that some recursive preference specifications induce operators that fail to be contractions. For this reason, we now invest in additional fixed point theory. All of this theory concerns order preserving maps, since the operators we consider always inherit monotonicity from underlying preferences.
If you try to draw an increasing function that maps [0,1] to itself without touching the 45-degree line, you will find it impossible. Below we state a famous fixed-point theorem due to Bronislaw Knaster (1893–1980) and Alfred Tarski (1901–1983) that generalizes this idea. In the statement, X is a finite set and V:=[v1,v2], where v1,v2 are functions in RX with v1⩽v2.
Unlike, say, the fixed-point theorem of Banach (Section 1.2.2.3), Theorem 7.1.1 only yields existence. Uniqueness does not hold in general, as you can easily confirm by sketching the one-dimensional case or completing the following exercise.
In this section, we study sufficient conditions for global stability that replace contractivity with shape properties such as concavity and monotonicity. To build intuition, we start with the one-dimensional case and show how these properties can be combined to achieve stability. Readers focused on results can safely skip to Section 7.1.2.2.
In Section 1.2.3.2, we showed that concavity and monotonicity can yield global stability for the Solow–Swan model. Here is a more general result.
Figure 7.1:Global stability induced by increasing concave functions
Figure 7.1 gives one example, where g(x)=1+x/2. The conditions of Proposition 7.1.2 hold because, given any x>0, we can find an a in (0,x) that gets mapped strictly up (i.e., g(a) is above the 45-degree line) and a point b>x that gets mapped down (i.e., g(b) is below the 45-degree line).
Proposition 7.1.2 extends to multiple dimensions. In this section, we present a multidimensional version that covers both convex and concave functions.
To state our result, we extend the definition of convexity and concavity to vector-valued self-maps. The definitions mirror those for scalar-valued functions: A self-map T on a convex subset D of RX is called convex if
T(λu+(1−λ)v)⩽λTu+(1−λ)Tv whenever u,v∈D and λ∈[0,1];
and concave if
λTu+(1−λ)Tv⩽T(λu+(1−λ)v) whenever u,v∈D and λ∈[0,1].
Here ⩽ is, as usual, the pointwise order.
We are now ready to state our next fixed-point result, which was first proved in an infinite-dimensional setting by Du (1990). In the statement, X is a finite set, V:=[v1,v2] is a nonempty order interval in (RX,⩽), and T is a self-map on V.
Conditions (i) and (ii) are similar – in fact (ii) holds whenever (i) holds, so (ii) is the weaker (but slightly more complicated) condition. Conditions (iii) and (iv) are similar in the same sense. Figure 7.2 illustrates the convex and the concave versions of the result in one dimension. We encourage you to sketch your own variations to understand the roles that different conditions play.
A full proof of Theorem 7.1.3 can be found in Du (1990) or Theorem 2.1.2 and Corollary 2.1.1 of Zhang (2012). In our setting, existence follows from the Knaster–Tarski theorem. We prove uniqueness.
Du’s theorem provides conditions under which concave or convex order preserving self-maps on order intervals attain global stability. In this section we study maps of this type that have additional structure. While this additional structure is restrictive, it allows us to obtain global stability on unbounded subsets rather than order intervals.
To begin, let X be a finite set and consider the equation
Continuing to assume that h≫0 and A is a positive linear operator, we can use Du’s theorem to establish the next result (which generalizes Lemma 6.1.4).
The key to proving (i) implies (ii) is that G is order preserving and either convex or concave, depending on the value of θ. The remaining conditions in Du’s theorem are established over order intervals using ρ(A)1/θ<1. By applying an approximation argument, global stability is extended from order intervals to all of V. Some of these details are contained in the following exercises and a full proof can be found in Stachurski et al. (2022).
In this section, we compute lifetime values associated with given reward processes in settings that involve nonlinear recursions. These nonlinear recursions are called recursive preferences. We will show how some common specifications of recursive preferences can be translated into lifetime valuations via the fixed-point methods introduced in Chapter 2 and Section 7.1.
The time additive model of valuation in Section 3.2.2.3 can be studied from a purely recursive point of view. As a starting point, we state that the value Vt of current and future consumption is defined at each point in time t by the recursion
The random variables Vt and Vt+1 are the unknown objects in this expression. The expectation Et conditions on X0,…,Xt and Ct=c(Xt). The process (Xt)t⩾0 is P-Markov.
Since consumption is a function of (Xt)t⩾0 and knowledge of the current state Xt is sufficient to forecast future values (by the Markov property), it is natural to guess that Vt will depend on the Markov chain only through Xt. Hence we guess there is a solution of (7.5) takes the form Vt=v(Xt) for some v∈RX.
(Here v is an ansatz, meaning “educated guess.” First we guess the form of a solution and then we try to verify that the guess is correct. So long as we carry out the second step, starting with a guess brings no loss of rigor.)
Under this conjecture, (7.5) can be rewritten as v(Xt)=u(c(Xt))+βEtv(Xt+1). Conditioning on Xt=x and setting r:=u∘c, this becomes
In vector form, we get v=r+βPv. From the Neumann series lemma, the solution is v∗=(I−βP)−1r, which is identical to (3.21).
In summary, (7.5) and the sequential representation (3.20) specify the same lifetime value for consumption paths.
While the recursive formulation in (7.5) now seems redundant, since it produces the same specification that we obtained from the sequential approach, the recursive set up gives us a formula to build on, and hence a pathway to overcoming limitations of the time additive approach. Most of the rest of this chapter will be focused on this agenda.
Pursuing this agenda will produce preferences over consumption paths where the sequential approach has no natural counterpart. This occurs when current value Vt is nonlinear in current rewards and continuation values (unlike the linear specification (7.5)). Such specifications are called recursive preferences. When dealing with recursive preference models, the lack of a sequential counterpart means that we are forced to proceed recursively.
also called the discounted expected utility model, can be framed recursively, and how this provides a pathway to go beyond the time additive specification. We are motivated to do so because the time additive specification has been rejected by experimental and observational data in many settings.
In this section, we highlight some of the limitations of time additive preferences. While our discussion is only brief, more background and a list of references can be found in Section 7.4.
One issue with (7.7) is the assumption of a constant positive discount rate, which has been refuted by a long list of empirical studies. This issue was discussed in Section 6.4.
Another limitation of time additive preferences is that agents are risk-neutral in future utility (see, e.g., (7.5), where current value depends linearly on future value). Although risk aversion over consumption can be built in through curvature of u, this same curvature also determines the elasticity of intertemporal substitution, meaning that the two aspects of preferences cannot be separated. We elaborate on this point in Section 7.3.1.4.
A third issue with time additivity is that agents with such preferences are indifferent to any variation in the joint distribution of rewards that leaves marginal distributions unchanged. To get a sense of what this means, suppose you accept a new job and will be employed by this firm for the rest of your life. Your daily consumption will be entirely determined by your daily wage. Your boss offers you two options:
Your boss will flip a coin at the start of your first day on the job. If the coin is heads, you will receive $10,000 a day for the rest of your life. If the coin is tails, you will receive $1 per day for the rest of your life.
Your boss will flip a coin at the start of every day. If the coin is heads, you will receive $10,000. If the coin is tails, you will receive $1.
If you have a strict preference between options A and B, then your choice cannot be rationalized with time additive preferences.
To see why, let φ be a probability distribution that represents the lottery just described, putting mass 0.5 on 10,000 and mass 0.5 on 1. Under option A, consumption (Ct)t⩾1 is given by Ct=C1 for all t, where C1∼φ. Under option B, consumption (Ct)t⩾1 is an IID sequence drawn from φ. Either way, lifetime utility is
Et⩾1∑βtu(Ct)=t⩾1∑βtEu(Ct)=1−ββuˉ,
where uˉ:=Eu(C1)=u(1)/2+u(10,000)/2.
The critical part of this argument is the passing of expectations through the sum, which uses time additivity . The implication is that lifetime utility depends only on the marginal distribution of each Ct, rather than on the joint distribution of the stochastic process (Ct)t⩾0.
Having motivated recursive preferences, let’s turn to our first example: risk-sensitive preferences. For the consumption problem described in Section 7.2.1.1, imposing risk-sensitive preferences means replacing the recursion v=r+βPv for v with
As before, r(x)=u(c(x)) represents current utility when the current state is x. The parameter θ is a nonzero constant in R.
In (7.8), the transform f(v)=exp(θv) is applied to v before expectation is taken. After the expectation is computed, the transform is undone via f−1(v)=(1/θ)ln(v). We will show that the agent can be either risk-averse or risk-loving with respect to future outcomes, depending on the value of θ.
We understand the functional equation (7.8) as “defining” lifetime utility under risk-sensitive preferences. A function v solving (7.8) gives a lifetime valuation v(x) to each x∈X, with the interpretation that v(x) is lifetime utility conditional on initial state x. This definition of lifetime value is by analogy to the time additive case studied in Section 7.2.1.1, where the function v solving v=r+βPv measures lifetime utility from each initial state.
In the previous paragraph we wrote “defining” in scare quotes because we can’t be sure we have a definition at this point. Just because we write down a recursive expression for lifetime utility doesn’t mean that corresponding lifetime utility is actually well defined. (For example, we can happily write down the recursive vector equation v=v+11 but no vector v solving this equation exists.) One aim of this chapter is to provide conditions under which recursions like (7.8) have solutions.
Another issue is uniqueness. Suppose that (7.8) has many solutions. In this case the predictions of the utility model are ambiguous. Our perspective is that the recursive preference specification (7.8) is not correctly formulated unless existence and uniqueness hold. We return to this point in Section 7.2.2.3.
One final comment: even if we can find a v that solves (7.8), the nonlinearities introduced by risk sensitivity imply that there will be no neat sequential representation analogous to v(x)=Ex∑tβtu(Ct) from the time additive case. (This connects to Remark 7.2.1, where we discuss recursive preference terminology.)
We want to understand the “expectation-like” expression on the right hand side of (7.8) that replaces the ordinary conditional expectation ∑x′v(x′)P(x,x′) from the time additive case. To this end, we define, for arbitrary random variable ξ and nonzero θ∈R,
Eθ[ξ]=θ1ln{E[exp(θξ)]}.
The value Eθ[ξ] is called the entropic risk-adjusted expectation of ξ given θ.
The key idea behind the entropic risk-adjusted expectation is that decreasing θ lowers appetite for risk and increasing θ does the opposite.
Expression (7.9) shows that, for the Gaussian case, Eθ[ξ] equals the mean plus a term that penalizes variance when θ<0 and rewards it when θ>0.
Let’s return to investigating lifetime utility under risk-sensitive preferences. To this end, we introduce the risk-sensitive Koopmans operatorKθ on RX via
Evidently, for given nonzero θ, a function v∈RX solves the risk-sensitive preference lifetime utility specification (7.8) if and only if v is a fixed point of Kθ. This explains the significance of the following result:
We postpone a proof of Proposition 7.2.2 because we will prove a more general result in Section 7.3.2.2. For now we note the following implications.
(i) For each nonzero θ, lifetime utility is both well-defined and uniquely defined for risk-sensitive preferences (i.e., (7.8) has a unique solution).
(ii) The unique solution, denoted henceforth by v∗, can be computed by successive approximation using Kθ.
As a tractable case, let’s suppose that r(x)=x and that Xt+1=ρXt+σWt+1 where (Wt)t⩾1 is IID and standard normal. Here ∣ρ∣<1 and σ⩾0 controls volatility of the state. Rather than discretizing the state process, we leave it as continuous and proceed by hand.
In this setting, the functional equation (7.8) for v becomes
Since ρx+σW is Gaussian, the expression (7.9) for the risk-adjusted expectation of a normal random variable leads us to conjecture that the solution v will be affine, i.e., v(x)=ax+b for some a,b∈R. This conjecture turns out to be correct:
We can see that, under the stated assumptions, lifetime value v is increasing in the state variable x. However, impacts of the parameters generally depend on θ. For example, if θ>0, increasing σ shifts up lifetime utility. If θ<0, then lifetime value decreases with σ. This is as we expect: Lifetime utility is affected positively or negatively by volatility, depending on whether or not the agent is risk averse or risk loving.
Figure 7.3 shows the true solution v(x)=ax+b to the risk-sensitive lifetime utility model, as well as an approximate fixed point from a discrete approximation. The discrete approximation is computed by applying successive approximation to Kθ after discretizing the state process via Tauchen’s method. The parameters and discretization are shown in Listing 1.
Figure 7.3:Approximate and true solutions in the Gaussian case
1
2
3
4
5
6
7
8
9
10
11
12
13
using LinearAlgebra, QuantEcon
function create_rs_utility_model(;
n=180, # size of state space
β=0.95, # time discount factor
ρ=0.96, # correlation coef in AR(1)
σ=0.1, # volatility
θ=-1.0) # risk aversion
mc = tauchen(n, ρ, σ, 0, 10) # n_std = 10
x_vals, P = mc.state_values, mc.p
r = x_vals # special case u(c(x)) = x
return (; β, θ, ρ, σ, r, x_vals, P)
end
Program 1:Risk sensitive utility model parameters (rs_utility.jl)
One of the most popular specifications of recursive preferences in quantitative research is Epstein–Zin utility.[1] This class of preferences has been used to study asset pricing, business cycles, monetary policy, fiscal policy, optimal taxation, climate policy, pension plans, and other topics. In this section, we introduce the Epstein–Zin specification and discuss how to solve it. We will see that the specification, while highly nonlinear, is nonetheless well behaved.
where γ, α are nonzero parameters and β∈(0,1). As for risk-sensitive preferences, lack of time additivity implies that there is no neat sequential representation for lifetime value. As a result, we must work directly with the recursive expression (7.12).
Assume as before that Ct=c(Xt), where c∈R+X and (Xt)t⩾0 is P-Markov on finite set X. We conjecture a solution of the form Vt=v(Xt) for some v∈V:=R+X. Under this conjecture, the Epstein–Zin Koopmans operator corresponding to (7.12) is
As will be discussed further in Section 7.3.1.1, the parameter γ governs risk aversion with respect to temporal gambles (where outcomes are resolved in the next period), while β controls impatience and α parametrizes the intertemporal elasticity of substitution. The fact that all three parameters have distinct effects helps fit data. For example, see Tallarini Jr (2000) and Barillas et al. (2009).
An important question is whether Epstein–Zin preferences are well defined. In particular, what conditions do we need on primitives such that the Koopmans operator K in (7.13) has a unique fixed point?
where h∈RX. This is equivalent to (7.13) when h=(1−β)cα. To avoid fractional powers of negative numbers, we assume throughout that h⩾0.
The set V is called the interior of the positive cone of RX.
The operator K is difficult to work with for two reasons. First, linear and nonlinear transformations are intertwined. Second, there are several cases for the parameters that we need to handle in order to understand stability. Nonetheless, by applying a smooth transformation, we will find it easy to show that the Epstein–Zin Koopmans operator K is globally stable under mild conditions. In particular,
Proposition 7.2.3 implies that Epstein–Zin utility is well-defined under the stated conditions and, moreover, that the solution can be computed via successive approximation on K. Listing 2 provides code for performing this operation. Figure 7.4 shows convergence of the sequence of iterates to the fixed point v∗, under the parameters in Listing 2, given an initial condition v0. The figure plots every 10th iterate, repeated 100 times.
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
include("s_approx.jl")
using LinearAlgebra, QuantEcon
function create_ez_utility_model(;
n=200, # size of state space
ρ=0.96, # correlation coef in AR(1)
σ=0.1, # volatility
β=0.99, # time discount factor
α=0.75, # EIS parameter
γ=-2.0) # risk aversion parameter
mc = tauchen(n, ρ, σ, 0, 5)
x_vals, P = mc.state_values, mc.p
c = exp.(x_vals)
return (; β, ρ, σ, α, γ, c, x_vals, P)
end
function K(v, model)
(; β, ρ, σ, α, γ, c, x_vals, P) = model
R = (P * (v.^γ)).^(1/γ)
return ((1 - β) * c.^α + β * R.^α).^(1/α)
end
function compute_ez_utility(model)
v_init = ones(length(model.x_vals))
v_star = successive_approx(v -> K(v, model),
v_init,
tolerance=1e-10)
return v_star
end
Program 2:Epstein--Zin utility model and Koopmans operator (ez_utility.jl)
Figure 7.4:Convergence of Koopmans iterates for Epstein–Zin utility
The operator K^ is simpler to work with than K because it unifies α,γ into a single parameter θ and decomposes the Epstein–Zin update rule into two parts: a linear map P and a separate nonlinear component.
While we can consider studying stability of K^ using contraction arguments, this approach fails under useful parameterizations. To illustrate, suppose that X={x1}. Then h is a constant, P is the identity, v is a scalar and K^v=F(v) with F(v)={h+βv1/θ}θ, as shown in Figure 7.5. Here θ=5, h=0.5 and β=0.5. We see that K^ has infinite slope at zero, so the contraction property fails.[2]
Figure 7.5:Shape properties of K^ in one dimension
We have discussed two well-known examples of recursive preferences. In this section we build a general representation. While various constructions can be found in the decision theory literature, many are not well suited to quantitative work. Here we give a relatively parsimonious operator-theoretic definition.
In Section 7.2.2.3 and Section 7.2.3.1 we met risk-sensitive and Epstein–Zin Koopmans operators respectively. In this section, we provide a general definition of a Koopmans operator that will contain these two examples as special cases.
We begin by outlining structure that can be combined to generate Koopmans operators in a Markov environment. The two key components are an aggregation function and a certainty equivalent operator. We then build Koopmans operators from these primitives and connect them to applications. In every setting we consider, lifetime value is identified with the unique fixed point of the Koopmans operator (whenever it exists).
The first primitive we consider is a generalization of conditional expectations: Given V⊂RX, we define a certainty equivalent operator on V to be a self-map R on V such that
(i) R is order preserving on V and
(ii) all constants are fixed under R (i.e., R(λ11)=λ11 for all λ∈R with λ11∈V).
The next example is nonlinear. It treats the risk-adjusted expectation that appears in risk-sensitive preferences.
The set of certainty equivalent operators on RX is invariant under convex combinations, as the next exercise asks you to confirm.
Later we will combine Lemma 7.3.1 with the fixed-point results for convex and concave operators in Section 7.1.2.2 to establish existence and uniqueness of lifetime values for certain kinds of Koopmans operators.
Let X be partially ordered and let iRX be the set of increasing functions in RX. Let V be such that iRX⊂V⊂RX and let R be a certainty equivalent on V. We call Rmonotone increasing if R is invariant on iRX. This extends the terminology in Section 3.2.1.3, where we applied it to Markov operators (cf., Exercise 3.2.4). The concept of monotone increasing certainty equivalent operators is connected to outcomes where lifetime preferences are increasing in the state.
We mentioned that Koopmans operators are typically constructed by combining a certainty equivalent operator and an aggregation function. Let’s now discuss the second of these components.
Given V⊂RX, an aggregatorA on V is a map A from X×R to R such that
(i) w(x)=A(x,v(x)) is in V whenever v∈V and
(ii) y↦A(x,y) is increasing for all x∈X.
Intuitively, an aggregator combines current state and continuation values to measure lifetime value.
Common types of aggregators include the
Leontief aggregatorAMIN(x,y)=min{r(x),βy} with r∈RX and β⩾0,
Uzawa aggregatorAUZAWA(x,y)=r(x)+b(x)y with r∈RX and b∈R+X, and
CES aggregatorACES(x,y)={r(x)α+βyα}1/α with r∈(0,∞)X, β⩾0 and α=0.
Here CES stands for “constant elasticity of substitution.” An important special case of both the CES and Uzawa aggregators is the
additive aggregatorAADD(x,y)=r(x)+βy with r∈RX and β⩾0.
From these basic types we can also build composite aggregators. For example, we might consider a CES-Uzawa aggregator of the form A(x,y)={r(x)α+b(x)yα}1/α with r,b∈RX, b⩾0 and α=0. As we will see in Section 7.3.3.3, the CES-Uzawa aggregator can be used to construct models with both Epstein–Zin utility and state-dependent discounting (as in, say, Albuquerque et al. (2016) or Schorfheide et al. (2018).)
We are now ready to build Koopmans operators by combining certainty equivalents and aggregators. Given V⊂RX, we call a self-map K on V a Koopmans operator if
for some aggregator A and certainty equivalent operator R on V. The expression in (7.18) means that (Kv)(x)=A(x,(Rv)(x)) at v∈V and x∈X.
It is generally appropriate to suppose that a uniform increase in continuation values will increase current value. This property holds for K in (7.18). In particular, it follows from the definitions of A and R that K is an order preserving self-map on V.
The CES aggregator is so-named because, in a static utility maximization problem where c and y are two goods and utility is U(c,y)=((1−β)cα+βyα)1/α, the elasticity of substitution is constant and given by 1/(1−α). In the present setting, where aggregation is across time, 1/(1−α) is usually called the elasticity of intertemporal substitution (EIS). The next exercise explains.
The fact that EIS =1/(1−α) under the CES aggregator is significant because the EIS can be measured from data using regression and other techniques. While estimates vary significantly, the detailed meta-analysis by Havranek et al. (2015) suggests 0.5 as a plausible average value for international studies, with rich countries tending slightly higher. Basu & Bundick (2017) use 0.8 when calibrating to US data. Under these estimates, the relationship EIS =1/(1−α) implies a value for α between -1.0 and -0.25.
In Section 7.3.1.5 we constructed a generic Koopmans operator using an aggregator and a certainty equivalent operator. In this section, we connect this Koopmans operator to lifetime values and discuss the significance of global stability.
To begin, fix set X and function class V⊂RX. Let K=A∘R be a Koopmans operator for some aggregator A and certainty equivalent operator R on V. The lifetime value generated by K is the unique fixed point of K in V, whenever it exists. Given such a v, the value v(x) is interpreted as lifetime value conditional on initial state x.
In many applications, our existence and uniqueness proofs for fixed points of K will also establish global stability. For Koopmans operators, global stability has the following interpretation: for w∈V, m∈N and x∈X, the value (Kmw)(x) gives total finite-horizon utility over periods 0,…,m under the preferences embedded in K, with initial state x and terminal condition w. Hence global stability implies that, for any choice of terminal condition, finite-horizon utility converges to infinite-horizon utility as the time horizon converges to infinity. The next exercise helps to illustrate this point.
Exercise 7.3.15 confirms that, at least for the time additive case, global stability of K is equivalent to the statement that a finite-horizon valuation with arbitrary terminal condition w converges to the infinite-horizon valuation.
Let X=(X,⪯) be partially ordered, let iRX be the set of increasing functions in RX, and let V be such that iRX⊂V⊂RX. Let K be a Koopmans operator on V, so that Kv=A∘R for some aggregator A and certainty equivalent operator R on V. Suppose that K has a unique fixed point v∗∈V. A natural question is: when is v∗ increasing in the state?
Let R be a certainty equivalent operator on V=RX and let A be an aggregator on V. Let K be the Koopmans operator on V defined by (Kv)(x)=A(x,(Rv)(x)). When R is constant-subadditive, we can often establish global stability of K on V via a contraction mapping argument. This section gives details.
for β∈(0,1), τ∈[0,1], r∈RX and Rτ as described in Exercise 7.3.4. Since Rτ is constant-subadditive (Exercise 7.3.7) and the additive aggregator is Blackwell, Kτ is globally stable (Proposition 7.3.3). The operator Kτ represents quantile preferences, as described in Castro & Galvao (2019) and other studies (see Section 7.4). The value τ parameterizes attitude to risk, a point we return to in Section 8.2.1.4.
Let V=RX and suppose R=P for some P∈M(RX), so that R is ordinary conditional expectations. Then K becomes Kv=r+Lv where L∈L(RX) with L(x,x′)=b(x)P(x,x′). By Exercise 1.2.17, K is globally stable on V whenever ρ(L)<1.
This kind of structure arises when households derive utility from a consumption path while their discount factor fluctuates according to some state variable (see, e.g., Krusell & Smith (1998), Toda (2019), Cao (2020), and Hubmer et al. (2020)). For a given consumption path (Ct), lifetime values takes the form
where u is a flow utility function and {βt} is a discount factor process. Suppose Ct=c(Xt) and βt=b(Xt) where b⩾0 and (Xt) is P-Markov for some P∈M(RX). Set r:=u∘c and L(x,x′):=b(x)P(x,x′). By Theorem 6.1.1, the condition ρ(L)<1 implies that v in (7.22) is the unique fixed point of Kv=r+Lv=r+bPv. In other words, lifetime value under (7.22) is the unique fixed point of the Koopmans operator when the aggregator is of Uzawa type and the certainty equivalent is conditional expectation.
How does this relate to optimization? Recall our discussion of state-dependent MDPs in Chapter 6. There, the policy operator Tσ in (6.16) is a special case of (7.21) when the discount factor depends only on the current state and action.
With some additional requirements, the condition ρ(L)<1 is necessary as well as sufficient for existence of a unique fixed point for Kv=r+Lv. Indeed, if b≫0 and P is irreducible, then L is also irreducible and a positive linear operator. Applying Lemma 6.1.4, we see that r≫0 and ρ(L)⩾1 implies Kv=r+Lv has no fixed point in V:={v∈RX:v≫0}.
Now consider Kv=r+bRv from (7.21) when R is not in M(RX). Here bRv is the pointwise product, so that (bRv)(x)=b(x)(Rv)(x) for all x.
We cannot use Proposition 7.3.3 to prove stability of K unless b(x)<1 for all x∈X. Since this condition is rather strict, we now study weaker conditions that can be valid even when b exceeds 1 in some states. Specifically, we consider
(a) bRv⩽c+Lv for some c∈RX and L∈L(RX) with ρ(L)<1.
(b) r≫0 and R is concave on R+X.
Let V=[0,vˉ] where vˉ:=(I−L)−1(r+c).
7.3.3.3Epstein–Zin Preferences with State-Dependent Discounting¶
Combining the CES-Uzawa aggregator A(x,y)={r(x)α+b(x)yα}1/α with the Kreps–Porteus certainty equivalent operator leads to the Koopmans operator
A fixed point of K corresponds to lifetime value for an agent with Epstein–Zin preferences and state-dependent discounting. (Such set ups are used in research on macroeconomic dynamics and asset pricing – see Section 7.4 for more details).
In what follows we take V=(0,∞)X and assume that h,b∈V and P is irreducible.
To discuss stability of K we introduce the operator B∈L(RX) defined by
(Bv)(x):=b(x)θx′∑v(x′)P(x,x′)whereθ:=αγ.
To prove Proposition 7.3.5, we proceed as in Section 7.2.3.3, constructing a conjugate operator K^ and proving stability of the latter. For this purpose, we introduce
The time additive preference structure in Section 7.2.1 was popularized by Samuelson (1939), who built on earlier work by Fisher (1930) and Ramsey (1928). An axiomatic foundation was supplied by Koopmans (1960). Bastianello & Faro (2023) study the foundations of discounted expected utility (DEU) from a purely subjective framework.
Problems with the time additive DEU model include non-constant discounting, as discussed in Section 6.4, as well as sign effects (gains being discounted more than losses) and magnitude effects (small outcomes being discounted more than large ones. See, for example, Thaler (1981) and Benzion et al. (1989). A critical review of the time additive model and a list of many references can be found in Frederick et al. (2002).
In the stochastic setting, the time additive framework is a subset of the expected utility model (Von Neumann & Morgenstern (1944), Friedman (1956), Savage (1951)). There are many well documented departures from expected utility in experimental data. See the start of Andreoni & Sprenger (2012) and the article Ericson & Laibson (2019) for an introduction to the literature. An interesting historical discussion of time additive expected utility can be found in Becker et al. (1989).
(It is ironic that those most responsible for popularizing the time additive DEU framework have also been among the most critical. For example, Samuelson (1939) stated that it is “completely arbitrary” to assume that the DEU specification holds. He goes on to claim that, in the analysis of savings and consumption, it is “extremely doubtful whether we can learn much from considering such an economic man.” In addition, Stokey & Lucas (1989), whose work helped to standardize DEU as a methodology for quantitative analysis, argued in a separate study that DEU is attractive only because of its relative simplicity Lucas & Stokey, 1984.)
Do the departures from time additive expected utility found in experimental data actually matter for quantitative work? Evidence suggests that the answer is affirmative. In macroeconomics and asset pricing in particular, researchers increasingly use non-additive preferences in order to bring model outputs closer to the data. For example, many quantitative models of asset pricing rely heavily on Epstein–Zin preferences. Representative examples include Epstein & Zin (1991), Tallarini Jr (2000), Bansal & Yaron (2004), Hansen et al. (2008), Bansal et al. (2012), Schorfheide et al. (2018), and Groot et al. (2022). Alternative numerical solution methods are discussed in Pohl et al. (2018).
An excellent introduction to recursive preference models can be found in Backus et al. (2004). Our use of the term “Koopmans operator,” which is not entirely standard, honors early contributions by Nobel laureate Tjalling Koopmans on recursive preferences (see Koopmans (1960) and Koopmans et al. (1964)).
Theoretical properties of recursive preference models have been studied in many papers, including Epstein & Zin (1989), Weil (1990), Boyd (1990), Hansen & Scheinkman (2009), Marinacci & Montrucchio (2010), Bommier et al. (2017), Bloise & Vailakis (2018), Marinacci & Montrucchio (2019), Pohl et al. (2019), Balbus (2020), Borovička & Stachurski (2020), DeJarnette et al. (2020), Christensen (2022), and Becker & Rincon-Zapatero (2023). The paper by Marinacci & Montrucchio (2019) provides a useful alternative approach to existence of unique fixed points in the setting of order preserving maps. Experimental results on Epstein–Zin preferences can be found in Meissner & Pfeiffer (2022).
There is a strong connection between risk-sensitive preferences and the literature on robust control. See, for example, Cagetti et al. (2002), Hansen & Sargent (2007), and Barillas et al. (2009). We return to this point in Chapter 8.
The quantile preferences we considered in Section 7.3.2.3 have been analyzed in static and dynamic settings by Giovannetti (2013), Castro & Galvao (2019), Castro & Galvao (2022) and Castro et al. (2022). Recursive components of the analysis of quantile and Uzawa preference models build on the study of monotone preferences in Bommier et al. (2017).
Some recursive preference specifications involve ambiguity aversion. An introduction to this literature and its applications can be found in Klibanoff et al. (2009), Hayashi & Miao (2011), Hansen & Miao (2018), Bommier et al. (2019) and Hansen & Sargent (2020). Marinacci et al. (2023) discuss the connection between recursivity and attitudes to uncertainty. We discuss ambiguity again in Chapter 8.
Recursive preferences are increasingly applied outside the field of asset pricing, where they first came to prominence. See, for example, Bommier & Villeneuve (2012), Colacito et al. (2018), Jensen (2019), or Augeraud-Véron et al. (2019).
The coin flip application in Section 7.2.1.2 is related to correlation aversion, as discussed in Stanca (2023), and preference for “consumption spreads” as reviewed in Frederick et al. (2002).
Some applications of Theorem 7.1.3 to network analysis can be found in Sargent & Stachurski (2023).
Epstein–Zin preferences were popularized in Epstein & Zin (1989). They are a special case of preferences defined by Kreps & Porteus (1978). Further discussion can be found in Section 7.4.
We could try to truncate the interval to a neighborhood of the fixed point and hope that K^ is a contraction when restricted to this interval. But in higher dimensions we are not sure that a fixed point exists for a broad range of parameters, which makes this idea hard to implement.
Barbu, V., & Precupanu, T. (2012). Convexity and Optimization in Banach Spaces. Springer Science & Business Media.
Fajgelbaum, P. D., Schaal, E., & Taschereau-Dumouchel, M. (2017). Uncertainty traps. The Quarterly Journal of Economics, 132(4), 1641–1692.
Du, Y. (1990). Fixed points of increasing operators in ordered Banach spaces and applications. Applicable Analysis, 38(01–02), 1–20.
Zhang, Z. (2012). Variational, Topological, and Partial Order Methods with Their Applications (Vol. 29). Springer.
Stachurski, J., Wilms, O., & Zhang, J. (2022). Unique solutions to power-transformed affine systems. arXiv, 2212.00275.
Kleinman, B., Liu, E., & Redding, S. J. (2023). Dynamic spatial general equilibrium. Econometrica, 91(2), 385–424.
Liao, J., & Berg, A. (2018). Sharpening Jensen’s inequality. The American Statistician.
Tallarini Jr, T. D. (2000). Risk-sensitive real business cycles. Journal of Monetary Economics, 45(3), 507–532.
Barillas, F., Hansen, L. P., & Sargent, T. (2009). Doubts or variability? Journal of Economic Theory, 144(6), 2388–2418.
Kreps, D. M., & Porteus, E. L. (1978). Temporal resolution of uncertainty and dynamic choice theory. Econometrica, 46(1), 185–200.
Bullen, P. S. (2003). Handbook of Means and Their Inequalities (Vol. 560). Springer Science & Business Media.
Föllmer, H., & Knispel, T. (2011). Entropic risk measures: Coherence vs. convexity, model ambiguity and robust large deviations. Stochastics and Dynamics, 11(02n03), 333–351.
Albuquerque, R., Eichenbaum, M., Luo, V. X., & Rebelo, S. (2016). Valuation risk and asset pricing. The Journal of Finance, 71(6), 2861–2904.
Schorfheide, F., Song, D., & Yaron, A. (2018). Identifying long-run risks: A Bayesian mixed-frequency approach. Econometrica, 86(2), 617–654.
Havranek, T., Horvath, R., Irsova, Z., & Rusnak, M. (2015). Cross-country heterogeneity in intertemporal substitution. Journal of International Economics, 96(1), 100–118.