Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

7 Nonlinear Valuation

Authors
Affiliations
New York University
Australian National University

Dynamic programs are optimization problems where the objective to be maximized is lifetime value. As such, one key topic is how to combine a sequence of rewards into a corresponding lifetime value. So far we have considered linear valuation based on summation over expected discounted rewards, using either constant discount rates (Chapter 1--Chapter 5) or state-dependent discounting (Chapter 6). In this chapter, we consider extensions, where lifetime value is computed from a recursion over the reward sequence instead of a discounted sum. This “recursive preference” approach permits far more general specifications of lifetime value, and is becoming increasingly popular in economics, finance, and computer science (see, e.g., Section 6.4).

This chapter focuses purely on valuation (i.e., combining reward sequences into lifetime values), rather than optimization. Later, in Chapter 8, we will show how to maximize lifetime value in settings where recursive preferences are adopted.

Throughout this chapter, the symbol X\Xsf always represents a finite set.

7.1Beyond Contraction Maps

The most natural way to express lifetime value in recursive preference environments is as a fixed point of a (typically nonlinear) operator. One challenge is that some recursive preference specifications induce operators that fail to be contractions. For this reason, we now invest in additional fixed point theory. All of this theory concerns order preserving maps, since the operators we consider always inherit monotonicity from underlying preferences.

7.1.1Knaster--Tarski for Function Space

If you try to draw an increasing function that maps [0,1][0,1] to itself without touching the 45-degree line, you will find it impossible. Below we state a famous fixed-point theorem due to Bronislaw Knaster (1893--1980) and Alfred Tarski (1901--1983) that generalizes this idea. In the statement, X\Xsf is a finite set and V[v1,v2]V \coloneq [v_1, v_2], where v1,v2v_1, v_2 are functions in RX\RR^\Xsf with v1v2v_1 \leq v_2.

Unlike, say, the fixed-point theorem of Banach (Section 1.2.2.3), Theorem 7.1.1 only yields existence. Uniqueness does not hold in general, as you can easily confirm by sketching the one-dimensional case or completing the following exercise.

7.1.2Concavity, Convexity, and Stability

In this section, we study sufficient conditions for global stability that replace contractivity with shape properties such as concavity and monotonicity. To build intuition, we start with the one-dimensional case and show how these properties can be combined to achieve stability. Readers focused on results can safely skip to Section 7.1.2.2.

7.1.2.1The One-Dimensional Case

In Section 1.2.3.2, we showed that concavity and monotonicity can yield global stability for the Solow--Swan model. Here is a more general result.

Global stability induced by increasing concave functions

Figure 7.1:Global stability induced by increasing concave functions

Figure Figure 7.1 gives one example, where g(x)=1+x/2g(x) = 1 + \sqrt{x}/2. The conditions of Proposition 7.1.2 hold because, given any x>0x > 0, we can find an aa in (0,x)(0,x) that gets mapped strictly up (i.e., g(a)g(a) is above the 45-degree line) and a point b>xb > x that gets mapped down (i.e., g(b)g(b) is below the 45-degree line).

7.1.2.2The Multidimensional Case

Proposition 7.1.2 extends to multiple dimensions. In this section, we present a multidimensional version that covers both convex and concave functions.

To state our result, we extend the definition of convexity and concavity to vector-valued self-maps. The definitions mirror those for scalar-valued functions: A self-map TT on a convex subset DD of RX\RR^\Xsf is called convex if

T(λu+(1λ)v)λTu+(1λ)Tv whenever u,vD and λ[0,1];T(\lambda u + (1-\lambda) v) \leq \lambda Tu + (1-\lambda) Tv \text{ whenever } u,v \in D \text{ and } \lambda \in [0, 1];

and concave if

λTu+(1λ)TvT(λu+(1λ)v) whenever u,vD and λ[0,1].\lambda Tu + (1-\lambda) Tv \leq T(\lambda u + (1-\lambda) v) \text{ whenever } u,v \in D \text{ and } \lambda \in [0, 1].

Here \leq is, as usual, the pointwise order.

We are now ready to state our next fixed-point result, which was first proved in an infinite-dimensional setting by Du (1990). In the statement, X\Xsf is a finite set, V[v1,v2]V \coloneq [v_1, v_2] is a nonempty order interval in (RX,)(\RR^\Xsf, \leq), and TT is a self-map on VV.

Conditions (i) and (ii) are similar -- in fact (ii) holds whenever (i) holds, so (ii) is the weaker (but slightly more complicated) condition. Conditions (iii) and (iv) are similar in the same sense. Figure Figure 7.2 illustrates the convex and the concave versions of the result in one dimension. We encourage you to sketch your own variations to understand the roles that different conditions play.

Du’s theorem: convex and concave cases

Figure 7.2:Du’s theorem: convex and concave cases

A full proof of Theorem 7.1.3 can be found in Du (1990) or Theorem 2.1.2 and Corollary 2.1.1 of Zhang (2012). In our setting, existence follows from the Knaster--Tarski theorem. We prove uniqueness.

7.1.3A Power-Transformed Affine Equation

Du’s theorem provides conditions under which concave or convex order preserving self-maps on order intervals attain global stability. In this section we study maps of this type that have additional structure. While this additional structure is restrictive, it allows us to obtain global stability on unbounded subsets rather than order intervals.

To begin, let X\Xsf be a finite set and consider the equation

v=[h+(Av)1/θ]θ(vV),v = [h + (A v)^{1/\theta}]^\theta \qquad (v \in V),

where θ\theta is a nonzero parameter, AL(RX)A \in \lopx with A0A \geq 0, V=(0,)XV = (0, \infty)^\Xsf, and hVh \in V. This system reduces to the affine model studied in Lemma 6.1.4 when θ=1\theta = 1.

To analyze (7.7), we introduce the self-map

Gv=[h+(Av)1/θ]θ(vV).Gv = [h + (A v)^{1/\theta}]^\theta \qquad (v \in V).

Continuing to assume that h0h \gg 0 and AA is a positive linear operator, we can use Du’s theorem to establish the next result (which generalizes Lemma 6.1.4 on page  ).

The key to proving (i) implies (ii) is that GG is order preserving and either convex or concave, depending on the value of θ\theta. The remaining conditions in Du’s theorem are established over order intervals using ρ(A)1/θ<1\rho(A)^{1/\theta} < 1. By applying an approximation argument, global stability is extended from order intervals to all of VV. Some of these details are contained in the following exercises and a full proof can be found in Stachurski et al. (2022).

Let

Fx(t)={h(x)+t1/θ}θ(t>0).F_x(t) = \left\{ h(x) + t^{1/\theta} \right\}^\theta \qquad (t > 0).

7.2Recursive Preferences

In this section, we compute lifetime values associated with given reward processes in settings that involve nonlinear recursions. These nonlinear recursions are called recursive preferences. We will show how some common specifications of recursive preferences can be translated into lifetime valuations via the fixed-point methods introduced in Chapter 2 and Section 7.1.

7.2.1Motivation: Optimal Savings

We motivate recursive preference models by analyzing consumption decisions.

7.2.1.1A Recursive View of a Standard Model

The time additive model of valuation in Section 3.2.2.3 can be studied from a purely recursive point of view. As a starting point, we state that the value VtV_t of current and future consumption is defined at each point in time tt by the recursion

Vt=u(Ct)+βEtVt+1.V_t = u(C_t) + \beta \, \EE_t V_{t+1}.

The random variables VtV_t and Vt+1V_{t+1} are the unknown objects in this expression. The expectation Et\EE_t conditions on X0,,XtX_0, \ldots, X_t and Ct=c(Xt)C_t = c(X_t). The process (Xt)t0(X_t)_{t \geq 0} is PP-Markov.

Since consumption is a function of (Xt)t0(X_t)_{t \geq 0} and knowledge of the current state XtX_t is sufficient to forecast future values (by the Markov property), it is natural to guess that VtV_t will depend on the Markov chain only through XtX_t. Hence we guess there is a solution of (7.15) takes the form Vt=v(Xt)V_t = v(X_t) for some vRXv \in \RR^\Xsf.

(Here vv is an ansatz, meaning “educated guess.” First we guess the form of a solution and then we try to verify that the guess is correct. So long as we carry out the second step, starting with a guess brings no loss of rigor.)

Under this conjecture, (7.15) can be rewritten as v(Xt)=u(c(Xt))+βEtv(Xt+1)v(X_t) = u(c(X_t)) + \beta \EE_t v(X_{t+1}). Conditioning on Xt=xX_t = x and setting rucr \coloneq u \circ c, this becomes

v(x)=r(x)+βExv(Xt+1)=r(x)+β(Pv)(x)(xX).v(x) = r(x) + \beta \, \EE_x \, v(X_{t+1}) = r(x) + \beta (P v) (x) \qquad (x \in \Xsf).

In vector form, we get v=r+βPvv = r + \beta P v. From the Neumann series lemma, the solution is v=(IβP)1rv^* = (I - \beta P)^{-1} r, which is identical to (3.40).

In summary, (7.15) and the sequential representation (3.39) specify the same lifetime value for consumption paths.

While the recursive formulation in (7.15) now seems redundant, since it produces the same specification that we obtained from the sequential approach, the recursive set up gives us a formula to build on, and hence a pathway to overcoming limitations of the time additive approach. Most of the rest of this chapter will be focused on this agenda.

Pursuing this agenda will produce preferences over consumption paths where the sequential approach has no natural counterpart. This occurs when current value VtV_t is nonlinear in current rewards and continuation values (unlike the linear specification (7.15)). Such specifications are called recursive preferences. When dealing with recursive preference models, the lack of a sequential counterpart means that we are forced to proceed recursively.

7.2.1.2Limitations of Time Additive Preferences

In the previous section, we discussed how the time additive preference specification

v(x)=Ext0βtu(Ct),v(x) = \EE_x \sum_{t \geq 0} \beta^t u(C_t),

also called the discounted expected utility model, can be framed recursively, and how this provides a pathway to go beyond the time additive specification. We are motivated to do so because the time additive specification has been rejected by experimental and observational data in many settings.

In this section, we highlight some of the limitations of time additive preferences. While our discussion is only brief, more background and a list of references can be found in Section 7.4.

One issue with (7.18) is the assumption of a constant positive discount rate, which has been refuted by a long list of empirical studies. This issue was discussed in Section 6.4.

Another limitation of time additive preferences is that agents are risk-neutral in future utility (see, e.g., (7.15), where current value depends linearly on future value). Although risk aversion over consumption can be built in through curvature of uu, this same curvature also determines the elasticity of intertemporal substitution, meaning that the two aspects of preferences cannot be separated. We elaborate on this point in Section 7.3.1.4.

A third issue with time additivity is that agents with such preferences are indifferent to any variation in the joint distribution of rewards that leaves marginal distributions unchanged. To get a sense of what this means, suppose you accept a new job and will be employed by this firm for the rest of your life. Your daily consumption will be entirely determined by your daily wage. Your boss offers you two options:

If you have a strict preference between options A and B, then your choice cannot be rationalized with time additive preferences.

To see why, let ϕ\phi be a probability distribution that represents the lottery just described, putting mass 0.5 on 10,000 and mass 0.5 on 1. Under option A, consumption (Ct)t1(C_t)_{t \geq 1} is given by Ct=C1C_t = C_1 for all tt, where C1ϕC_1 \sim \phi. Under option B, consumption (Ct)t1(C_t)_{t \geq 1} is an iid sequence drawn from ϕ\phi. Either way, lifetime utility is

Et1βtu(Ct)=t1βtEu(Ct)=βuˉ1β,\EE \sum_{t \geq 1} \beta^t u(C_t) = \sum_{t \geq 1} \beta^t \EE u(C_t) = \frac{\beta \bar u}{1-\beta},

where uˉEu(C1)=u(1)/2+u(10,000)/2\bar u \coloneq \EE u(C_1) = u(1)/2 + u(10,000)/2.

The critical part of this argument is the passing of expectations through the sum, which uses time additivity . The implication is that lifetime utility depends only on the marginal distribution of each CtC_t, rather than on the joint distribution of the stochastic process (Ct)t0(C_t)_{t \geq 0}.

7.2.2Risk-Sensitive Preferences

Having motivated recursive preferences, let’s turn to our first example: risk-sensitive preferences. For the consumption problem described in Section 7.2.1.1, imposing risk-sensitive preferences means replacing the recursion v=r+βPvv = r + \beta Pv for vv with

v(x)=r(x)+β1θln{xexp(θv(x))P(x,x)}(xX).v(x) = r(x) + \beta \frac{1}{\theta} \ln \left\{ \sum_{x'} \exp(\theta v(x')) P(x, x') \right\} \qquad (x \in \Xsf).

As before, r(x)=u(c(x))r(x) = u(c(x)) represents current utility when the current state is xx. The parameter θ\theta is a nonzero constant in R\RR.

In (7.20), the transform f(v)=exp(θv)f(v) = \exp(\theta v) is applied to vv before expectation is taken. After the expectation is computed, the transform is undone via f1(v)=(1/θ)ln(v)f^{-1}(v) = (1/\theta) \ln (v). We will show that the agent can be either risk-averse or risk-loving with respect to future outcomes, depending on the value of θ\theta.

7.2.2.1Lifetime Utility

We understand the functional equation (7.20) as “defining” lifetime utility under risk-sensitive preferences. A function vv solving (7.20) gives a lifetime valuation v(x)v(x) to each xXx \in \Xsf, with the interpretation that v(x)v(x) is lifetime utility conditional on initial state xx. This definition of lifetime value is by analogy to the time additive case studied in Section 7.2.1.1, where the function vv solving v=r+βPvv = r + \beta P v measures lifetime utility from each initial state.

In the previous paragraph we wrote “defining” in scare quotes because we can’t be sure we have a definition at this point. Just because we write down a recursive expression for lifetime utility doesn’t mean that corresponding lifetime utility is actually well defined. (For example, we can happily write down the recursive vector equation v=v+1v = v + \1 but no vector vv solving this equation exists.) One aim of this chapter is to provide conditions under which recursions like (7.20) have solutions.

Another issue is uniqueness. Suppose that (7.20) has many solutions. In this case the predictions of the utility model are ambiguous. Our perspective is that the recursive preference specification (7.20) is not correctly formulated unless existence and uniqueness hold. We return to this point in Section 7.2.2.3.

One final comment: even if we can find a vv that solves (7.20), the nonlinearities introduced by risk sensitivity imply that there will be no neat sequential representation analogous to v(x)=Extβtu(Ct)v(x) = \EE_x \sum_t \beta^t u(C_t) from the time additive case. (This connects to Remark 7.2.1, where we discuss recursive preference terminology.)

7.2.2.2Risk-Adjusted Expectation

We want to understand the “expectation-like” expression on the right hand side of (7.20) that replaces the ordinary conditional expectation xv(x)P(x,x)\sum_{x'} v(x') P(x, x') from the time additive case. To this end, we define, for arbitrary random variable ξ\xi and nonzero θR\theta \in \RR,

Eθ[ξ]=1θln{E[exp(θξ)]}.\eE_\theta [\xi] = \frac{1}{\theta} \ln \left\{ \EE [ \exp(\theta \xi) ] \right\}.

The value Eθ[ξ]\eE_\theta[\xi] is called the entropic risk-adjusted expectation of ξ\xi given θ\theta.

The key idea behind the entropic risk-adjusted expectation is that decreasing θ\theta lowers appetite for risk and increasing θ\theta does the opposite.

Expression (7.22) shows that, for the Gaussian case, Eθ[ξ]\eE_\theta[ \xi] equals the mean plus a term that penalizes variance when θ<0\theta < 0 and rewards it when θ>0\theta > 0.

More generally, we have the following result.

7.2.2.3Existence and Uniqueness

Let’s return to investigating lifetime utility under risk-sensitive preferences. To this end, we introduce the risk-sensitive Koopmans operator KθK_\theta on RX\RR^\Xsf via

(Kθv)(x)=r(x)+β1θln{xexp(θv(x))P(x,x)}(xX).(K_\theta \, v)(x) = r(x) + \beta \frac{1}{\theta} \ln \left\{ \sum_{x'} \exp(\theta v(x')) P(x, x') \right\} \qquad (x \in \Xsf).

Evidently, for given nonzero θ\theta, a function vRXv \in \RR^\Xsf solves the risk-sensitive preference lifetime utility specification (7.20) if and only if vv is a fixed point of KθK_\theta. This explains the significance of the following result:

We postpone a proof of Proposition 7.2.2 because we will prove a more general result in Section 7.3.2.2. For now we note the following implications.

  1. For each nonzero θ\theta, lifetime utility is both well-defined and uniquely defined for risk-sensitive preferences (i.e., (7.20) has a unique solution).

  2. The unique solution, denoted henceforth by vv^*, can be computed by successive approximation using KθK_\theta.

7.2.2.4The Gaussian Case

As a tractable case, let’s suppose that r(x)=xr(x) = x and that Xt+1=ρXt+σWt+1X_{t+1} = \rho X_t + \sigma W_{t+1} where (Wt)t1(W_t)_{t \geq 1} is iid and standard normal. Here ρ<1|\rho| < 1 and σ0\sigma \geq 0 controls volatility of the state. Rather than discretizing the state process, we leave it as continuous and proceed by hand.

In this setting, the functional equation (7.20) for vv becomes

v(x)=x+βEθ[v(ρx+σW)],v(x) = x + \beta \eE_\theta[ v(\rho x + \sigma W)],

for each xXx \in \Xsf, where WW is standard normal.

Since ρx+σW\rho x + \sigma W is Gaussian, the expression (7.22) for the risk-adjusted expectation of a normal random variable leads us to conjecture that the solution vv will be affine, i.e., v(x)=ax+bv(x) = a x + b for some a,bRa, b \in \RR. This conjecture turns out to be correct:

We can see that, under the stated assumptions, lifetime value vv is increasing in the state variable xx. However, impacts of the parameters generally depend on θ\theta. For example, if θ>0\theta > 0, increasing σ\sigma shifts up lifetime utility. If θ<0\theta < 0, then lifetime value decreases with σ\sigma. This is as we expect: Lifetime utility is affected positively or negatively by volatility, depending on whether or not the agent is risk averse or risk loving.

Figure Figure 7.3 shows the true solution v(x)=ax+bv(x) = ax + b to the risk-sensitive lifetime utility model, as well as an approximate fixed point from a discrete approximation. The discrete approximation is computed by applying successive approximation to KθK_\theta after discretizing the state process via Tauchen’s method. The parameters and discretization are shown in Listing 1.

Approximate and true solutions in the Gaussian case

Figure 7.3:Approximate and true solutions in the Gaussian case

1
2
3
4
5
6
7
8
9
10
11
12
13
using LinearAlgebra, QuantEcon

function create_rs_utility_model(;
        n=180,      # size of state space
        β=0.95,     # time discount factor
        ρ=0.96,     # correlation coef in AR(1)
        σ=0.1,      # volatility
        θ=-1.0)     # risk aversion
    mc = tauchen(n, ρ, σ, 0, 10)  # n_std = 10
    x_vals, P = mc.state_values, mc.p 
    r = x_vals      # special case u(c(x)) = x
    return (; β, θ, ρ, σ, r, x_vals, P)
end

Program 1:Risk sensitive utility model parameters (rs_utility.jl)

7.2.3Epstein--Zin Preferences

One of the most popular specifications of recursive preferences in quantitative research is Epstein--Zin utility.[1] This class of preferences has been used to study asset pricing, business cycles, monetary policy, fiscal policy, optimal taxation, climate policy, pension plans, and other topics. In this section, we introduce the Epstein--Zin specification and discuss how to solve it. We will see that the specification, while highly nonlinear, is nonetheless well behaved.

7.2.3.1Specification

With Epstein--Zin preferences, the relationship Vt=u(Ct)+βEtVt+1V_t = u(C_t) + \beta \EE_t V_{t+1} is replaced by

Vt={(1β)Ctα+β[EtVt+1γ]α/γ}1/α,V_t = \left\{ (1-\beta) C_t^\alpha + \beta [\EE_t V_{t+1}^\gamma]^{\alpha / \gamma} \right\}^{1/\alpha},

where γ\gamma, α\alpha are nonzero parameters and β(0,1)\beta \in (0,1). As for risk-sensitive preferences, lack of time additivity implies that there is no neat sequential representation for lifetime value. As a result, we must work directly with the recursive expression (7.29).

Assume as before that Ct=c(Xt)C_t = c(X_t), where cR+Xc \in \RR_+^\Xsf and (Xt)t0(X_t)_{t \geq 0} is PP-Markov on finite set X\Xsf. We conjecture a solution of the form Vt=v(Xt)V_t = v(X_t) for some vVR+Xv \in V \coloneq \RR_+^\Xsf. Under this conjecture, the Epstein--Zin Koopmans operator corresponding to (7.29) is

(Kv)(x)={(1β)c(x)α+β[xv(x)γP(x,x)]α/γ}1/α.(Kv)(x) = \left\{ (1-\beta) c(x)^\alpha + \beta \left[ \sum_{x'} v(x')^\gamma P(x, x') \right]^{\alpha/\gamma} \right\}^{1/\alpha}.

As will be discussed further in Section 7.3.1.1, the parameter γ\gamma governs risk aversion with respect to temporal gambles (where outcomes are resolved in the next period), while β\beta controls impatience and α\alpha parametrizes the intertemporal elasticity of substitution. The fact that all three parameters have distinct effects helps fit data. For example, see Tallarini Jr (2000) and Barillas et al. (2009).

An important question is whether Epstein--Zin preferences are well defined. In particular, what conditions do we need on primitives such that the Koopmans operator KK in (7.30) has a unique fixed point?

7.2.3.2Properties of the Koopmans Operator

To address this question we rewrite (7.30) in vector form as

Kv={h+β[Pvγ]α/γ}1/α,Kv = \left\{ h + \beta [P v^\gamma]^{\alpha /\gamma} \right\}^{1/\alpha},

where hRXh \in \RR^\Xsf. This is equivalent to (7.30) when h=(1β)cαh = (1-\beta) c^\alpha. To avoid fractional powers of negative numbers, we assume throughout that h0h \geq 0.

The set VV is called the interior of the positive cone of RX\RR^\Xsf.

The operator KK is difficult to work with for two reasons. First, linear and nonlinear transformations are intertwined. Second, there are several cases for the parameters that we need to handle in order to understand stability. Nonetheless, by applying a smooth transformation, we will find it easy to show that the Epstein--Zin Koopmans operator KK is globally stable under mild conditions. In particular,

A proof of Proposition 7.2.3 is provided in Section 7.2.3.3.

Proposition 7.2.3 implies that Epstein--Zin utility is well-defined under the stated conditions and, moreover, that the solution can be computed via successive approximation on KK. Listing 2 provides code for performing this operation. Figure Figure 7.4 shows convergence of the sequence of iterates to the fixed point vv^*, under the parameters in Listing 2, given an initial condition v0v_0. The figure plots every 10th iterate, repeated 100 times.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
include("s_approx.jl")
using LinearAlgebra, QuantEcon

function create_ez_utility_model(;
        n=200,      # size of state space
        ρ=0.96,     # correlation coef in AR(1)
        σ=0.1,      # volatility
        β=0.99,     # time discount factor
        α=0.75,     # EIS parameter
        γ=-2.0)     # risk aversion parameter

    mc = tauchen(n, ρ, σ, 0, 5) 
    x_vals, P = mc.state_values, mc.p 
    c = exp.(x_vals)      

    return (; β, ρ, σ, α, γ, c, x_vals, P)
end

function K(v, model)
    (; β, ρ, σ, α, γ, c, x_vals, P) = model

    R = (P * (v.^γ)).^(1/γ)
    return ((1 - β) * c.^α + β * R.^α).^(1/α)
end

function compute_ez_utility(model)
    v_init = ones(length(model.x_vals))
    v_star = successive_approx(v -> K(v, model), 
                               v_init, 
                               tolerance=1e-10)
    return v_star
end

Program 2:Epstein--Zin utility model and Koopmans operator (ez_utility.jl)

Convergence of Koopmans iterates for Epstein--Zin utility

Figure 7.4:Convergence of Koopmans iterates for Epstein--Zin utility

7.2.3.3Proof of the Stability Result

We prove Proposition 7.2.3 by

  1. introducing an operator K^\hat K obtained from KK via a smooth transformation,

  2. proving that (V^,K^)(\hat V, \hat K) and (V,K)(V, K) are topologically conjugate, and

  3. obtaining conditions under which K^\hat K is globally stable on VV.

Throughout this section, the assumptions of Proposition 7.2.3 are in force.

To begin we define K^\hat K via

K^v={h+β(Pv)1/θ}θwhereθγα.\hat K v = \left\{ h + \beta (P v)^{1/\theta} \right\}^\theta \qquad \text{where} \quad \theta \coloneq \frac{\gamma}{\alpha}.

The operator K^\hat K is simpler to work with than KK because it unifies α,γ\alpha, \gamma into a single parameter θ\theta and decomposes the Epstein--Zin update rule into two parts: a linear map PP and a separate nonlinear component.

7.2.3.4Why Not Use Contractivity?

While we can consider studying stability of K^\hat K using contraction arguments, this approach fails under useful parameterizations. To illustrate, suppose that X={x1}\Xsf = \{x_1\}. Then hh is a constant, PP is the identity, vv is a scalar and K^v=F(v)\hat K v = F(v) with F(v)={h+βv1/θ}θF(v) = \left\{ h + \beta v^{1/\theta} \right\}^\theta, as shown in Figure Figure 7.5. Here θ=5\theta = 5, h=0.5h=0.5 and β=0.5\beta=0.5. We see that K^\hat K has infinite slope at zero, so the contraction property fails.[2]

Shape properties of \hat K in one dimension

Figure 7.5:Shape properties of K^\hat K in one dimension

7.3General Representations

We have discussed two well-known examples of recursive preferences. In this section we build a general representation. While various constructions can be found in the decision theory literature, many are not well suited to quantitative work. Here we give a relatively parsimonious operator-theoretic definition.

7.3.1Koopmans Operators

In Section 7.2.2.3 and Section 7.2.3.1 we met risk-sensitive and Epstein--Zin Koopmans operators respectively. In this section, we provide a general definition of a Koopmans operator that will contain these two examples as special cases.

We begin by outlining structure that can be combined to generate Koopmans operators in a Markov environment. The two key components are an aggregation function and a certainty equivalent operator. We then build Koopmans operators from these primitives and connect them to applications. In every setting we consider, lifetime value is identified with the unique fixed point of the Koopmans operator (whenever it exists).

7.3.1.1Certainty Equivalents

The first primitive we consider is a generalization of conditional expectations: Given VRXV \subset \RR^\Xsf, we define a certainty equivalent operator on VV to be a self-map RR on VV such that

  1. RR is order preserving on VV and

  2. all constants are fixed under RR (i.e., R(λ1)=λ1R\, ( \lambda \1) = \lambda \1 for all λR\lambda \in \RR with λ1V\lambda \1 \in V).

The next example is nonlinear. It treats the risk-adjusted expectation that appears in risk-sensitive preferences.

The set of certainty equivalent operators on RX\RR^\Xsf is invariant under convex combinations, as the next exercise asks you to confirm.

7.3.1.2Properties

Let VV be a convex cone in RX\RR^\Xsf. A certainty equivalent operator RR on VV is called

In some instances, a certainty equivalent operator is either convex or concave in the sense of Section 7.1.2.2.

Combining Exercise 7.29 and Example 7.3.5, we have proved

Later we will combine Lemma 7.3.1 with the fixed-point results for convex and concave operators in Section 7.1.2.2 to establish existence and uniqueness of lifetime values for certain kinds of Koopmans operators.

7.3.1.3Monotonicity

Let X\Xsf be partially ordered and let iRXi\RR^\Xsf be the set of increasing functions in RX\RR^\Xsf. Let VV be such that iRXVRXi\RR^\Xsf \subset V \subset \RR^\Xsf and let RR be a certainty equivalent on VV. We call RR monotone increasing if RR is invariant on iRXi\RR^\Xsf. This extends the terminology in Section 3.2.1.3, where we applied it to Markov operators (cf., Exercise 3.16). The concept of monotone increasing certainty equivalent operators is connected to outcomes where lifetime preferences are increasing in the state.

7.3.1.4Aggregation

We mentioned that Koopmans operators are typically constructed by combining a certainty equivalent operator and an aggregation function. Let’s now discuss the second of these components.

Given VRXV \subset \RR^\Xsf, an aggregator AA on VV is a map AA from X×R\Xsf \times \RR to R\RR such that

  1. w(x)=A(x,v(x))w(x) = A(x, v(x)) is in VV whenever vVv \in V and

  2. yA(x,y)y \mapsto A(x, y) is increasing for all xXx \in \Xsf.

Intuitively, an aggregator combines current state and continuation values to measure lifetime value.

Common types of aggregators include the

Here CES stands for “constant elasticity of substitution.” An important special case of both the CES and Uzawa aggregators is the

From these basic types we can also build composite aggregators. For example, we might consider a CES-Uzawa aggregator of the form A(x,y)={r(x)α+b(x)yα}1/αA(x, y) = \{r(x)^\alpha + b(x) y^\alpha\}^{1/\alpha} with r,bRXr, b \in \RR^\Xsf, b0b \geq 0 and α0\alpha \not= 0. As we will see in Section 7.3.3.3, the CES-Uzawa aggregator can be used to construct models with both Epstein--Zin utility and state-dependent discounting (as in, say, Albuquerque et al. (2016) or Schorfheide et al. (2018).)

7.3.1.5Building Koopmans Operators

We are now ready to build Koopmans operators by combining certainty equivalents and aggregators. Given VRXV \subset \RR^\Xsf, we call a self-map KK on VV a Koopmans operator if

K=AR,K = A \circ R,

for some aggregator AA and certainty equivalent operator RR on VV. The expression in (7.46) means that (Kv)(x)=A(x,(Rv)(x))(Kv)(x) = A(x, (Rv)(x)) at vVv \in V and xXx \in \Xsf.

It is generally appropriate to suppose that a uniform increase in continuation values will increase current value. This property holds for KK in (7.46). In particular, it follows from the definitions of AA and RR that KK is an order preserving self-map on VV.

7.3.1.6Comments on CES Aggregation

The CES aggregator is so-named because, in a static utility maximization problem where cc and yy are two goods and utility is U(c,y)=((1β)cα+βyα)1/αU(c,y) = ((1-\beta) c^\alpha + \beta y^\alpha)^{1/\alpha}, the elasticity of substitution is constant and given by 1/(1α)1/(1-\alpha). In the present setting, where aggregation is across time, 1/(1α)1/(1-\alpha) is usually called the elasticity of intertemporal substitution (EIS). The next exercise explains.

The fact that EIS =1/(1α)= 1/(1-\alpha) under the CES aggregator is significant because the EIS can be measured from data using regression and other techniques. While estimates vary significantly, the detailed meta-analysis by Havranek et al. (2015) suggests 0.5 as a plausible average value for international studies, with rich countries tending slightly higher. Basu & Bundick (2017) use 0.8 when calibrating to US data. Under these estimates, the relationship EIS =1/(1α)= 1/(1-\alpha) implies a value for α\alpha between -1.0 and -0.25.

7.3.1.7Lifetime Value

In Section 7.3.1.5 we constructed a generic Koopmans operator using an aggregator and a certainty equivalent operator. In this section, we connect this Koopmans operator to lifetime values and discuss the significance of global stability.

To begin, fix set X\Xsf and function class VRXV \subset \RR^\Xsf. Let K=ARK = A \circ R be a Koopmans operator for some aggregator AA and certainty equivalent operator RR on VV. The lifetime value generated by KK is the unique fixed point of KK in VV, whenever it exists. Given such a vv, the value v(x)v(x) is interpreted as lifetime value conditional on initial state xx.

In many applications, our existence and uniqueness proofs for fixed points of KK will also establish global stability. For Koopmans operators, global stability has the following interpretation: for wVw \in V, mNm \in \NN and xXx \in \Xsf, the value (Kmw)(x)(K^m w)(x) gives total finite-horizon utility over periods 0,,m0, \ldots, m under the preferences embedded in KK, with initial state xx and terminal condition ww. Hence global stability implies that, for any choice of terminal condition, finite-horizon utility converges to infinite-horizon utility as the time horizon converges to infinity. The next exercise helps to illustrate this point.

Exercise 7.33 confirms that, at least for the time additive case, global stability of KK is equivalent to the statement that a finite-horizon valuation with arbitrary terminal condition ww converges to the infinite-horizon valuation.

7.3.1.8Monotone Lifetime Values

Let X=(X,)\Xsf = (\Xsf, \preceq) be partially ordered, let iRXi\RR^\Xsf be the set of increasing functions in RX\RR^\Xsf, and let VV be such that iRXVRXi\RR^\Xsf \subset V \subset \RR^\Xsf. Let KK be a Koopmans operator on VV, so that Kv=ARKv = A \circ R for some aggregator AA and certainty equivalent operator RR on VV. Suppose that KK has a unique fixed point vVv^* \in V. A natural question is: when is vv^* increasing in the state?

7.3.2A Blackwell-Type Condition

Let RR be a certainty equivalent operator on V=RXV = \RR^\Xsf and let AA be an aggregator on VV. Let KK be the Koopmans operator on VV defined by (Kv)(x)=A(x,(Rv)(x))(Kv)(x) = A(x, (Rv)(x)). When RR is constant-subadditive, we can often establish global stability of KK on VV via a contraction mapping argument. This section gives details.

7.3.2.1Blackwell Aggregators

We call an aggregator AA on VV a Blackwell aggregator if there exists a β(0,1)\beta \in (0,1) such that

A(x,y+λ)A(x,y)+βλ,A(x, y + \lambda) \leq A(x, y) + \beta \lambda,

for all xXx \in \Xsf, yRy \in \RR and λR+\lambda \in \RR_+.

The next proposition states conditions for global stability in settings where aggregators have the Blackwell property.

The stability of time additive preferences is a special case of Proposition 7.3.3.

7.3.2.2The Risk-Sensitive Case

We can now complete the proof of Proposition 7.2.2, which concerned global stability of the Koopmans operator generated by risk-sensitive preferences.

7.3.2.3Quantile Preferences

Consider a setting where V=RXV = \RR^\Xsf and KτAaddRτK_\tau \coloneq A_{\textsc{add}} \circ R_\tau. That is,

(Kτv)(x)=r(x)+β(Rτv)(x)(xX),(K_\tau v)(x) = r(x) + \beta (R_\tau v)(x) \qquad (x \in \Xsf),

for β(0,1)\beta \in (0,1), τ[0,1]\tau \in [0,1], rRXr \in \RR^\Xsf and RτR_\tau as described in Exercise 7.22. Since RτR_\tau is constant-subadditive (Exercise 7.25) and the additive aggregator is Blackwell, KτK_\tau is globally stable (Proposition 7.3.3). The operator KτK_\tau represents quantile preferences, as described in Castro & Galvao (2019) and other studies (see Section 7.4). The value τ\tau parameterizes attitude to risk, a point we return to in Section 8.2.1.4.

7.3.3Uzawa Aggregation

Let’s consider the Koopmans operator K=AuzawaRK = A_{\textsc{uzawa}} \circ R, where VV is some subset of RX\RR^\Xsf and RR is a certainty equivalent operator on VV. In particular,

(Kv)(x)=r(x)+b(x)(Rv)(x)(xX,  vV),(Kv)(x) = r(x) + b(x) (Rv)(x) \qquad (x \in \Xsf, \; v \in V),

with r,bRXr, b \in \RR^\Xsf and b0b \geq 0. We are interested in conditions that imply KK is globally stable on VV.

7.3.3.1The Case of Conditional Expectation

Let V=RXV = \RR^\Xsf and suppose R=PR = P for some PM(RX)P \in \mopx, so that RR is ordinary conditional expectations. Then KK becomes Kv=r+LvKv = r + Lv where LL(RX)L \in \lopx with L(x,x)=b(x)P(x,x)L(x,x') = b(x)P(x,x'). By Exercise 1.20, KK is globally stable on VV whenever ρ(L)<1\rho(L) < 1.

This kind of structure arises when households derive utility from a consumption path while their discount factor fluctuates according to some state variable (see, e.g., Krusell & Smith (1998), Toda (2019), Cao (2020), and Hubmer et al. (2020)). For a given consumption path (Ct)(C_t), lifetime values takes the form

v(x)=Ext=0(i=1tβi)u(Ct),v(x) = \EE_x \, \sum_{t=0}^\infty \left( \prod_{i=1}^t \beta_i \right) u(C_t),

where uu is a flow utility function and {βt}\{\beta_t\} is a discount factor process. Suppose Ct=c(Xt)C_t = c(X_t) and βt=b(Xt)\beta_t = b(X_t) where b0b \geq 0 and (Xt)(X_t) is PP-Markov for some PM(RX)P \in \mopx. Set rucr \coloneq u \circ c and L(x,x)b(x)P(x,x)L(x,x') \coloneq b(x) P(x,x'). By Theorem 6.1.1, the condition ρ(L)<1\rho(L) < 1 implies that vv in (7.56) is the unique fixed point of Kv=r+Lv=r+bPvKv = r + L v = r + b Pv. In other words, lifetime value under (7.56) is the unique fixed point of the Koopmans operator when the aggregator is of Uzawa type and the certainty equivalent is conditional expectation.

How does this relate to optimization? Recall our discussion of state-dependent MDPs in Chapter 6. There, the policy operator TσT_\sigma in (6.29) is a special case of (7.55) when the discount factor depends only on the current state and action.

With some additional requirements, the condition ρ(L)<1\rho(L)<1 is necessary as well as sufficient for existence of a unique fixed point for Kv=r+LvKv = r + Lv. Indeed, if b0b \gg 0 and PP is irreducible, then LL is also irreducible and a positive linear operator. Applying Lemma 6.1.4, we see that r0r \gg 0 and ρ(L)1\rho(L) \geq 1 implies Kv=r+LvKv = r + Lv has no fixed point in V{vRX:v0}V \coloneq \setntn{v \in \RR^\Xsf}{v \gg 0}.

7.3.3.2Stability via Concavity

Now consider Kv=r+bRvKv = r + b Rv from (7.55) when RR is not in M(RX)\mopx. Here bRvb Rv is the pointwise product, so that (bRv)(x)=b(x)(Rv)(x)(bRv)(x) = b(x) (Rv)(x) for all xx.

We cannot use Proposition 7.3.3 to prove stability of KK unless b(x)<1b(x) < 1 for all xXx \in \Xsf. Since this condition is rather strict, we now study weaker conditions that can be valid even when bb exceeds 1 in some states. Specifically, we consider

  1. bRvc+Lvb Rv \leq c + Lv for some cRXc \in \RR^\Xsf and LL(RX)L \in \lopx with ρ(L)<1\rho(L) < 1.

  2. r0r \gg 0 and RR is concave on R+X\RR^\Xsf_+.

Let V=[0,vˉ]V = [0, \bar v] where vˉ(IL)1(r+c)\bar v \coloneq (I-L)^{-1}(r + c).

7.3.3.3Epstein--Zin Preferences with State-Dependent Discounting

Combining the CES-Uzawa aggregator A(x,y)={r(x)α+b(x)yα}1/αA(x, y) = \{r(x)^\alpha + b(x) y^\alpha\}^{1/\alpha} with the Kreps--Porteus certainty equivalent operator leads to the Koopmans operator

Kv={h+b[Pvγ]α/γ}1/α,withh,bR+X.Kv = \left\{ h + b \left[ Pv^\gamma \right]^{\alpha/\gamma} \right\}^{1/\alpha}, \quad \text{with} \quad h, b \in \RR^\Xsf_+.

A fixed point of KK corresponds to lifetime value for an agent with Epstein--Zin preferences and state-dependent discounting. (Such set ups are used in research on macroeconomic dynamics and asset pricing -- see Section 7.4 for more details).

In what follows we take V=(0,)XV = (0, \infty)^\Xsf and assume that h,bVh, b \in V and PP is irreducible.

To discuss stability of KK we introduce the operator BL(RX)B \in \lopx defined by

(Bv)(x)b(x)θxv(x)P(x,x)where    θγα.(Bv)(x) \coloneq b(x)^\theta \sum_{x'} v(x') P(x, x') \quad \text{where} \;\; \theta \coloneq \frac{\gamma}{\alpha}.

To prove Proposition 7.3.5, we proceed as in Section 7.2.3.3, constructing a conjugate operator K^\hat K and proving stability of the latter. For this purpose, we introduce

K^v={h+(Bv)1/θ}θ(vV),\hat K v = \left\{ h + (B v)^{1/\theta} \right\}^\theta \qquad (v \in V),

Also, let Φ\Phi be defined by Φv=vγ\Phi v = v^\gamma.

7.4Chapter Notes

The time additive preference structure in Section 7.2.1 was popularized by Samuelson (1939), who built on earlier work by Fisher (1930) and Ramsey (1928). An axiomatic foundation was supplied by Koopmans (1960). Bastianello & Faro (2023) study the foundations of discounted expected utility (DEU) from a purely subjective framework.

Problems with the time additive DEU model include non-constant discounting, as discussed in Section 6.4, as well as sign effects (gains being discounted more than losses) and magnitude effects (small outcomes being discounted more than large ones. See, for example, Thaler (1981) and Benzion et al. (1989). A critical review of the time additive model and a list of many references can be found in Frederick et al. (2002).

In the stochastic setting, the time additive framework is a subset of the expected utility model (Von Neumann & Morgenstern (1944), Friedman (1956), Savage (1951)). There are many well documented departures from expected utility in experimental data. See the start of Andreoni & Sprenger (2012) and the article Ericson & Laibson (2019) for an introduction to the literature. An interesting historical discussion of time additive expected utility can be found in Becker et al. (1989).

(It is ironic that those most responsible for popularizing the time additive DEU framework have also been among the most critical. For example, Samuelson (1939) stated that it is “completely arbitrary” to assume that the DEU specification holds. He goes on to claim that, in the analysis of savings and consumption, it is “extremely doubtful whether we can learn much from considering such an economic man.” In addition, Stokey & Lucas (1989), whose work helped to standardize DEU as a methodology for quantitative analysis, argued in a separate study that DEU is attractive only because of its relative simplicity Lucas & Stokey, 1984.)

Do the departures from time additive expected utility found in experimental data actually matter for quantitative work? Evidence suggests that the answer is affirmative. In macroeconomics and asset pricing in particular, researchers increasingly use non-additive preferences in order to bring model outputs closer to the data. For example, many quantitative models of asset pricing rely heavily on Epstein--Zin preferences. Representative examples include Epstein & Zin (1991), Tallarini Jr (2000), Bansal & Yaron (2004), Hansen et al. (2008), Bansal et al. (2012), Schorfheide et al. (2018), and Groot et al. (2022). Alternative numerical solution methods are discussed in Pohl et al. (2018).

An excellent introduction to recursive preference models can be found in Backus et al. (2004). Our use of the term “Koopmans operator,” which is not entirely standard, honors early contributions by Nobel laureate Tjalling Koopmans on recursive preferences (see Koopmans (1960) and Koopmans et al. (1964)).

Theoretical properties of recursive preference models have been studied in many papers, including Epstein & Zin (1989), Weil (1990), Boyd (1990), Hansen & Scheinkman (2009), Marinacci & Montrucchio (2010), Bommier et al. (2017), Bloise & Vailakis (2018), Marinacci & Montrucchio (2019), Pohl et al. (2019), Balbus (2020), Borovička & Stachurski (2020), DeJarnette et al. (2020), Christensen (2022), and Becker & Rincon-Zapatero (2023). The paper by Marinacci & Montrucchio (2019) provides a useful alternative approach to existence of unique fixed points in the setting of order preserving maps. Experimental results on Epstein--Zin preferences can be found in Meissner & Pfeiffer (2022).

There is a strong connection between risk-sensitive preferences and the literature on robust control. See, for example, Cagetti et al. (2002), Hansen & Sargent (2007), and Barillas et al. (2009). We return to this point in Chapter 8.

The quantile preferences we considered in Section 7.3.2.3 have been analyzed in static and dynamic settings by Giovannetti (2013), Castro & Galvao (2019), Castro & Galvao (2022) and Castro et al. (2022). Recursive components of the analysis of quantile and Uzawa preference models build on the study of monotone preferences in Bommier et al. (2017).

Some recursive preference specifications involve ambiguity aversion. An introduction to this literature and its applications can be found in Klibanoff et al. (2009), Hayashi & Miao (2011), Hansen & Miao (2018), Bommier et al. (2019) and Hansen & Sargent (2020). Marinacci et al. (2023) discuss the connection between recursivity and attitudes to uncertainty. We discuss ambiguity again in Chapter 8.

Recursive preferences are increasingly applied outside the field of asset pricing, where they first came to prominence. See, for example, Bommier & Villeneuve (2012), Colacito et al. (2018), Jensen (2019), or Augeraud-Véron et al. (2019).

The coin flip application in Section 7.2.1.2 is related to correlation aversion, as discussed in Stanca (2023), and preference for “consumption spreads” as reviewed in Frederick et al. (2002).

Some applications of Theorem 7.1.3 to network analysis can be found in Sargent & Stachurski (2023).

Footnotes
  1. Epstein--Zin preferences were popularized in Epstein & Zin (1989). They are a special case of preferences defined by Kreps & Porteus (1978). Further discussion can be found in Section 7.4.

  2. We could try to truncate the interval to a neighborhood of the fixed point and hope that K^\hat K is a contraction when restricted to this interval. But in higher dimensions we are not sure that a fixed point exists for a broad range of parameters, which makes this idea hard to implement.

References
  1. Barbu, V., & Precupanu, T. (2012). Convexity and Optimization in Banach Spaces. Springer Science & Business Media.
  2. Fajgelbaum, P. D., Schaal, E., & Taschereau-Dumouchel, M. (2017). Uncertainty traps. The Quarterly Journal of Economics, 132(4), 1641–1692.
  3. Du, Y. (1990). Fixed points of increasing operators in ordered Banach spaces and applications. Applicable Analysis, 38(01–02), 1–20.
  4. Zhang, Z. (2012). Variational, Topological, and Partial Order Methods with Their Applications (Vol. 29). Springer.
  5. Stachurski, J., Wilms, O., & Zhang, J. (2022). Unique solutions to power-transformed affine systems. arXiv, 2212.00275.
  6. Kleinman, B., Liu, E., & Redding, S. J. (2023). Dynamic spatial general equilibrium. Econometrica, 91(2), 385–424.
  7. Liao, J., & Berg, A. (2018). Sharpening Jensen’s inequality. The American Statistician.
  8. Tallarini Jr, T. D. (2000). Risk-sensitive real business cycles. Journal of Monetary Economics, 45(3), 507–532.
  9. Barillas, F., Hansen, L. P., & Sargent, T. (2009). Doubts or variability? Journal of Economic Theory, 144(6), 2388–2418.
  10. Kreps, D. M., & Porteus, E. L. (1978). Temporal resolution of uncertainty and dynamic choice theory. Econometrica, 46(1), 185–200.
  11. Bullen, P. S. (2003). Handbook of Means and Their Inequalities (Vol. 560). Springer Science & Business Media.
  12. Föllmer, H., & Knispel, T. (2011). Entropic risk measures: Coherence vs. convexity, model ambiguity and robust large deviations. Stochastics and Dynamics, 11(02n03), 333–351.
  13. Albuquerque, R., Eichenbaum, M., Luo, V. X., & Rebelo, S. (2016). Valuation risk and asset pricing. The Journal of Finance, 71(6), 2861–2904.
  14. Schorfheide, F., Song, D., & Yaron, A. (2018). Identifying long-run risks: A Bayesian mixed-frequency approach. Econometrica, 86(2), 617–654.
  15. Havranek, T., Horvath, R., Irsova, Z., & Rusnak, M. (2015). Cross-country heterogeneity in intertemporal substitution. Journal of International Economics, 96(1), 100–118.