Nonlinear Valuation - Dynamic Programming Volume I: Finite States

Dynamic programs are optimization problems where the objective to be maximized is lifetime value. As such, one key topic is how to combine a sequence of rewards into a corresponding lifetime value. So far we have considered linear valuation based on summation over expected discounted rewards, using either constant discount rates (Chapter 1–Chapter 5) or state-dependent discounting (Chapter 6). In this chapter, we consider extensions, where lifetime value is computed from a recursion over the reward sequence instead of a discounted sum. This “recursive preference” approach permits far more general specifications of lifetime value, and is becoming increasingly popular in economics, finance, and computer science (see, e.g., Section 6.4).

This chapter focuses purely on valuation (i.e., combining reward sequences into lifetime values), rather than optimization. Later, in Chapter 8, we will show how to maximize lifetime value in settings where recursive preferences are adopted.

Throughout this chapter, the symbol $\Xsf$ always represents a finite set.

7.1Beyond Contraction Maps¶

The most natural way to express lifetime value in recursive preference environments is as a fixed point of a (typically nonlinear) operator. One challenge is that some recursive preference specifications induce operators that fail to be contractions. For this reason, we now invest in additional fixed point theory. All of this theory concerns order preserving maps, since the operators we consider always inherit monotonicity from underlying preferences.

7.1.1Knaster–Tarski for Function Space¶

If you try to draw an increasing function that maps $[0,1]$ to itself without touching the 45-degree line, you will find it impossible. Below we state a famous fixed-point theorem due to Bronislaw Knaster (1893–1980) and Alfred Tarski (1901–1983) that generalizes this idea. In the statement, $\Xsf$ is a finite set and $V \coloneq [v_1, v_2]$ , where $v_1, v_2$ are functions in $\RR^\Xsf$ with $v_1 \leq v_2$ .

Unlike, say, the fixed-point theorem of Banach (Section 1.2.2.3), Theorem 7.1.1 only yields existence. Uniqueness does not hold in general, as you can easily confirm by sketching the one-dimensional case or completing the following exercise.

7.1.2Concavity, Convexity, and Stability¶

In this section, we study sufficient conditions for global stability that replace contractivity with shape properties such as concavity and monotonicity. To build intuition, we start with the one-dimensional case and show how these properties can be combined to achieve stability. Readers focused on results can safely skip to Section 7.1.2.2.

7.1.2.1The One-Dimensional Case¶

In Section 1.2.3.2, we showed that concavity and monotonicity can yield global stability for the Solow–Swan model. Here is a more general result.

Proof

Regarding existence, fix $x \in U$ and suppose first that $x \leq g(x)$ . Since $g$ is increasing, we have $g(x) \leq g^2(x)$ . Continuing in this fashion shows that $(g^k(x))_{k \geq 0}$ is monotone increasing. Moreover, there exists a $b \in U$ such that $x \leq b$ and $g(b)\leq b$ . Hence $g(x) \leq g(b) \leq b$ . Iterating yields $g^k(x) \leq b$ for all $k$ , so $(g^k(x))_{k \geq 0}$ is increasing and bounded above. Thus, there exists an $x^* \in U$ such that $x_k \coloneq g^k(x)$ converges to $x^*$ (by Theorem A.2.1 and Exercise A.2.4). Since $g$ is concave and hence continuous on any open set (see, e.g., Barbu & Precupanu (2012)), the result in Exercise 1.2.16 implies that $x^* = g(x^*)$ .

If, instead, $g(x) \leq x$ , then a similar argument shows that $(g^k(x))_{k \geq 0}$ is decreasing and bounded. Using analogous reasoning, we obtain a fixed point $x^*$ in $U$ with $g^k(x) \to x^*$ .

To show the uniqueness of the fixed point, assume $g(x)=x$ and $g(y)=y$ for some $x, y \in U$ . We claim that $x=y$ . To see this, suppose without loss of generality that $x\leq y$ . By assumption, there exists an $a \in U$ such that $a \leq x \leq y$ and $g(a)>a$ . Because $a \leq x\leq y$ , we can take $\lambda \in [0,1]$ such that $x = \lambda a + (1-\lambda) y$ . If $\lambda > 0$ , then concavity of $g$ and $g(a)>a$ implies the contradiction

g(x) = g \left(\lambda a + (1-\lambda )y \right) \geq \lambda g(a) + (1-\lambda) g(y) > \lambda a + (1-\lambda) y = x = g(x).

Hence $\lambda=0$ . Since $x = \lambda a + (1-\lambda) y$ , this yields $x=y$ . ◻

Figure 7.1:Global stability induced by increasing concave functions

Figure 7.1 gives one example, where $g(x) = 1 + \sqrt{x}/2$ . The conditions of Proposition 7.1.2 hold because, given any $x > 0$ , we can find an $a$ in $(0,x)$ that gets mapped strictly up (i.e., $g(a)$ is above the 45-degree line) and a point $b > x$ that gets mapped down (i.e., $g(b)$ is below the 45-degree line).

7.1.2.2The Multidimensional Case¶

Proposition 7.1.2 extends to multiple dimensions. In this section, we present a multidimensional version that covers both convex and concave functions.

To state our result, we extend the definition of convexity and concavity to vector-valued self-maps. The definitions mirror those for scalar-valued functions: A self-map $T$ on a convex subset $D$ of $\RR^\Xsf$ is called convex if

T(\lambda u + (1-\lambda) v) \leq \lambda Tu + (1-\lambda) Tv \text{ whenever } u,v \in D \text{ and } \lambda \in [0, 1];

and concave if

\lambda Tu + (1-\lambda) Tv \leq T(\lambda u + (1-\lambda) v) \text{ whenever } u,v \in D \text{ and } \lambda \in [0, 1].

Here $\leq$ is, as usual, the pointwise order.

We are now ready to state our next fixed-point result, which was first proved in an infinite-dimensional setting by Du (1990). In the statement, $\Xsf$ is a finite set, $V \coloneq [v_1, v_2]$ is a nonempty order interval in $(\RR^\Xsf, \leq)$ , and $T$ is a self-map on $V$ .

Conditions (i) and (ii) are similar – in fact (ii) holds whenever (i) holds, so (ii) is the weaker (but slightly more complicated) condition. Conditions (iii) and (iv) are similar in the same sense. Figure 7.2 illustrates the convex and the concave versions of the result in one dimension. We encourage you to sketch your own variations to understand the roles that different conditions play.

Figure 7.2:Du’s theorem: convex and concave cases

A full proof of Theorem 7.1.3 can be found in Du (1990) or Theorem 2.1.2 and Corollary 2.1.1 of Zhang (2012). In our setting, existence follows from the Knaster–Tarski theorem. We prove uniqueness.

7.1.3A Power-Transformed Affine Equation¶

Du’s theorem provides conditions under which concave or convex order preserving self-maps on order intervals attain global stability. In this section we study maps of this type that have additional structure. While this additional structure is restrictive, it allows us to obtain global stability on unbounded subsets rather than order intervals.

To begin, let $\Xsf$ be a finite set and consider the equation

v = [h + (A v)^{1/\theta}]^\theta \qquad (v \in V),

(7.1)

where $\theta$ is a nonzero parameter, $A \in \lopx$ with $A \geq 0$ , $V = (0, \infty)^\Xsf$ , and $h \in V$ . This system reduces to the affine model studied in Lemma 6.1.4 when $\theta = 1$ .

To analyze (7.1), we introduce the self-map

Gv = [h + (A v)^{1/\theta}]^\theta \qquad (v \in V).

(7.2)

Continuing to assume that $h \gg 0$ and $A$ is a positive linear operator, we can use Du’s theorem to establish the next result (which generalizes Lemma 6.1.4).

The key to proving (i) implies (ii) is that $G$ is order preserving and either convex or concave, depending on the value of $\theta$ . The remaining conditions in Du’s theorem are established over order intervals using $\rho(A)^{1/\theta} < 1$ . By applying an approximation argument, global stability is extended from order intervals to all of $V$ . Some of these details are contained in the following exercises and a full proof can be found in Stachurski et al. (2022).

Let

F_x(t) = \left\{ h(x) + t^{1/\theta} \right\}^\theta \qquad (t > 0).

Exercise 7.1.9

Kleinman et al. (2023) study a dynamic discrete choice model of migration with savings and capital accumulation. They show that optimal consumption for landlords in their model is $c_t = \sigma_t R_t k_t$ , where $k_t$ is capital, $R_t$ is the gross rate of return on capital, and $\sigma_t$ is a state-dependent process obeying

\sigma_t^{-1} = 1 + \beta^\psi \left[ \EE_t R_{t+1}^{(\psi-1)/\psi} \sigma_{t+1}^{-1/\psi} \right]^\psi.

(7.3)

Here $\beta$ is a discount factor and $\psi$ is a utility parameter. Assume $R_t = f(X_t)$ where $\Xsf$ is finite, $f \in \RR^\Xsf$ , and $(X_t)$ is $P$ -Markov for some $P \in \mopx$ . Let $A \in \lopx$ be defined by

(Av)(x) = \beta \sum_{x'} f(x')^{(\psi-1)/\psi} v(x')P(x, x').

Prove that there exists a unique solution to (7.3) of the form $\sigma_t = \sigma(X_t)$ for some $\sigma \in \RR^\Xsf$ with $\sigma \gg 0$ if and only if $\rho(A)^\psi < 1$ .

Solution to Exercise 7.1.9

Setting $v_t = \sigma_t^{-1/\psi}$ , we can write (7.3) as

v_t = \left\{ 1 + \beta^\psi \left[ \EE_t R_{t+1}^{(\psi-1)/\psi} v_{t+1} \right]^\psi \right\}^{1/\psi}.

(7.4)

We conjecture a stationary Markov solution $v_t = v(X_t)$ for some $v \in \RR^\Xsf$ with $v \gg 0$ . This $v$ must satisfy

v(x) = \left\{ 1 + \beta^\psi \left[ \sum_{x'} f(x')^{(\psi-1)/\psi} v(x') P(x,x') \right]^\psi \right\}^{1/\psi} \qquad (x \in \Xsf).

Using the definition of $A$ in the exercise, we can write the equation in vector form as $v = [1 + (Av)^\psi]^{1/\psi}$ . It follows from Theorem 7.1.4 that a unique strictly positive solution to this equation exists if and only if $\rho(A)^\psi < 1$ . This proves the claim in the exercise.

7.2Recursive Preferences¶

In this section, we compute lifetime values associated with given reward processes in settings that involve nonlinear recursions. These nonlinear recursions are called recursive preferences. We will show how some common specifications of recursive preferences can be translated into lifetime valuations via the fixed-point methods introduced in Chapter 2 and Section 7.1.

7.2.1Motivation: Optimal Savings¶

We motivate recursive preference models by analyzing consumption decisions.

7.2.1.1A Recursive View of a Standard Model¶

The time additive model of valuation in Section 3.2.2.3 can be studied from a purely recursive point of view. As a starting point, we state that the value $V_t$ of current and future consumption is defined at each point in time $t$ by the recursion

V_t = u(C_t) + \beta \, \EE_t V_{t+1}.

(7.5)

The random variables $V_t$ and $V_{t+1}$ are the unknown objects in this expression. The expectation $\EE_t$ conditions on $X_0, \ldots, X_t$ and $C_t = c(X_t)$ . The process $(X_t)_{t \geq 0}$ is $P$ -Markov.

Since consumption is a function of $(X_t)_{t \geq 0}$ and knowledge of the current state $X_t$ is sufficient to forecast future values (by the Markov property), it is natural to guess that $V_t$ will depend on the Markov chain only through $X_t$ . Hence we guess that a solution of (7.5) takes the form $V_t = v(X_t)$ for some $v \in \RR^\Xsf$ .

(Here $v$ is an ansatz, meaning “educated guess.” First we guess the form of a solution and then we try to verify that the guess is correct. So long as we carry out the second step, starting with a guess brings no loss of rigor.)

Under this conjecture, (7.5) can be rewritten as $v(X_t) = u(c(X_t)) + \beta \EE_t v(X_{t+1})$ . Conditioning on $X_t = x$ and setting $r \coloneq u \circ c$ , this becomes

v(x) = r(x) + \beta \, \EE_x \, v(X_{t+1}) = r(x) + \beta (P v) (x) \qquad (x \in \Xsf).

(7.6)

In vector form, we get $v = r + \beta P v$ . From the Neumann series lemma, the solution is $v^* = (I - \beta P)^{-1} r$ , which is identical to (3.21).

In summary, (7.5) and the sequential representation (3.20) specify the same lifetime value for consumption paths.

While the recursive formulation in (7.5) now seems redundant, since it produces the same specification that we obtained from the sequential approach, the recursive set up gives us a formula to build on, and hence a pathway to overcoming limitations of the time additive approach. Most of the rest of this chapter will be focused on this agenda.

Pursuing this agenda will produce preferences over consumption paths where the sequential approach has no natural counterpart. This occurs when current value $V_t$ is nonlinear in current rewards and continuation values (unlike the linear specification (7.5)). Such specifications are called recursive preferences. When dealing with recursive preference models, the lack of a sequential counterpart means that we are forced to proceed recursively.

7.2.1.2Limitations of Time Additive Preferences¶

In the previous section, we discussed how the time additive preference specification

v(x) = \EE_x \sum_{t \geq 0} \beta^t u(C_t),

(7.7)

also called the discounted expected utility model, can be framed recursively, and how this provides a pathway to go beyond the time additive specification. We are motivated to do so because the time additive specification has been rejected by experimental and observational data in many settings.

In this section, we highlight some of the limitations of time additive preferences. While our discussion is only brief, more background and a list of references can be found in Section 7.4.

One issue with (7.7) is the assumption of a constant positive discount rate, which has been refuted by a long list of empirical studies. This issue was discussed in Section 6.4.

Another limitation of time additive preferences is that agents are risk-neutral in future utility (see, e.g., (7.5), where current value depends linearly on future value). Although risk aversion over consumption can be built in through curvature of $u$ , this same curvature also determines the elasticity of intertemporal substitution, meaning that the two aspects of preferences cannot be separated. We elaborate on this point in Section 7.3.1.4.

A third issue with time additivity is that agents with such preferences are indifferent to any variation in the joint distribution of rewards that leaves marginal distributions unchanged. To get a sense of what this means, suppose you accept a new job and will be employed by this firm for the rest of your life. Your daily consumption will be entirely determined by your daily wage. Your boss offers you two options:

Your boss will flip a coin at the start of your first day on the job. If the coin is heads, you will receive $10,000 a day for the rest of your life. If the coin is tails, you will receive $1 per day for the rest of your life.
Your boss will flip a coin at the start of every day. If the coin is heads, you will receive $10,000. If the coin is tails, you will receive $1.

If you have a strict preference between options A and B, then your choice cannot be rationalized with time additive preferences.

To see why, let $\phi$ be a probability distribution that represents the lottery just described, putting mass 0.5 on 10,000 and mass 0.5 on 1. Under option A, consumption $(C_t)_{t \geq 1}$ is given by $C_t = C_1$ for all $t$ , where $C_1 \sim \phi$ . Under option B, consumption $(C_t)_{t \geq 1}$ is an IID sequence drawn from $\phi$ . Either way, lifetime utility is

\EE \sum_{t \geq 1} \beta^t u(C_t) = \sum_{t \geq 1} \beta^t \EE u(C_t) = \frac{\beta \bar u}{1-\beta},

where $\bar u \coloneq \EE u(C_1) = u(1)/2 + u(10,000)/2$ .

The critical part of this argument is the passing of expectations through the sum, which uses time additivity . The implication is that lifetime utility depends only on the marginal distribution of each $C_t$ , rather than on the joint distribution of the stochastic process $(C_t)_{t \geq 0}$ .

7.2.2Risk-Sensitive Preferences¶

Having motivated recursive preferences, let’s turn to our first example: risk-sensitive preferences. For the consumption problem described in Section 7.2.1.1, imposing risk-sensitive preferences means replacing the recursion $v = r + \beta Pv$ for $v$ with

v(x) = r(x) + \beta \frac{1}{\theta} \ln \left\{ \sum_{x'} \exp(\theta v(x')) P(x, x') \right\} \qquad (x \in \Xsf).

(7.8)

As before, $r(x) = u(c(x))$ represents current utility when the current state is $x$ . The parameter $\theta$ is a nonzero constant in $\RR$ .

In (7.8), the transform $f(v) = \exp(\theta v)$ is applied to $v$ before expectation is taken. After the expectation is computed, the transform is undone via $f^{-1}(v) = (1/\theta) \ln (v)$ . We will show that the agent can be either risk-averse or risk-loving with respect to future outcomes, depending on the value of $\theta$ .

7.2.2.1Lifetime Utility¶

We understand the functional equation (7.8) as “defining” lifetime utility under risk-sensitive preferences. A function $v$ solving (7.8) gives a lifetime valuation $v(x)$ to each $x \in \Xsf$ , with the interpretation that $v(x)$ is lifetime utility conditional on initial state $x$ . This definition of lifetime value is by analogy to the time additive case studied in Section 7.2.1.1, where the function $v$ solving $v = r + \beta P v$ measures lifetime utility from each initial state.

In the previous paragraph we wrote “defining” in scare quotes because we can’t be sure we have a definition at this point. Just because we write down a recursive expression for lifetime utility doesn’t mean that corresponding lifetime utility is actually well defined. (For example, we can happily write down the recursive vector equation $v = v + \1$ but no vector $v$ solving this equation exists.) One aim of this chapter is to provide conditions under which recursions like (7.8) have solutions.

Another issue is uniqueness. Suppose that (7.8) has many solutions. In this case the predictions of the utility model are ambiguous. Our perspective is that the recursive preference specification (7.8) is not correctly formulated unless existence and uniqueness hold. We return to this point in Section 7.2.2.3.

One final comment: even if we can find a $v$ that solves (7.8), the nonlinearities introduced by risk sensitivity imply that there will be no neat sequential representation analogous to $v(x) = \EE_x \sum_t \beta^t u(C_t)$ from the time additive case. (This connects to Remark 7.2.1, where we discuss recursive preference terminology.)

7.2.2.2Risk-Adjusted Expectation¶

We want to understand the “expectation-like” expression on the right hand side of (7.8) that replaces the ordinary conditional expectation $\sum_{x'} v(x') P(x, x')$ from the time additive case. To this end, we define, for arbitrary random variable $\xi$ and nonzero $\theta \in \RR$ ,

\eE_\theta [\xi] = \frac{1}{\theta} \ln \left\{ \EE [ \exp(\theta \xi) ] \right\}.

The value $\eE_\theta[\xi]$ is called the entropic risk-adjusted expectation of $\xi$ given $\theta$ .

The key idea behind the entropic risk-adjusted expectation is that decreasing $\theta$ lowers appetite for risk and increasing $\theta$ does the opposite.

Expression (7.9) shows that, for the Gaussian case, $\eE_\theta[ \xi]$ equals the mean plus a term that penalizes variance when $\theta < 0$ and rewards it when $\theta > 0$ .

More generally, we have the following result.

Proof

Fix $\theta \in \RR$ and let $f \colon \RR \to (0,\infty)$ be defined by $f(x) = \exp(\theta x)$ . Note that $f'(x) = \theta \exp(\theta x)$ and $f''(x) = \theta^2 \exp(\theta x)$ . Thus $f$ is convex and either increasing or decreasing depending on whether $\theta$ is positive or negative. Then $\eE_\theta[\xi] = f^{-1}( \EE f(\xi))$ . By Jensen’s inequality,

\EE [ f(\xi) ] \geq f(\EE [\xi] ).

If $\theta > 0$ , then $f^{-1}$ is increasing, so applying $f^{-1}$ to both sides gives $\eE_\theta[\xi] \geq \EE [ \xi]$ . If $\theta < 0$ , then $f^{-1}$ is decreasing, so applying $f^{-1}$ to both sides gives $\eE_\theta[\xi] \leq \EE [ \xi]$ . This proves the two weak inequalities in Lemma 7.2.1. To obtain strict inequalities we can apply the same argument using a strict version of Jensen’s inequality (see, e.g., Liao & Berg (2018)), which is valid when $\var[\xi] > 0$ . ◻

7.2.2.3Existence and Uniqueness¶

Let’s return to investigating lifetime utility under risk-sensitive preferences. To this end, we introduce the risk-sensitive Koopmans operator $K_\theta$ on $\RR^\Xsf$ via

(K_\theta \, v)(x) = r(x) + \beta \frac{1}{\theta} \ln \left\{ \sum_{x'} \exp(\theta v(x')) P(x, x') \right\} \qquad (x \in \Xsf).

(7.10)

Evidently, for given nonzero $\theta$ , a function $v \in \RR^\Xsf$ solves the risk-sensitive preference lifetime utility specification (7.8) if and only if $v$ is a fixed point of $K_\theta$ . This explains the significance of the following result:

We postpone a proof of Proposition 7.2.2 because we will prove a more general result in Section 7.3.2.2. For now we note the following implications.

(i) For each nonzero $\theta$ , lifetime utility is both well-defined and uniquely defined for risk-sensitive preferences (i.e., (7.8) has a unique solution).

(ii) The unique solution, denoted henceforth by $v^*$ , can be computed by successive approximation using $K_\theta$ .

7.2.2.4The Gaussian Case¶

As a tractable case, let’s suppose that $r(x) = x$ and that $X_{t+1} = \rho X_t + \sigma W_{t+1}$ where $(W_t)_{t \geq 1}$ is IID and standard normal. Here $|\rho| < 1$ and $\sigma \geq 0$ controls volatility of the state. Rather than discretizing the state process, we leave it as continuous and proceed by hand.

In this setting, the functional equation (7.8) for $v$ becomes

v(x) = x + \beta \eE_\theta[ v(\rho x + \sigma W)],

(7.11)

for each $x \in \Xsf$ , where $W$ is standard normal.

Since $\rho x + \sigma W$ is Gaussian, the expression (7.9) for the risk-adjusted expectation of a normal random variable leads us to conjecture that the solution $v$ will be affine, i.e., $v(x) = a x + b$ for some $a, b \in \RR$ . This conjecture turns out to be correct:

We can see that, under the stated assumptions, lifetime value $v$ is increasing in the state variable $x$ . However, impacts of the parameters generally depend on $\theta$ . For example, if $\theta > 0$ , increasing $\sigma$ shifts up lifetime utility. If $\theta < 0$ , then lifetime value decreases with $\sigma$ . This is as we expect: Lifetime utility is affected positively or negatively by volatility, depending on whether or not the agent is risk averse or risk loving.

Figure 7.3 shows the true solution $v(x) = ax + b$ to the risk-sensitive lifetime utility model, as well as an approximate fixed point from a discrete approximation. The discrete approximation is computed by applying successive approximation to $K_\theta$ after discretizing the state process via Tauchen’s method. The parameters and discretization are shown in Listing 1.

Figure 7.3:Approximate and true solutions in the Gaussian case

1
2
3
4
5
6
7
8
9
10
11
12
13
using LinearAlgebra, QuantEcon

function create_rs_utility_model(;
        n=180,      # size of state space
        β=0.95,     # time discount factor
        ρ=0.96,     # correlation coef in AR(1)
        σ=0.1,      # volatility
        θ=-1.0)     # risk aversion
    mc = tauchen(n, ρ, σ, 0, 10)  # n_std = 10
    x_vals, P = mc.state_values, mc.p 
    r = x_vals      # special case u(c(x)) = x
    return (; β, θ, ρ, σ, r, x_vals, P)
end

Program 1:Risk sensitive utility model parameters (rs_utility.jl)

Exercise 7.2.6

Dropping the Gaussian assumption, suppose now that consumption is IID with $C_t = c(X_t)$ where $(X_t)_{t \geq 0}$ is IID with distribution $\phi$ on finite set $\Xsf$ . Now the operator $K_\theta$ becomes

(K_\theta v)(x) = r(x) + \beta \frac{1}{\theta} \ln \left\{ \sum_{x'} \exp(\theta v(x')) \phi(x') \right\} \qquad (x \in \Xsf).

Although iterating on $K_\theta$ is convergent, there is a more efficient method that reduces to solving a one-dimensional equation. Propose such a method and confirm that it is convergent. (Hint: Consider reviewing Section 4.2.2.2.)

7.2.3Epstein–Zin Preferences¶

One of the most popular specifications of recursive preferences in quantitative research is Epstein–Zin utility.^[1] This class of preferences has been used to study asset pricing, business cycles, monetary policy, fiscal policy, optimal taxation, climate policy, pension plans, and other topics. In this section, we introduce the Epstein–Zin specification and discuss how to solve it. We will see that the specification, while highly nonlinear, is nonetheless well behaved.

7.2.3.1Specification¶

With Epstein–Zin preferences, the relationship $V_t = u(C_t) + \beta \EE_t V_{t+1}$ is replaced by

V_t = \left\{ (1-\beta) C_t^\alpha + \beta [\EE_t V_{t+1}^\gamma]^{\alpha / \gamma} \right\}^{1/\alpha},

(7.12)

where $\gamma$ , $\alpha$ are nonzero parameters and $\beta \in (0,1)$ . As for risk-sensitive preferences, lack of time additivity implies that there is no neat sequential representation for lifetime value. As a result, we must work directly with the recursive expression (7.12).

Assume as before that $C_t = c(X_t)$ , where $c \in \RR_+^\Xsf$ and $(X_t)_{t \geq 0}$ is $P$ -Markov on finite set $\Xsf$ . We conjecture a solution of the form $V_t = v(X_t)$ for some $v \in V \coloneq \RR_+^\Xsf$ . Under this conjecture, the Epstein–Zin Koopmans operator corresponding to (7.12) is

(Kv)(x) = \left\{ (1-\beta) c(x)^\alpha + \beta \left[ \sum_{x'} v(x')^\gamma P(x, x') \right]^{\alpha/\gamma} \right\}^{1/\alpha}.

(7.13)

As will be discussed further in Section 7.3.1.1, the parameter $\gamma$ governs risk aversion with respect to temporal gambles (where outcomes are resolved in the next period), while $\beta$ controls impatience and $\alpha$ parametrizes the intertemporal elasticity of substitution. The fact that all three parameters have distinct effects helps fit data. For example, see Tallarini Jr (2000) and Barillas et al. (2009).

An important question is whether Epstein–Zin preferences are well defined. In particular, what conditions do we need on primitives such that the Koopmans operator $K$ in (7.13) has a unique fixed point?

7.2.3.2Properties of the Koopmans Operator¶

To address this question we rewrite (7.13) in vector form as

Kv = \left\{ h + \beta [P v^\gamma]^{\alpha /\gamma} \right\}^{1/\alpha},

(7.14)

where $h \in \RR^\Xsf$ . This is equivalent to (7.13) when $h = (1-\beta) c^\alpha$ . To avoid fractional powers of negative numbers, we assume throughout that $h \geq 0$ .

The set $V$ is called the interior of the positive cone of $\RR^\Xsf$ .

The operator $K$ is difficult to work with for two reasons. First, linear and nonlinear transformations are intertwined. Second, there are several cases for the parameters that we need to handle in order to understand stability. Nonetheless, by applying a smooth transformation, we will find it easy to show that the Epstein–Zin Koopmans operator $K$ is globally stable under mild conditions. In particular,

A proof of Proposition 7.2.3 is provided in Section 7.2.3.3.

Proposition 7.2.3 implies that Epstein–Zin utility is well-defined under the stated conditions and, moreover, that the solution can be computed via successive approximation on $K$ . Listing 2 provides code for performing this operation. Figure 7.4 shows convergence of the sequence of iterates to the fixed point $v^*$ , under the parameters in Listing 2, given an initial condition $v_0$ . The figure plots every 10th iterate, repeated 100 times.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
include("s_approx.jl")
using LinearAlgebra, QuantEcon

function create_ez_utility_model(;
        n=200,      # size of state space
        ρ=0.96,     # correlation coef in AR(1)
        σ=0.1,      # volatility
        β=0.99,     # time discount factor
        α=0.75,     # EIS parameter
        γ=-2.0)     # risk aversion parameter

    mc = tauchen(n, ρ, σ, 0, 5) 
    x_vals, P = mc.state_values, mc.p 
    c = exp.(x_vals)      

    return (; β, ρ, σ, α, γ, c, x_vals, P)
end

function K(v, model)
    (; β, ρ, σ, α, γ, c, x_vals, P) = model

    R = (P * (v.^γ)).^(1/γ)
    return ((1 - β) * c.^α + β * R.^α).^(1/α)
end

function compute_ez_utility(model)
    v_init = ones(length(model.x_vals))
    v_star = successive_approx(v -> K(v, model), 
                               v_init, 
                               tolerance=1e-10)
    return v_star
end

Program 2:Epstein--Zin utility model and Koopmans operator (ez_utility.jl)

Figure 7.4:Convergence of Koopmans iterates for Epstein–Zin utility

7.2.3.3Proof of the Stability Result¶

We prove Proposition 7.2.3 by

(i) introducing an operator $\hat K$ obtained from $K$ via a smooth transformation,

(ii) proving that $(\hat V, \hat K)$ and $(V, K)$ are topologically conjugate, and

(iii) obtaining conditions under which $\hat K$ is globally stable on $V$ .

Throughout this section, the assumptions of Proposition 7.2.3 are in force.

To begin we define $\hat K$ via

\hat K v = \left\{ h + \beta (P v)^{1/\theta} \right\}^\theta \qquad \text{where} \quad \theta \coloneq \frac{\gamma}{\alpha}.

(7.15)

The operator $\hat K$ is simpler to work with than $K$ because it unifies $\alpha, \gamma$ into a single parameter $\theta$ and decomposes the Epstein–Zin update rule into two parts: a linear map $P$ and a separate nonlinear component.

7.2.3.4Why Not Use Contractivity?¶

While we can consider studying stability of $\hat K$ using contraction arguments, this approach fails under useful parameterizations. To illustrate, suppose that $\Xsf = \{x_1\}$ . Then $h$ is a constant, $P$ is the identity, $v$ is a scalar and $\hat K v = F(v)$ with $F(v) = \left\{ h + \beta v^{1/\theta} \right\}^\theta$ , as shown in Figure 7.5. Here $\theta = 5$ , $h=0.5$ and $\beta=0.5$ . We see that $\hat K$ has infinite slope at zero, so the contraction property fails.^[2]

Shape properties of \hat K in one dimension — Figure 7.5:Shape properties of $\hat K$ in one dimension

7.3General Representations¶

We have discussed two well-known examples of recursive preferences. In this section we build a general representation. While various constructions can be found in the decision theory literature, many are not well suited to quantitative work. Here we give a relatively parsimonious operator-theoretic definition.

7.3.1Koopmans Operators¶

In Section 7.2.2.3 and Section 7.2.3.1 we met risk-sensitive and Epstein–Zin Koopmans operators respectively. In this section, we provide a general definition of a Koopmans operator that will contain these two examples as special cases.

We begin by outlining structure that can be combined to generate Koopmans operators in a Markov environment. The two key components are an aggregation function and a certainty equivalent operator. We then build Koopmans operators from these primitives and connect them to applications. In every setting we consider, lifetime value is identified with the unique fixed point of the Koopmans operator (whenever it exists).

7.3.1.1Certainty Equivalents¶

The first primitive we consider is a generalization of conditional expectations: Given $V \subset \RR^\Xsf$ , we define a certainty equivalent operator on $V$ to be a self-map $R$ on $V$ such that

(i) $R$ is order preserving on $V$ and

(ii) all constants are fixed under $R$ (i.e., $R\, ( \lambda \1) = \lambda \1$ for all $\lambda \in \RR$ with $\lambda \1 \in V$ ).

The next example is nonlinear. It treats the risk-adjusted expectation that appears in risk-sensitive preferences.

Exercise 7.3.4

Let $V = \RR^\Xsf$ and fix $P \in \mopx$ and $\tau \in [0,1]$ . Let $R_\tau$ be the quantile certainty equivalent. That is, $(R_\tau \, v)(x) = Q_\tau \,v(X)$ where $X \sim P(x, \cdot)$ and $Q_\tau$ is the quantile functional. More specifically,

(R_\tau \, v)(x) = \min \left\{ y \in \RR \;\Big|\; \sum_{x'} \1\{v(x') \leq y\} P(x,x') \geq \tau \right\} \qquad (v \in V, \; x \in \Xsf).

Confirm that $R_\tau$ defines a certainty equivalent operator on $V$ .

Solution to Exercise 7.3.4

Since $V$ is all of $\RR^\Xsf$ , the condition $R_\tau \colon V \to V$ is trivially satisfied. Regarding monotonicity, fix $v, w \in V$ with $v \leq w$ and $x \in \Xsf$ . Let $X$ be a draw from $P(x, \cdot)$ . Then $(R_\tau \, v)(x) = Q_\tau[v(X)] \leq Q_\tau[w(X)] = (R_\tau \, w)(x)$ , where the inequality is by Exercise 2.2.35. Moreover, given $\lambda \in \RR$ and a random variable $Y$ with $\PP\{Y = \lambda\} = 1$ , we clearly have $Q_\tau(Y) = \lambda$ . It follows that $R_\tau \, \lambda \1 = \lambda \1$ . Hence $R_\tau$ is a certainty equivalent operator, as was to be shown.

The set of certainty equivalent operators on $\RR^\Xsf$ is invariant under convex combinations, as the next exercise asks you to confirm.

7.3.1.2Properties¶

Let $V$ be a convex cone in $\RR^\Xsf$ . A certainty equivalent operator $R$ on $V$ is called

positive homogeneous on $V$ if $R(\lambda v) = \lambda Rv$ for all $v \in V$ and $\lambda \geq 0$ with $\lambda v \in V$ ,
superadditive on $V$ if $R(v + w) \geq Rv + Rw$ for all $v, w \in V$ with $v + w \in V$ ,
subadditive on $V$ if $R(v + w) \leq Rv + Rw$ for all $v, w \in V$ with $v + w \in V$ ,
constant-subadditive on $V$ if $R (v + \lambda \1) \leq R v + \lambda \1$ for all $v \in V$ and $\lambda \geq 0$ with $v + \lambda \1 \in V$ .

Solution to Exercise 7.3.8

Fix $v \in V$ , $P \in \mopx$ and $\lambda \in \RR_+$ . Let $X$ be a draw from $P(x, \cdot)$ . We have

\begin{aligned} (R_\theta (v + \lambda))(x) & = \frac{1}{\theta} \ln \left\{ \EE \exp[\theta (v(X) + \lambda)] \right\} \\ & = \frac{1}{\theta} \ln \left\{ \EE \exp[\theta v(X)] \cdot \exp(\theta \lambda) \right\} \\ & = \frac{1}{\theta} \ln \left\{ \EE \exp[\theta v(X)] \right\} + \lambda. \end{aligned}

Hence constant-subadditivity holds.

In some instances, a certainty equivalent operator is either convex or concave in the sense of Section 7.1.2.2.

Combining Exercise 7.3.11 and Example 7.3.5, we have proved

Later we will combine Lemma 7.3.1 with the fixed-point results for convex and concave operators in Section 7.1.2.2 to establish existence and uniqueness of lifetime values for certain kinds of Koopmans operators.

7.3.1.3Monotonicity¶

Let $\Xsf$ be partially ordered and let $i\RR^\Xsf$ be the set of increasing functions in $\RR^\Xsf$ . Let $V$ be such that $i\RR^\Xsf \subset V \subset \RR^\Xsf$ and let $R$ be a certainty equivalent on $V$ . We call $R$ monotone increasing if $R$ is invariant on $i\RR^\Xsf$ . This extends the terminology in Section 3.2.1.3, where we applied it to Markov operators (cf., Exercise 3.2.4). The concept of monotone increasing certainty equivalent operators is connected to outcomes where lifetime preferences are increasing in the state.

7.3.1.4Aggregation¶

We mentioned that Koopmans operators are typically constructed by combining a certainty equivalent operator and an aggregation function. Let’s now discuss the second of these components.

Given $V \subset \RR^\Xsf$ , an aggregator $A$ on $V$ is a map $A$ from $\Xsf \times \RR$ to $\RR$ such that

(i) $w(x) = A(x, v(x))$ is in $V$ whenever $v \in V$ and

(ii) $y \mapsto A(x, y)$ is increasing for all $x \in \Xsf$ .

Intuitively, an aggregator combines current state and continuation values to measure lifetime value.

Common types of aggregators include the

Leontief aggregator $A_{\textsc{MIN}}(x,y) = \min\{ r(x), \beta y \}$ with $r \in \RR^\Xsf$ and $\beta \geq 0$ ,
Uzawa aggregator $A_{\textsc{UZAWA}}(x,y) = r(x) + b(x) y$ with $r \in \RR^\Xsf$ and $b \in \RR^\Xsf_+$ , and
CES aggregator $A_{\textsc{CES}}(x, y) = \{r(x)^\alpha + \beta y^\alpha\}^{1/\alpha}$ with $r \in (0,\infty)^\Xsf$ , $\beta \geq 0$ and $\alpha \neq 0$ .

Here CES stands for “constant elasticity of substitution.” An important special case of both the CES and Uzawa aggregators is the

additive aggregator $A_{\textsc{ADD}}(x,y) = r(x) + \beta y$ with $r \in \RR^\Xsf$ and $\beta \geq 0$ .

From these basic types we can also build composite aggregators. For example, we might consider a CES-Uzawa aggregator of the form $A(x, y) = \{r(x)^\alpha + b(x) y^\alpha\}^{1/\alpha}$ with $r, b \in \RR^\Xsf$ , $b \geq 0$ and $\alpha \neq 0$ . As we will see in Section 7.3.3.3, the CES-Uzawa aggregator can be used to construct models with both Epstein–Zin utility and state-dependent discounting (as in, say, Albuquerque et al. (2016) or Schorfheide et al. (2018).)

7.3.1.5Building Koopmans Operators¶

We are now ready to build Koopmans operators by combining certainty equivalents and aggregators. Given $V \subset \RR^\Xsf$ , we call a self-map $K$ on $V$ a Koopmans operator if

K = A \circ R,

(7.18)

for some aggregator $A$ and certainty equivalent operator $R$ on $V$ . The expression in (7.18) means that $(Kv)(x) = A(x, (Rv)(x))$ at $v \in V$ and $x \in \Xsf$ .

It is generally appropriate to suppose that a uniform increase in continuation values will increase current value. This property holds for $K$ in (7.18). In particular, it follows from the definitions of $A$ and $R$ that $K$ is an order preserving self-map on $V$ .

7.3.1.6Comments on CES Aggregation¶

The CES aggregator is so-named because, in a static utility maximization problem where $c$ and $y$ are two goods and utility is $U(c,y) = ((1-\beta) c^\alpha + \beta y^\alpha)^{1/\alpha}$ , the elasticity of substitution is constant and given by $1/(1-\alpha)$ . In the present setting, where aggregation is across time, $1/(1-\alpha)$ is usually called the elasticity of intertemporal substitution (EIS). The next exercise explains.

The fact that EIS $= 1/(1-\alpha)$ under the CES aggregator is significant because the EIS can be measured from data using regression and other techniques. While estimates vary significantly, the detailed meta-analysis by Havranek et al. (2015) suggests 0.5 as a plausible average value for international studies, with rich countries tending slightly higher. Basu & Bundick (2017) use 0.8 when calibrating to US data. Under these estimates, the relationship EIS $= 1/(1-\alpha)$ implies a value for $\alpha$ between -1.0 and -0.25.

7.3.1.7Lifetime Value¶

In Section 7.3.1.5 we constructed a generic Koopmans operator using an aggregator and a certainty equivalent operator. In this section, we connect this Koopmans operator to lifetime values and discuss the significance of global stability.

To begin, fix set $\Xsf$ and function class $V \subset \RR^\Xsf$ . Let $K = A \circ R$ be a Koopmans operator for some aggregator $A$ and certainty equivalent operator $R$ on $V$ . The lifetime value generated by $K$ is the unique fixed point of $K$ in $V$ , whenever it exists. Given such a $v$ , the value $v(x)$ is interpreted as lifetime value conditional on initial state $x$ .

In many applications, our existence and uniqueness proofs for fixed points of $K$ will also establish global stability. For Koopmans operators, global stability has the following interpretation: for $w \in V$ , $m \in \NN$ and $x \in \Xsf$ , the value $(K^m w)(x)$ gives total finite-horizon utility over periods $0, \ldots, m$ under the preferences embedded in $K$ , with initial state $x$ and terminal condition $w$ . Hence global stability implies that, for any choice of terminal condition, finite-horizon utility converges to infinite-horizon utility as the time horizon converges to infinity. The next exercise helps to illustrate this point.

Solution to Exercise 7.3.15

Iterating forward from $V_0$ gives

V_0 = u(C_0) + \beta \EE_0 \, V_1 = u(C_0) + \beta \EE_0 \left[ u(C_1) + \beta \EE_1 \, V_2 \right] = u(C_0) + \beta \EE_0 \, u(C_1) + \beta^2 \EE_0 \, V_2.

Continuing forward until time $m$ yields $V_0 = \sum_{t=0}^{m-1} \beta^t \EE_0 \, u(C_t) + \beta^m \EE_0 \, V_m$ . Shifting to functional form and using $r = u \circ c$ , the last expression becomes

v = \sum_{t=0}^{m-1} (\beta P)^t r + (\beta P)^m w.

By Exercise 1.2.17, this is just $K^m w$ when $K$ is the associated Koopmans operator $Kv = r + \beta P v$ and, moreover, $K^m w \to v^* \coloneq (I - \beta P)^{-1} r$ .

Exercise 7.3.15 confirms that, at least for the time additive case, global stability of $K$ is equivalent to the statement that a finite-horizon valuation with arbitrary terminal condition $w$ converges to the infinite-horizon valuation.

7.3.1.8Monotone Lifetime Values¶

Let $\Xsf = (\Xsf, \preceq)$ be partially ordered, let $i\RR^\Xsf$ be the set of increasing functions in $\RR^\Xsf$ , and let $V$ be such that $i\RR^\Xsf \subset V \subset \RR^\Xsf$ . Let $K$ be a Koopmans operator on $V$ , so that $Kv = A \circ R$ for some aggregator $A$ and certainty equivalent operator $R$ on $V$ . Suppose that $K$ has a unique fixed point $v^* \in V$ . A natural question is: when is $v^*$ increasing in the state?

7.3.2A Blackwell-Type Condition¶

Let $R$ be a certainty equivalent operator on $V = \RR^\Xsf$ and let $A$ be an aggregator on $V$ . Let $K$ be the Koopmans operator on $V$ defined by $(Kv)(x) = A(x, (Rv)(x))$ . When $R$ is constant-subadditive, we can often establish global stability of $K$ on $V$ via a contraction mapping argument. This section gives details.

7.3.2.1Blackwell Aggregators¶

We call an aggregator $A$ on $V$ a Blackwell aggregator if there exists a $\beta \in (0,1)$ such that

A(x, y + \lambda) \leq A(x, y) + \beta \lambda,

(7.19)

for all $x \in \Xsf$ , $y \in \RR$ and $\lambda \in \RR_+$ .

The next proposition states conditions for global stability in settings where aggregators have the Blackwell property.

Proof

Let the primitives be as stated. In view of Lemma 2.2.4, and taking into account the fact that $K$ is order preserving, we need only show that there exists a $\beta \in (0,1)$ with $K(v + \lambda) \leq Kv + \beta \lambda$ for all $v \in V$ and $\lambda \in \RR_+$ . To see this, fix $v \in V$ and $\lambda \in \RR_+$ . Applying constant-subadditivity of $R$ and monotonicity of $A$ , we have

K(v + \lambda) = A(\cdot, R(v + \lambda)) \leq A(\cdot, Rv + \lambda).

Since $A$ is a Blackwell aggregator, the last term is bounded by $A(\cdot, Rv) + \beta \lambda$ with $\beta < 1$ . Hence $K(v+\lambda) \leq Kv + \beta \lambda$ , and $K$ is a contraction of modulus $\beta$ on $V$ . ◻

The stability of time additive preferences is a special case of Proposition 7.3.3.

7.3.2.2The Risk-Sensitive Case¶

We can now complete the proof of Proposition 7.2.2, which concerned global stability of the Koopmans operator generated by risk-sensitive preferences.

7.3.2.3Quantile Preferences¶

Consider a setting where $V = \RR^\Xsf$ and $K_\tau \coloneq A_{\textsc{ADD}} \circ R_\tau$ . That is,

(K_\tau v)(x) = r(x) + \beta (R_\tau v)(x) \qquad (x \in \Xsf),

(7.20)

for $\beta \in (0,1)$ , $\tau \in [0,1]$ , $r \in \RR^\Xsf$ and $R_\tau$ as described in Exercise 7.3.4. Since $R_\tau$ is constant-subadditive (Exercise 7.3.7) and the additive aggregator is Blackwell, $K_\tau$ is globally stable (Proposition 7.3.3). The operator $K_\tau$ represents quantile preferences, as described in Castro & Galvao (2019) and other studies (see Section 7.4). The value $\tau$ parameterizes attitude to risk, a point we return to in Section 8.2.1.4.

7.3.3Uzawa Aggregation¶

Let’s consider the Koopmans operator $K = A_{\textsc{UZAWA}} \circ R$ , where $V$ is some subset of $\RR^\Xsf$ and $R$ is a certainty equivalent operator on $V$ . In particular,

(Kv)(x) = r(x) + b(x) (Rv)(x) \qquad (x \in \Xsf, \; v \in V),

(7.21)

with $r, b \in \RR^\Xsf$ and $b \geq 0$ . We are interested in conditions that imply $K$ is globally stable on $V$ .

7.3.3.1The Case of Conditional Expectation¶

Let $V = \RR^\Xsf$ and suppose $R = P$ for some $P \in \mopx$ , so that $R$ is ordinary conditional expectations. Then $K$ becomes $Kv = r + Lv$ where $L \in \lopx$ with $L(x,x') = b(x)P(x,x')$ . By Exercise 1.2.17, $K$ is globally stable on $V$ whenever $\rho(L) < 1$ .

This kind of structure arises when households derive utility from a consumption path while their discount factor fluctuates according to some state variable (see, e.g., Krusell & Smith (1998), Toda (2019), Cao (2020), and Hubmer et al. (2020)). For a given consumption path $(C_t)$ , lifetime values takes the form

v(x) = \EE_x \, \sum_{t=0}^\infty \left( \prod_{i=1}^t \beta_i \right) u(C_t),

(7.22)

where $u$ is a flow utility function and $\{\beta_t\}$ is a discount factor process. Suppose $C_t = c(X_t)$ and $\beta_t = b(X_t)$ where $b \geq 0$ and $(X_t)$ is $P$ -Markov for some $P \in \mopx$ . Set $r \coloneq u \circ c$ and $L(x,x') \coloneq b(x) P(x,x')$ . By Theorem 6.1.1, the condition $\rho(L) < 1$ implies that $v$ in (7.22) is the unique fixed point of $Kv = r + L v = r + b Pv$ . In other words, lifetime value under (7.22) is the unique fixed point of the Koopmans operator when the aggregator is of Uzawa type and the certainty equivalent is conditional expectation.

How does this relate to optimization? Recall our discussion of state-dependent MDPs in Chapter 6. There, the policy operator $T_\sigma$ in (6.16) is a special case of (7.21) when the discount factor depends only on the current state and action.

With some additional requirements, the condition $\rho(L)<1$ is necessary as well as sufficient for existence of a unique fixed point for $Kv = r + Lv$ . Indeed, if $b \gg 0$ and $P$ is irreducible, then $L$ is also irreducible and a positive linear operator. Applying Lemma 6.1.4, we see that $r \gg 0$ and $\rho(L) \geq 1$ implies $Kv = r + Lv$ has no fixed point in $V \coloneq \setntn{v \in \RR^\Xsf}{v \gg 0}$ .

7.3.3.2Stability via Concavity¶

Now consider $Kv = r + b Rv$ from (7.21) when $R$ is not in $\mopx$ . Here $b Rv$ is the pointwise product, so that $(bRv)(x) = b(x) (Rv)(x)$ for all $x$ .

We cannot use Proposition 7.3.3 to prove stability of $K$ unless $b(x) < 1$ for all $x \in \Xsf$ . Since this condition is rather strict, we now study weaker conditions that can be valid even when $b$ exceeds 1 in some states. Specifically, we consider

(a) $b Rv \leq c + Lv$ for some $c \in \RR^\Xsf$ and $L \in \lopx$ with $\rho(L) < 1$ .

(b) $r \gg 0$ and $R$ is concave on $\RR^\Xsf_+$ .

Let $V = [0, \bar v]$ where $\bar v \coloneq (I-L)^{-1}(r + c)$ .

7.3.3.3Epstein–Zin Preferences with State-Dependent Discounting¶

Combining the CES-Uzawa aggregator $A(x, y) = \{r(x)^\alpha + b(x) y^\alpha\}^{1/\alpha}$ with the Kreps–Porteus certainty equivalent operator leads to the Koopmans operator

Kv = \left\{ h + b \left[ Pv^\gamma \right]^{\alpha/\gamma} \right\}^{1/\alpha}, \quad \text{with} \quad h, b \in \RR^\Xsf_+.

(7.23)

A fixed point of $K$ corresponds to lifetime value for an agent with Epstein–Zin preferences and state-dependent discounting. (Such set ups are used in research on macroeconomic dynamics and asset pricing – see Section 7.4 for more details).

In what follows we take $V = (0, \infty)^\Xsf$ and assume that $h, b \in V$ and $P$ is irreducible.

To discuss stability of $K$ we introduce the operator $B \in \lopx$ defined by

(Bv)(x) \coloneq b(x)^\theta \sum_{x'} v(x') P(x, x') \quad \text{where} \;\; \theta \coloneq \frac{\gamma}{\alpha}.

To prove Proposition 7.3.5, we proceed as in Section 7.2.3.3, constructing a conjugate operator $\hat K$ and proving stability of the latter. For this purpose, we introduce

\hat K v = \left\{ h + (B v)^{1/\theta} \right\}^\theta \qquad (v \in V),

(7.24)

Also, let $\Phi$ be defined by $\Phi v = v^\gamma$ .

7.4Chapter Notes¶

The time additive preference structure in Section 7.2.1 was popularized by Samuelson (1939), who built on earlier work by Fisher (1930) and Ramsey (1928). An axiomatic foundation was supplied by Koopmans (1960). Bastianello & Faro (2023) study the foundations of discounted expected utility (DEU) from a purely subjective framework.

Problems with the time additive DEU model include non-constant discounting, as discussed in Section 6.4, as well as sign effects (gains being discounted more than losses) and magnitude effects (small outcomes being discounted more than large ones). See, for example, Thaler (1981) and Benzion et al. (1989). A critical review of the time additive model and a list of many references can be found in Frederick et al. (2002).

In the stochastic setting, the time additive framework is a subset of the expected utility model (Von Neumann & Morgenstern (1944), Friedman (1956), Savage (1951)). There are many well documented departures from expected utility in experimental data. See the start of Andreoni & Sprenger (2012) and the article Ericson & Laibson (2019) for an introduction to the literature. An interesting historical discussion of time additive expected utility can be found in Becker et al. (1989).

(It is ironic that those most responsible for popularizing the time additive DEU framework have also been among the most critical. For example, Samuelson (1939) stated that it is “completely arbitrary” to assume that the DEU specification holds. He goes on to claim that, in the analysis of savings and consumption, it is “extremely doubtful whether we can learn much from considering such an economic man.” In addition, Stokey & Lucas (1989), whose work helped to standardize DEU as a methodology for quantitative analysis, argued in a separate study that DEU is attractive only because of its relative simplicity Lucas & Stokey, 1984.)

Do the departures from time additive expected utility found in experimental data actually matter for quantitative work? Evidence suggests that the answer is affirmative. In macroeconomics and asset pricing in particular, researchers increasingly use non-additive preferences in order to bring model outputs closer to the data. For example, many quantitative models of asset pricing rely heavily on Epstein–Zin preferences. Representative examples include Epstein & Zin (1991), Tallarini Jr (2000), Bansal & Yaron (2004), Hansen et al. (2008), Bansal et al. (2012), Schorfheide et al. (2018), and Groot et al. (2022). Alternative numerical solution methods are discussed in Pohl et al. (2018).

An excellent introduction to recursive preference models can be found in Backus et al. (2004). Our use of the term “Koopmans operator,” which is not entirely standard, honors early contributions by Nobel laureate Tjalling Koopmans on recursive preferences (see Koopmans (1960) and Koopmans et al. (1964)).

Theoretical properties of recursive preference models have been studied in many papers, including Epstein & Zin (1989), Weil (1990), Boyd (1990), Hansen & Scheinkman (2009), Marinacci & Montrucchio (2010), Bommier et al. (2017), Bloise & Vailakis (2018), Marinacci & Montrucchio (2019), Pohl et al. (2019), Balbus (2020), Borovička & Stachurski (2020), DeJarnette et al. (2020), Christensen (2022), and Becker & Rincon-Zapatero (2023). The paper by Marinacci & Montrucchio (2019) provides a useful alternative approach to existence of unique fixed points in the setting of order preserving maps. Experimental results on Epstein–Zin preferences can be found in Meissner & Pfeiffer (2022).

There is a strong connection between risk-sensitive preferences and the literature on robust control. See, for example, Cagetti et al. (2002), Hansen & Sargent (2007), and Barillas et al. (2009). We return to this point in Chapter 8.

The quantile preferences we considered in Section 7.3.2.3 have been analyzed in static and dynamic settings by Giovannetti (2013), Castro & Galvao (2019), Castro & Galvao (2022) and Castro et al. (2022). Recursive components of the analysis of quantile and Uzawa preference models build on the study of monotone preferences in Bommier et al. (2017).

Some recursive preference specifications involve ambiguity aversion. An introduction to this literature and its applications can be found in Klibanoff et al. (2009), Hayashi & Miao (2011), Hansen & Miao (2018), Bommier et al. (2019) and Hansen & Sargent (2020). Marinacci et al. (2023) discuss the connection between recursivity and attitudes to uncertainty. We discuss ambiguity again in Chapter 8.

Recursive preferences are increasingly applied outside the field of asset pricing, where they first came to prominence. See, for example, Bommier & Villeneuve (2012), Colacito et al. (2018), Jensen (2019), or Augeraud-Véron et al. (2019).

The coin flip application in Section 7.2.1.2 is related to correlation aversion, as discussed in Stanca (2023), and preference for “consumption spreads” as reviewed in Frederick et al. (2002).

Some applications of Theorem 7.1.3 to network analysis can be found in Sargent & Stachurski (2023).

Footnotes¶

Epstein–Zin preferences were popularized in Epstein & Zin (1989). They are a special case of preferences defined by Kreps & Porteus (1978). Further discussion can be found in Section 7.4.
↩
We could try to truncate the interval to a neighborhood of the fixed point and hope that $\hat K$ is a contraction when restricted to this interval. But in higher dimensions we are not sure that a fixed point exists for a broad range of parameters, which makes this idea hard to implement.
↩

References¶

Barbu, V., & Precupanu, T. (2012). Convexity and Optimization in Banach Spaces. Springer Science & Business Media.
Fajgelbaum, P. D., Schaal, E., & Taschereau-Dumouchel, M. (2017). Uncertainty traps. The Quarterly Journal of Economics, 132(4), 1641–1692.
Du, Y. (1990). Fixed points of increasing operators in ordered Banach spaces and applications. Applicable Analysis, 38(01–02), 1–20.
Zhang, Z. (2012). Variational, Topological, and Partial Order Methods with Their Applications (Vol. 29). Springer.
Stachurski, J., Wilms, O., & Zhang, J. (2022). Unique solutions to power-transformed affine systems. arXiv, 2212.00275.
Kleinman, B., Liu, E., & Redding, S. J. (2023). Dynamic spatial general equilibrium. Econometrica, 91(2), 385–424.
Liao, J., & Berg, A. (2018). Sharpening Jensen’s inequality. The American Statistician.
Tallarini Jr, T. D. (2000). Risk-sensitive real business cycles. Journal of Monetary Economics, 45(3), 507–532.
Barillas, F., Hansen, L. P., & Sargent, T. (2009). Doubts or variability? Journal of Economic Theory, 144(6), 2388–2418.
Kreps, D. M., & Porteus, E. L. (1978). Temporal resolution of uncertainty and dynamic choice theory. Econometrica, 46(1), 185–200.
Bullen, P. S. (2003). Handbook of Means and Their Inequalities (Vol. 560). Springer Science & Business Media.
Föllmer, H., & Knispel, T. (2011). Entropic risk measures: Coherence vs. convexity, model ambiguity and robust large deviations. Stochastics and Dynamics, 11(02n03), 333–351.
Albuquerque, R., Eichenbaum, M., Luo, V. X., & Rebelo, S. (2016). Valuation risk and asset pricing. The Journal of Finance, 71(6), 2861–2904.
Schorfheide, F., Song, D., & Yaron, A. (2018). Identifying long-run risks: A Bayesian mixed-frequency approach. Econometrica, 86(2), 617–654.
Havranek, T., Horvath, R., Irsova, Z., & Rusnak, M. (2015). Cross-country heterogeneity in intertemporal substitution. Journal of International Economics, 96(1), 100–118.

7 Nonlinear Valuation

7.1Beyond Contraction Maps¶

7.1.1Knaster–Tarski for Function Space¶

7.1.2Concavity, Convexity, and Stability¶

7.1.2.1The One-Dimensional Case¶

7.1.2.2The Multidimensional Case¶

7.1.3A Power-Transformed Affine Equation¶

7.2Recursive Preferences¶

7.2.1Motivation: Optimal Savings¶

7.2.1.1A Recursive View of a Standard Model¶

7.2.1.2Limitations of Time Additive Preferences¶

7.2.2Risk-Sensitive Preferences¶

7.2.2.1Lifetime Utility¶

7.2.2.2Risk-Adjusted Expectation¶

7.2.2.3Existence and Uniqueness¶

7.2.2.4The Gaussian Case¶

7.2.3Epstein–Zin Preferences¶

7.2.3.1Specification¶

7.2.3.2Properties of the Koopmans Operator¶

7.2.3.3Proof of the Stability Result¶

7.2.3.4Why Not Use Contractivity?¶

7.3General Representations¶

7.3.1Koopmans Operators¶

7.3.1.1Certainty Equivalents¶

7.3.1.2Properties¶

7.3.1.3Monotonicity¶

7.3.1.4Aggregation¶

7.3.1.5Building Koopmans Operators¶

7.3.1.6Comments on CES Aggregation¶

7.3.1.7Lifetime Value¶

7.3.1.8Monotone Lifetime Values¶

7.3.2A Blackwell-Type Condition¶

7.3.2.1Blackwell Aggregators¶

7.3.2.2The Risk-Sensitive Case¶

7.3.2.3Quantile Preferences¶

7.3.3Uzawa Aggregation¶

7.3.3.1The Case of Conditional Expectation¶

7.3.3.2Stability via Concavity¶

7.3.3.3Epstein–Zin Preferences with State-Dependent Discounting¶

7.4Chapter Notes¶