In this chapter we describe how to extend the MDP model to handle time-varying discount factors, a specification now widely used in macroeconomics and finance.
We introduce formulas for infinite-horizon lifetime valuations under stochastic discounting and provide necessary and sufficient conditions for existence of finite solutions.
In Section 3.2.2.2 we discussed firm valuation in a setting where the interest rate is constant. But data show that interest rates are time-varying, even for safe assets such as US Treasury bills. Figure Figure 6.1 shows nominal interest rate on one year Treasury bills since the 1950s, whereas Figure Figure 6.2 shows an estimate of the real interest rate for 10 year T-bills since 2012. Both nominal and real interest rates are evidently time varying.
Figure 6.1:Nominal US interest rates (plot_interest_rates_nominal.jl)
Figure 6.2:Real US interest rates (plot_interest_rates_real.jl)
The sequence (βt)t⩾0 is called a discount factor process and ∏i=0tβi is the discount factor for period t payoffs evaluated at time zero. We are interested in expected discounted sums of the form
In (6.11) we passed expectations through an infinite sum. This operation is valid under the assumption ρ(L)<1. A complete proof can be found in Section B.2.
First we develop an alternative representation of the spectral radius based on expectations. The next result is proved via a local spectral radius argument. In the statement, βt is as defined in (6.3) and L is the operator in (6.5).
The expression in (6.13) connects the spectral radius with the long-run properties of the discount factor process. The connection becomes even simpler when P is irreducible, as the next exercise asks you to show.
Exercise 6.3 shows that the spectral radius is a long-run (geometric) average of the discount factor process. For the conclusions of Theorem 6.1.1 to hold, we need this long-run average to be less than unity.
Figure Figure 6.3 illustrates the condition ρ(L)<1 when βt=Xt and P is a Markov matrix produced by discretization of the AR1 process
The discussion in Section 3.1.3 tells us that the stationary distribution ψ∗ of (6.17) is normally distributed with mean μ and standard deviation s. The parameter a controls autocorrelation. In the figure we set μ to 0.96, which, since βt=Xt, is the stationary mean of the discount factor process. The parameters a and s are varied in the figure, and the contour plot shows the corresponding value of ρ(L). The process (6.17) is discretized via the Tauchen method with the size of the state space set to 6 (which avoids negative values for β(x)).
Figure 6.3:ρ(L) for different values of (a,s) (discount_spec_rad.jl)
The figure shows that ρ(L) tends to increase with both the volatility and the autocorrelation of the state process. This seems natural given the expression on the right hand side of (6.15), since sequences of large values of βi compound in the product ∏i=0tβi, pushing up the long-run average value, and such sequences occur more often when autocorrelation and volatility are large.
We finish this section with a lemma that simplifies computation of the spectral radius in settings where the process (βt) depends only on a subset of the state variables -- a setting that is common in applications. In the statement of the lemma, the state space X takes the form X=Y×Z. We fix Q∈M(RZ) and R∈M(RY). The discount operator L is
Let (Zt) and (Yt) be Q-Markov and R-Markov respectively, so that, with P as the pointwise product of Q and R, the process (Xt):=((Zt,Yt)) is P-Markov. We set LZ(z,z′):=b(z,z′)Q(z,z′).
In Section 6.1.1 we studied settings where lifetime value is a function v on a state space X that satisfies an equation of the form v=h+Lv. The unknown is v∈RX where X is a finite set, h∈RX is given and L is a linear operator from RX to itself. We discussed the fact that ρ(L)<1 is sufficient for v=h+Lv to have a unique solution.
In some settings the condition ρ(L)<1 is also necessary. For example, let
State-dependent discounting breaks the contractivity properties that we exploited in Chapter 5, when we studied optimality of MDPs (see, e.g., the proof of Proposition 5.1.1). Here we introduce a generalization of Banach’s fixed-point theorem that can deliver global stability under weaker conditions. For the remainder of this section, X is any finite set.
Example 6.1.2 illustrates the connection between Theorem 6.1.5 and the Neumann series lemma. Theorem 6.1.5 is more general because it can be applied in nonlinear settings. But the Neumann series lemma remains important because, when applicable, it provides inverse and power series representations of the fixed point.
On one hand, if T is a contraction map on U⊂RX with respect to a given norm ∥⋅∥a, we cannot necessarily say that T is a contraction with respect to some other norm ∥⋅∥b on RX. On the other hand, if T is an eventual contraction on U with respect to some given norm on RX, then T is eventually contracting with respect to every norm on RX. The next exercise asks you to verify this.
In Section 2.2.3.3 we studied a sufficient condition for order-preserving self maps to be contractions. The next proposition provides an analogous result for eventual contractions. In the statement of the proposition, U is a subset of RX such that v,c∈U and c⩾0 implies v+c∈U.
We can now turn to dynamic programs in which the objective is to maximize a lifetime value in the presence of state-dependent discounting. First, we present an extension of the MDP model from Chapter 5 that admits state-dependent discounting. Then we provide weak conditions under which optimal policies exist and Bellman’s principle of optimality holds.
We are ready to extend the MDP model to include state-dependent discounting. We construct a framework and then provide weak conditions for optimality based on spectral radius methods.
To provide a framework for dynamic programs with state-dependent discounting, we begin with an MDP (Γ,β,r,P) with state space X, action space A and feasible state action pairs G. We then replace the constant discount factor β with a function β from G×X to R+. We call the resulting model an MDP with state-dependent discounting. The Bellman equation takes the form
where x∈X and v∈RX. Notice that the discount factor depends on all relevant information: The current action, the current state and the stochastically determined next period state.
For MDPs with state-dependent discounting, we can obtain standard optimality results by assuming a that there exists a b<1 such that β(x,a,x′)⩽b for all (x,a,x′)∈G×X. In this setting it is easy to show that lifetime values are finite, and to extend the optimality results for regular MDPs found in Proposition 5.1.1.
Unfortunately, the assumption discussed in the previous paragraph is too strict for many applications. (We return to this point in Section 6.2.1.6.) We will state an optimality result under weaker conditions.
Notice that we can now write (6.29) as Tσv=rσ+Lσv. In line with our discussion of MDPs in Chapter 5, when Tσ has a unique fixed point we denote it by vσ and interpret it as lifetime value.
As discussed, the value vσ(x) has the interpretation of lifetime value of policy σ conditional on initial state x. We can reinforce this interpretation by connecting Lemma 6.2.1 to Theorem 6.1.1. The next exercise asks you to work through all the steps.
Given v∈RX, a policy σ is called v-greedy if σ(x) is a maximizer of the right-hand side of (6.35) for all x in X. Equivalently, σ is v-greedy whenever Tσv=Tv.
When Assumption 6.2.1 holds and, as a result, Tσ has a unique fixed point vσ for each σ∈Σ, we let v∗ denote the value function, which is defined as v∗:=∨σ∈Σvσ. As for the regular MDP case, a policy σ is called optimal if vσ=v∗.
We can now state our main optimality result for MDPs with state-dependent discounting.
Algorithms for solving an MDP with state-dependent discounting include value function iteration (VFI), Howard policy iteration (HPI), and optimistic policy iteration (OPI). The algorithms for VFI and OPI are identical to those given for regular MDPs (see Section 5.1.4), provided that the correct operators T and Tσ are used, and that the definition of a v-greedy policy is as given in Section 6.2.1.1. The algorithm for HPI is almost identical, with the only change being that computation of lifetime values involves Lσ. Details are given in Algorithm 6.1.
We prove in Chapter 8 that, under the conditions of Assumption 6.2.1, VFI, OPI and HPI are all convergent, and that HPI converges to an exact optimal policy in a finite number of steps.
Some applications use an exogenous state component to drive a discount factor process. In this section we set up such a model and obtain optimality conditions by applying Proposition 6.2.2.
The first step is to decompose the state Xt into a pair (Yt,Zt), where (Yt)t⩾0 is endogenous (i.e., affected by the actions of the controller) and (Zt)t⩾0 is purely exogenous. In particular, the primitives consist of
This exogenous discount model is a special case of the general MDP with state-dependent discounting. Indeed, we can write (6.36) as (6.35) by setting x:=(y,z) and defining
The following proposition provides a relatively simple sufficient condition for the core optimality results in the setting of the exogenous discount model.
In Section 6.2.1.2 we mentioned that requiring supβ<1 is too strict for some applications. For example, the real interest rate rt shown in Figure Figure 6.2 is sometimes negative. Using long historical records, Farmer et al. (2023) find that the discount rate is negative around 1/3 of the time. This means that the associated discount factor βt=1/(1+rt) is sometimes greater than 1 and supβ<1 fails.
In macroeconomics, empirically motivated time-varying discount factor specifications lead to models where βt>1 occurs with positive probability. For example, Hills et al. (2019) study a model that can be embedded in the MDP framework just described. Figure Figure 6.4 shows a simulation of one of the discount factor processes used in their model, prior to discretization. The exogenous state and discount factor process takes the form βt=bZt, where (Zt) is an exogenous state obeying Zt+1=1−ρ+ρZt+σϵt+1 with (ϵt) iid and standard normal. Clearly supβ<1 fails for this model too.
Let’s now consider the weaker condition ρ(L)<1 described in Proposition 6.2.3 and check whether it holds. Following Hills et al. (2019), we discretize the dynamics of (Zt) via a Tauchen approximation, producing a stochastic matrix Q on a finite set Z.[2] The set of values for βt ranges between 0.95 and 1.04, so that βt>1 remains possible. Nonetheless, with L(z,z′)=β(z)Q(z,z′) we obtain ρ(L)=0.9996. Hence Proposition 6.2.3 applies.
Figure 6.4:Discount factor process (β)t⩾0 in Hills et al. (2019).
at each x∈X, where X:={0,…,K}, x is the current inventory level, a is the current inventory order, r(x,a) is current profits (defined in (5.12)), f(x,a,d):=(x−d)∨0+a and d is an iid demand shock with distribution ϕ. Let’s now add a time-varying discount rate and investigate its impact on optimal choices.
We add time-varying discounting by replacing the constant β in (6.40) with a stochastic process (βt) where βt=1/(1+rt). We suppose that the dynamics can be expressed as βt=β(Zt), where the exogenous process (Zt)t⩾0 is Q-Markov on Z. After relabeling the endogenous state Xt as Yt and x as y, in line with the notation in Section 6.2.1.5, the Bellman equation becomes v(y,z)=maxa∈Γ(y,z)B((y,z),a,v) where
then R(y,a,y′) is the probability of realizing next period inventory level y′ when the current level is y and the action is a. Hence we can rewrite (6.41) as
We have now created a version of the MDP with exogenous state-dependent discounting described in Section 6.2.1.5. Letting L(z,z′):=β(z)Q(z,z′) and applying Proposition 6.2.3, we see that all of the standard optimality results hold whenever ρ(L)<1.
Figure Figure 6.5 shows how inventory evolves under an optimal program when the parameters of the problem are as given in Listing 1. (The code preallocates and computes arrays representing r, R, and Q in (6.43) and includes a test for ρ(L)<1.) We set β(z)=z and take (Zt) to be a discretization of an AR(1) process. Figure Figure 6.5 was created by simulating (Zt) according to Q and inventory (Yt) according to Yt+1=(Yt−Dt+1)∨0+At, where At follows the optimal policy.
The outcome is similar to Figure Figure 5.7, in the sense that inventory falls slowly and then jumps up. As before, fixed costs induce this lumpy behavior. However, a new phenomenon is now present: Inventories trend up when interest rates fall and down when they rise. (The interest rate rt is calculated via βt=1/(1+rt) at each t.) High interest rates foreshadow high interest rates due to positive autocorrelation (ρ>0), which in turn devalue future profits and hence encourage managers to economize on stock.
Figure 6.5:Inventory dynamics with time-varying interest rates
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
using LinearAlgebra, Random, Distributions, QuantEcon
f(y, a, d) = max(y - d, 0) + a # Inventory update
function create_sdd_inventory_model(;
ρ=0.98, ν=0.002, n_z=20, b=0.97, # Z state parameters
K=40, c=0.2, κ=0.8, p=0.6, # firm and demand parameters
d_max=100) # truncation of demand shock
ϕ(d) = (1 - p)^d * p # demand distribution
d_vals = collect(0:d_max)
ϕ_vals = ϕ.(d_vals)
y_vals = collect(0:K) # inventory levels
n_y = length(y_vals)
mc = tauchen(n_z, ρ, ν)
z_vals, Q = mc.state_values .+ b, mc.p
ρL = maximum(abs.(eigvals(z_vals .* Q)))
@assert ρL < 1 "Error: ρ(L) ≥ 1."
R = zeros(n_y, n_y, n_y)
for (i_y, y) in enumerate(y_vals)
for (i_y′, y′) in enumerate(y_vals)
for (i_a, a) in enumerate(0:(K - y))
hits = f.(y, a, d_vals) .== y′
R[i_y, i_a, i_y′] = dot(hits, ϕ_vals)
end
end
end
r = fill(-Inf, n_y, n_y)
for (i_y, y) in enumerate(y_vals)
for (i_a, a) in enumerate(0:(K - y))
cost = c * a + κ * (a > 0)
r[i_y, i_a] = dot(min.(y, d_vals), ϕ_vals) - cost
end
end
return (; K, c, κ, p, r, R, y_vals, z_vals, Q)
end
Program 1:Investment model with time-varying discounting (inventory_sdd.jl)
Figure Figure 6.6 shows execution time for VFI and OPI at different choices of m (see Section 6.2.1.3 for the interpretation of m). As for the optimal savings problem we studied in Chapter 5, OPI is around 1 order of magnitude faster when m is close to 50 (cf. Figure Figure 5.8).
Figure 6.6:OPV vs VFI timings for the inventory problem
This section provides a brief introduction to asset pricing in a Markov environment. While the topic of asset pricing is fascinating in its own right, our main aim is to provide additional practice in handling linear valuation problems. (Readers who wish to push ahead with their study of dynamic programming can safely skip to Chapter 7.)
We first discuss risk-neutral pricing and show why this assumption is typically implausible. Next, we introduce stochastic discount factors and stationary asset pricing.
Consider the problem of assigning a current price Πt to an asset that confers on its owner the right to payoff Gt+1. The payoff is stochastic and realized next period. One simple idea is to use risk-neutral pricing, which implies that
for some constant discount factor β∈(0,1). If the payoff is in k periods, then we modify the price to EtβkGt+k. In essence, risk-neutral pricing says that cost equals expected reward, discounted to present value by compounding a constant rate of discount. (A rate of discount, say ρ, is linked to a discount factor, say β, by β=1/(1+ρ)≈exp(−ρ).)
Although risk neutrality allows for simple pricing, assuming risk neutrality for all investors is not plausible.
To give one example, suppose that we take the asset that pays Gt+1 in (6.44) and replace it with another asset that pays Ht+1=Gt+1+ϵt+1, where ϵt+1 is independent of Gt+1, Etϵt+1=0 and Varϵt+1>0. In effect, we are adding risk to the original payoff without changing its mean.
Under risk neutrality, the price of this new asset is
This outcome contradicts the idea that investors typically want compensation for bearing risk.
A helpful way to think about the same point is to consider the rate of return rt+1:=(Gt+1−Πt)/Πt on holding an asset with payoff Gt+1. From (6.44) we have Etβ(1+rt+1)=1, or
Since the right-hand side does not depend on Gt+1, risk neutrality implies that all assets have the same expected rate of return. But this contradicts the finding that, on average, riskier assets tend to have higher rates of return that compensate investors for bearing risk.
To go beyond risk neutral-pricing, let’s start with a model containing one asset and one agent. It is straightforward to price the asset and compare it to the risk neutral case.
A representative agent takes the price Πt of a risky asset as given and solves
is called the stochastic discount factor or pricing kernel. We call this particular form of the pricing kernel shown in (6.52)Lucas stochastic discount factor (Lucas SDF) in honor of Lucas (1978).
In the CRRA case, the Lucas SDF applies heavier discounting to assets that concentrate payoffs in states of the world where the agent is already enjoying strong consumption growth. Conversely, the SDF attaches higher weights to future payoffs that occur when consumption growth is low because such payoffs hedge against the risk of drawing low consumption states.
The standard neoclassical theory of asset pricing generalizes the Lucas discounting specification by assuming only that there exists a positive random variable Mt+1 such that the price of an asset with payoff Gt+1 is
In line with the preceding discussion, Mt+1 is called a stochastic discount factor (SDF). Equation (6.54) generalizes (6.51) by refraining from restricting the SDF (apart from assuming positivity).
Actually, it can be shown that there exists an SDF Mt+1 such that (6.54) is always valid under relatively weak assumptions. In particular, a single SDF Mt+1 can be used to price any asset in the market, so if Ht+1 is a another stochastic payoff then the current price of an asset with this payoff is EtMt+1Ht+1.
We do not prove these claims, since our interest is in understanding forward-looking equations in Markov environments. Some relevant references are listed in Section 6.4.
A common assumption in quantitative applications is that all underlying randomness is driven by a Markov model. In this spirit, we take (Xt) to be P-Markov on finite-state X, where P∈M(RX), and suppose further that the SDF and payoff have the forms
Now we are ready to look at pricing a stationary cash flow over an infinite horizon, a basic problem in asset pricing. We will apply the Markov structure assumed in Section 6.3.1.4. In all that follows, (Xt) is P-Markov on X and Mt+1 is defined as in Section 6.3.1.4.
We seek the time t price, denoted by Πt, for an ex dividend contract on the dividend stream (Dt)t⩾0. The contract provides the owner with the right to the dividend stream. The “ex dividend” component means that, should the dividend stream be traded at time t, the dividend paid at time t goes to the seller rather than the buyer. As a result, purchasing at t and selling at t+1 pays Πt+1+Dt+1. Hence, applying the asset pricing rule (6.54), at time t price Πt of the contract must satisfy
Asset prices can be expressed as infinite sums. Let’s show this for cum dividend contracts (although the case of ex dividend contracts is similar). In Exercise 6.13 you found that the state-contingent price vector π for a cum dividend contract on the dividend stream (Dt)t⩾0 obeys
and ρ(A)<1. As before, Dt=d(Xt) and (Xt)t⩾0 is P-Markov on X. Applying the uniqueness component of the Neumann series lemma and Theorem 6.1.1, we see that the function π also obeys
where Mt+1:=m(Xt,Xt+1) for t⩾0 and M0:=1. This expression agrees with our intuition: The price of the contract is the expected present value of the dividend stream, with the time t dividend discounted by the composite factor M1⋯Mt.
Until now, our discussion of asset pricing has assumed that dividends are stationary. However, dividends typically grow over time, along with other economic measures such as GDP. In this section, we solve for the price of a dividend stream when dividends exhibit random growth.
where κ is a fixed function, (Xt) is the state process and (ηt) is iid. We let ϕ be the density of each ηt and assume that (Xt) is P-Markov on a finite set X. Let’s suppose as before that the SDF obeys Mt+1=m(Xt,Xt+1) for some positive function m.
Since dividends grow over time, so will the price of the asset. As such, we should no longer seek a fixed function π such that Πt=π(Xt) for all t, since the resulting price process (Πt) will fail to grow. Instead, we try to solve for the price-dividend ratioVt:=Πt/Dt, which we hope will be stationary.
After conditioning on Xt=x, (6.65) leads us to conjecture existence of a function v such that
for all x∈X. We understand (6.66) as an equation to be solved for the unknown object v∈RX. If we can find a solution v∗ to (6.66), then setting Vt=v∗(Xt) yields a process (Vt) that obeys (6.65).
The price-dividend process (Vt∗) defined by Vt∗=v∗(Xt) solves (6.65). The price can be recovered via Πt=Vt∗Dt.
6.3.2.2Application: Markov Growth with a Lucas SDF¶
1
2
3
4
5
6
7
8
9
10
11
12
13
using QuantEcon, LinearAlgebra
"Creates an instance of the asset pricing model with Markov state."
function create_asset_pricing_model(;
n=200, # state grid size
ρ=0.9, ν=0.2, # state persistence and volatility
β=0.99, γ=2.5, # discount and preference parameter
μ_c=0.01, σ_c=0.02, # consumption growth mean and volatility
μ_d=0.02, σ_d=0.1) # dividend growth mean and volatility
mc = tauchen(n, ρ, ν)
x_vals, P = exp.(mc.state_values), mc.p
return (; x_vals, P, β, γ, μ_c, σ_c, μ_d, σ_d)
end
Program 2:Asset pricing model with Lucas SDF (pd_ratio.jl)
Figure Figure 6.7 shows the price-dividend ratio function v∗ for the specification given in Listing 2, as well as for an alternative mean dividend growth rate μd. The state process is a Tauchen discretization of an AR(1) process with positive autocorrelation. An increase in the state predicts higher dividends, which tends to increase the price. At the same time, higher x also predicts higher consumption growth, which acts negatively on the price. For values of γ greater than 1, the second effect dominates and the price-dividend ratio slopes down.
Figure 6.7:Price-dividend ratio as a function of the state
In Section 6.3.1.5 we used the Neumann series lemma to solve for the equilibrium price vector π. However, some modifications to the basic model introduce nonlinearities that render the Neumann series lemma inapplicable. For example, Harrison & Kreps (1978) analyze a setting with heterogeneous beliefs and incomplete markets, leading to failure of the standard asset pricing equation. This results in a nonlinear equation for prices.
We treat the Harrison & Kreps (1978) model only briefly. There are two types of agents. Type i believes that the state updates according to stochastic matrix Pi for i=1,2. Agents are risk-neutral, so m(x,y)≡β∈(0,1). Harrison & Kreps (1978) show that, for their model, the equilibrium condition (6.58) becomes
We are assuming d⩾0, so T is indeed a self-map on R+X.
By construction, a vector π∈R+X is a fixed point of T if and only if it is a vector of prices that solves (6.74). Hence, we have successfully converted our equilibrium problem into a fixed point problem.
We aim to show that T is a contraction. To this end, pick any p,q∈R+X. Applying the inequality from Lemma 2.2.2, we obtain
Thus, on R+X, the map T is a contraction of modulus β with respect to the sup norm.
Since R+X is a closed subset of RX, we conclude that T has a unique fixed point in this set. Hence, the system (6.74) has a unique solution π∗ in R+X, representing equilibrium prices. This fixed point can be computed by successive approximation.
Asset pricing is discussed in many sources, including Hansen & Renault (2010), Ross (2009), Cochrane (2009), Duffie (2010) and Campbell (2017). Asset pricing is part of many applications and extensions in macroeconomics, public finance, international economics, and other fields. Some of these are described in Ljungqvist & Sargent (2018).
Dynamic programming with state-dependent discounting is becoming more common in macroeconomics and finance. Representative examples include Krusell & Smith (1998), Woodford (2011), Christiano et al. (2014), Albuquerque et al. (2016), Saijo (2017), Basu & Bundick (2017), Groot et al. (2018), Schorfheide et al. (2018), Hills et al. (2019), Toda (2019), Fagereng et al. (2019), Hubmer et al. (2020) and Cao (2020). For more on the theory of state-dependent discounting, see Jasso-Fuentes et al. (2020), Toda (2021) or Stachurski & Zhang (2021). An analysis of sovereign default with time-varying interest rates is provided by Bloise & Vailakis (2022).
Another challenge to the standard model with constant discount rates comes from empirical and experimental studies that find evidence of “hyperbolic discounting,” where valuations across time fall rapidly at first and then more slowly. Provocative reviews of hyperbolic and quasi-hyperbolic discounting can be found in Frederick et al. (2002) and Rubinstein (2003). Cao & Werning (2018) provide conditions under which predictions from optimal savings models with quasi-hyperbolic discounting are robust. Balbus et al. (2018) analyze uniqueness of time-consistent stationary Markov policies for quasi-hyperbolic households under uncertainty. Balbus et al. (2022) study equilibria in dynamic models with recursive payoffs and generalized discounting. Noor & Takeoka (2022) addresses the topic of optimal discounting. Additional references include Diamond & Köszegi (2003), Dasgupta & Maskin (2005), Karp (2005), Amador et al. (2006), Balbus et al. (2018), Fedus et al. (2019), Hens & Schindler (2020), Jaśkiewicz & Nowak (2021), and Drugeon & Wigniolle (2021).
This chapter focused on time additive models with state-dependent discounting. More general preference specifications with this feature include Albuquerque et al. (2016), Schorfheide et al. (2018), Pohl et al. (2018), Gomez-Cram & Yaron (2020), and Groot et al. (2022). In Chapter 8 we consider state-dependent discounting in general settings that accommodate such nonlinearities.
We are assuming that randomness in interest rates is a function of the same Markov state that influences profits. There is very little loss of generality in making this assumption. In fact, the two processes can still be statistically independent. For example, if we take Xt to have the form Xt=(Yt,Zt), where (Yt) and (Zt) are independent Markov chains, then we can take βt to be a function of Yt and πt to be a function of Zt. The resulting interest and profit processes are statistically independent.
The parameters are ρ=0.85, σ=0.0062, and b=0.99875. In line with Hills et al. (2019), we discretize the model via mc = tauchen(n, ρ, σ, 1 - ρ, m){.julia} with m=4.5 and n=15.
Krusell, P., & Smith, A. A., Jr. (1998). Income and wealth heterogeneity in the macroeconomy. Journal of Political Economy, 106(5), 867–896.
Marimon, R. (1984). General Equilibrium and Growth under Uncertainty: the Turnpike Property.
Marimon, R. (1989). Stochastic turnpike property and stationary equilibrium. Journal of Economic Theory, 47(2), 282–306.
Stachurski, J., & Zhang, J. (2021). Dynamic programming with state-dependent discounting. Journal of Economic Theory, 192, 105190.
Farmer, J. D., Geanakoplos, J., Richiardi, M. G., Montero, M., Perelló, J., & Masoliver, J. (2023). Discounting the distant future: What do historical bond prices imply about the long term discount rate? [Techreport]. arXiv, 2312.17157.
Hills, T. S., Nakata, T., & Schmidt, S. (2019). Effective lower bound risk. European Economic Review, 120, 103321.
Cochrane, J. H. (2009). Asset Pricing: Revised Edition. Princeton University Press.
Lucas, R. E. (1978). Asset prices in an exchange economy. Econometrica, 46(6), 1429–1445.
Harrison, J. M., & Kreps, D. M. (1978). Speculative investor behavior in a stock market with heterogeneous expectations. The Quarterly Journal of Economics, 92(2), 323–336.
Hansen, L. P., & Renault, E. (2010). Pricing kernels. Encyclopedia of Quantitative Finance.
Ross, S. A. (2009). Neoclassical Finance. Princeton University Press.
Duffie, D. (2010). Dynamic Asset Pricing Theory. Princeton University Press.
Campbell, J. Y. (2017). Financial Decisions and Markets: A Course in Asset Pricing. Princeton University Press.
Ljungqvist, L., & Sargent, T. (2018). Recursive Macroeconomic Theory (4th ed.). MIT Press.
Woodford, M. (2011). Simple analytics of the government expenditure multiplier. American Economic Journal: Macroeconomics, 3(1), 1–35.