Intertemporal Portfolio Choice

Introduction

The two-date portfolio problem studied in the previous notebook is a useful starting point, but it misses an important feature of real investment decisions: the investment opportunity set changes over time. Interest rates fluctuate, risk premia vary, and volatility clusters. A forward-looking investor cares not only about the return on her portfolio today, but also about how the portfolio performs when future investment opportunities are favorable or unfavorable.

This notebook extends the portfolio choice problem to an infinite horizon, where asset returns depend on a vector of state variables z_t that evolves over time. The state variables summarize everything relevant about the current investment opportunity set — dividend yields, yield spreads, volatility indices, or any other variable with predictive content for future returns.

The extension has two major consequences. First, the stochastic discount factor (SDF) now depends on both the level of wealth and the state of the investment opportunity set. Second, the optimal portfolio is no longer a pure mean-variance portfolio: agents also take positions to hedge against adverse shifts in the investment opportunity set, a phenomenon known as the hedging demand component of Merton’s ICAPM (Merton 1973).

We proceed in four steps. We begin by deriving the Bellman equation for the infinite-horizon problem and its first-order conditions. The envelope condition then establishes that the marginal value of wealth equals marginal utility, delivering the SDF. A linear approximation of the value function yields a tractable SDF, a multi-factor risk premium formula, and an explicit expression for the optimal portfolio.

The Infinite-Horizon Problem

An investor with wealth W_t chooses a consumption plan \{c_{t+i}\}_{i=0}^\infty and a portfolio plan \{\pmb{\alpha}_{t+i}\}_{i=0}^\infty to maximize expected discounted utility: \max_{\{c_{t+i},\,\pmb{\alpha}_{t+i}\}_{i=0}^\infty} \operatorname{E}_t\left(\sum_{i=0}^\infty \beta^i\, u(c_{t+i})\right), subject to the wealth accumulation equation W_{t+1} = r^w_{t+1}(W_t - c_t), \qquad r^w_{t+1} = \pmb{\alpha}_t'\left(\mathbf{r}_{t+1} - r^f_{t+1}\pmb{\imath}\right) + r^f_{t+1}, Here \pmb{\alpha}_t is the vector of risky-asset portfolio weights, \mathbf{r}_{t+1} is the vector of risky gross returns, and r^f_{t+1} is the gross risk-free return. Current savings W_t - c_t are invested between t and t+1, so next period’s wealth is the only resource available for future consumption.

The investment opportunity set varies over time through a vector of state variables z_t. These state variables evolve exogenously and affect the conditional distribution of returns. For example, risky returns may satisfy \mathbf{r}_{t+1} = \operatorname{E}(\mathbf{r}_{t+1}\mid z_t) + \sigma(\mathbf{r}_{t+1}\mid z_t)\,\varepsilon_{t+1}, where \varepsilon_{t+1} is a mean-zero innovation. The relevant state at date t is therefore the pair (W_t, z_t): wealth summarizes current resources, while z_t summarizes current investment opportunities.

Bellman Equation

The value function gives the highest lifetime utility attainable from a given state (W_t, z_t): V(W_t, z_t) = \max_{\{c_{t+i},\,\pmb{\alpha}_{t+i}\}_{i=0}^\infty} \operatorname{E}_t\left(\sum_{i=0}^\infty \beta^i\, u(c_{t+i})\right), where the maximum is over all feasible consumption and portfolio policies from date t onward.

To obtain the recursive formulation, separate current utility from continuation utility: V(W_t, z_t) = \max_{\{c_t,\,\pmb{\alpha}_t\}} \left\{ u(c_t) + \max_{\{c_{t+1+i},\,\pmb{\alpha}_{t+1+i}\}_{i=0}^\infty} \operatorname{E}_t\left(\sum_{i=1}^\infty \beta^i\, u(c_{t+i})\right) \right\}.

Re-index the continuation sum and apply the law of iterated expectations, \operatorname{E}_t[\cdot] = \operatorname{E}_t[\operatorname{E}_{t+1}[\cdot]]: \max_{\{c_{t+1+i},\,\pmb{\alpha}_{t+1+i}\}_{i=0}^\infty} \operatorname{E}_t\left(\sum_{i=1}^\infty \beta^i\, u(c_{t+i})\right) = \operatorname{E}_t\left[ \beta\, \max_{\{c_{t+1+i},\,\pmb{\alpha}_{t+1+i}\}_{i=0}^\infty} \operatorname{E}_{t+1}\left(\sum_{i=0}^\infty \beta^i\, u(c_{t+1+i})\right) \right].

After c_t and \pmb{\alpha}_t are chosen, the problem at t+1 has the same form as the original one, but starts from the random state (W_{t+1}, z_{t+1}). The inner maximization is therefore V(W_{t+1}, z_{t+1}), and the Bellman equation becomes V(W_t, z_t) = \max_{\{c_t,\,\pmb{\alpha}_t\}}\left\{u(c_t) + \beta\,\operatorname{E}_t\left[V(W_{t+1}, z_{t+1})\right]\right\}, subject to W_{t+1} = r^w_{t+1}(W_t - c_t), \qquad r^w_{t+1} = \pmb{\alpha}_t'(\mathbf{r}_{t+1} - r^f_{t+1}\pmb{\imath}) + r^f_{t+1}.

The Bellman equation compresses the infinite-horizon problem into a sequence of identical one-period decisions. At each date, the investor chooses current consumption and portfolio weights, recognizing that today’s choices determine tomorrow’s state.

First-Order Conditions

The Bellman equation has two control variables, current consumption c_t and portfolio weights \pmb{\alpha}_t. Differentiating with respect to each control, and using \partial W_{t+1}/\partial c_t = -r^w_{t+1} and \partial W_{t+1}/\partial \pmb{\alpha}_t = (\mathbf{r}_{t+1} - r^f_{t+1}\pmb{\imath})(W_t - c_t), gives the first-order conditions:

FOC for consumption: u'(c_t) = \operatorname{E}_t\left[\beta\, r^w_{t+1}\, V_W(W_{t+1}, z_{t+1})\mid z_t\right],

FOC for portfolio weights: \operatorname{E}_t\left[\beta\,(\mathbf{r}_{t+1} - r^f_{t+1}\pmb{\imath})\,(W_t - c_t)\, V_W(W_{t+1}, z_{t+1})\mid z_t\right] = \mathbf{0},

Here V_W \equiv \partial V/\partial W denotes the marginal value of wealth. In an interior solution, savings W_t - c_t are positive, so this common factor can be divided out of the portfolio condition: \operatorname{E}_t\left[\beta\,\mathbf{r}^{\,e}_{t+1}\, V_W(W_{t+1}, z_{t+1})\mid z_t\right] = \mathbf{0}, where \mathbf{r}^{\,e}_{t+1} = \mathbf{r}_{t+1} - r^f_{t+1}\pmb{\imath} is the vector of excess returns.

These two conditions have a simple interpretation. The consumption FOC says that the marginal utility lost by consuming one less unit today must equal the expected discounted marginal value of the extra wealth carried into tomorrow. The portfolio FOC says that, at the optimum, no marginal reallocation toward any risky asset can improve lifetime utility once returns are weighted by the future marginal value of wealth.

Envelope Condition

Let \left(c_t^*(W_t, z_t),\, \pmb{\alpha}_t^*(W_t, z_t)\right) denote the optimal policies. Substituting them into the Bellman equation gives V(W_t, z_t) = u\left(c_t^*\right) + \operatorname{E}_t\left[\beta\, V\left(r^{w*}_{t+1}(W_t - c_t^*),\, z_{t+1}\right)\right].

To see how the value function responds to an extra dollar of wealth, differentiate both sides with respect to W_t: \begin{aligned} V_W(W_t, z_t) &= u'(c_t^*)\,\frac{\partial c_t^*}{\partial W_t} + \operatorname{E}_t\left[\beta\, V_W(W_{t+1}, z_{t+1})\left( \frac{\partial \pmb{\alpha}_t^{*\prime}}{\partial W_t}\mathbf{r}^{\,e}_{t+1}(W_t - c_t^*) + r^{w*}_{t+1}\left(1 - \frac{\partial c_t^*}{\partial W_t}\right) \right)\middle| z_t\right] \\[6pt] &= \underbrace{\left(u'(c_t^*) - \operatorname{E}_t\left[\beta\, r^{w*}_{t+1}\, V_W(W_{t+1}, z_{t+1})\right]\right)}_{=\,0\ \text{by FOC for }c_t} \frac{\partial c_t^*}{\partial W_t} \\ &\quad + \beta\,\frac{\partial \pmb{\alpha}_t^{*\prime}}{\partial W_t}(W_t - c_t^*) \underbrace{\operatorname{E}_t\left[V_W(W_{t+1}, z_{t+1})\,\mathbf{r}^{\,e}_{t+1}\right]}_{=\,\mathbf{0}\ \text{by FOC for }\pmb{\alpha}_t} + \operatorname{E}_t\left[\beta\, V_W(W_{t+1}, z_{t+1})\, r^{w*}_{t+1}\right]. \end{aligned}

The first two terms vanish by the first-order conditions, leaving the envelope condition: V_W(W_t, z_t) = \operatorname{E}_t\left[\beta\, V_W(W_{t+1}, z_{t+1})\, r^{w*}_{t+1}\right] = u'(c_t^*(W_t, z_t)).

This is the same logic as in the two-date problem. At the optimum, the marginal value of one more dollar of wealth equals the marginal utility of consuming that dollar today. The middle expression shows the same dollar viewed as an investment: its value is the expected discounted marginal utility tomorrow, scaled by the portfolio return.

Stochastic Discount Factor

The envelope condition suggests the natural one-period pricing kernel: m_{t+1} = \beta\,\frac{V_W(W_{t+1}, z_{t+1})}{V_W(W_t, z_t)} = \beta\,\frac{u'(c_{t+1}^*)}{u'(c_t^*)}.

The first-order conditions imply that m_{t+1} prices excess returns. From the portfolio FOC, \operatorname{E}_t\left[m_{t+1}\,\mathbf{r}^{\,e}_{t+1}\mid z_t\right] = \mathbf{0} \implies \operatorname{E}_t\left[m_{t+1}\,\mathbf{r}_{t+1}\mid z_t\right] = r^f_{t+1}\,\operatorname{E}_t\left[m_{t+1}\mid z_t\right]\pmb{\imath}. The consumption FOC and the envelope condition also imply \operatorname{E}_t\left[m_{t+1} r^w_{t+1}\mid z_t\right] = 1. Since the risk-free asset is also traded, it must satisfy the same pricing equation: \operatorname{E}_t\left[m_{t+1} r^f_{t+1}\mid z_t\right] = r^f_{t+1}\,\operatorname{E}_t\left[m_{t+1}\mid z_t\right] = 1. Therefore \operatorname{E}_t\left[m_{t+1}\,\mathbf{r}_{t+1}\mid z_t\right] = \pmb{\imath}, so m_{t+1} is a valid SDF. The representation m_{t+1} = \beta u'(c_{t+1})/u'(c_t) is the familiar consumption-based pricing kernel of the intertemporal asset-pricing literature (Lucas 1978; Breeden 1979). The equivalent form m_{t+1} = \beta V_W(W_{t+1}, z_{t+1})/V_W(W_t, z_t) is often more convenient in dynamic settings because it expresses prices directly in terms of the value function.

Lucas, Robert E. 1978. “Asset Prices in an Exchange Economy.” Econometrica 46 (6): 1429–45.
Breeden, Douglas T. 1979. “An Intertemporal Asset Pricing Model with Stochastic Consumption and Investment Opportunities.” Journal of Financial Economics 7 (3): 265–96.

Linearization

To move beyond the formal Bellman equation, we need an approximation for the marginal value of wealth. Rather than solve for V in closed form, we linearize V_W around the current state (W_t, z_t): V_W(W_{t+1}, z_{t+1}) \approx V_W(W_t, z_t) + V_{WW}(W_t, z_t)(W_{t+1} - W_t) + V_{Wz}(W_t, z_t)'(z_{t+1} - z_t), where V_{WW} = \partial^2 V/\partial W^2 and V_{Wz} = \partial^2 V/\partial W\partial z is the vector of cross-derivatives with respect to wealth and the state variables.

Divide through by V_W(W_t, z_t) and introduce three objects that summarize the local behavior of the value function:

  • Relative risk aversion: rra_t = -\dfrac{W_t\, V_{WW}(W_t, z_t)}{V_W(W_t, z_t)} \geq 0, which captures the curvature of indirect utility.
  • Consumption-to-wealth ratio: \delta_t = c_t/W_t \in (0,1).
  • Hedging demand coefficients: h_t = -V_{Wz}(W_t, z_t)/V_W(W_t, z_t), which measure how the marginal value of wealth responds to movements in the state variables.

The wealth dynamics translate the Taylor expansion into return space. Since W_{t+1} = r^w_{t+1}(W_t - c_t) = r^w_{t+1}W_t(1-\delta_t), \frac{W_{t+1} - W_t}{W_t} = r^w_{t+1}(1 - \delta_t) - 1.

Substituting into the definition of the SDF yields the linear SDF approximation m_{t+1} \approx \beta\left[1 - rra_t\left(r^w_{t+1}(1-\delta_t) - 1\right) - h_t'\,\Delta z_{t+1}\right], where \Delta z_{t+1} = z_{t+1} - z_t. This approximation makes the economics transparent. The SDF moves with two kinds of shocks: shocks to next-period wealth, scaled by risk aversion, and shocks to the state variables, scaled by the hedging coefficients. If the opportunity set were constant so that z_t did not move, the second channel would disappear and the model would collapse to a single-factor specification.

This representation immediately delivers a covariance decomposition for any asset return r^i_{t+1}: \operatorname{Cov}_t(m_{t+1}, r^i_{t+1}) = \beta\left[-rra_t(1-\delta_t)\,\operatorname{Cov}_t(r^w_{t+1}, r^i_{t+1}) - h_t'\,\operatorname{Cov}_t(\Delta z_{t+1}, r^i_{t+1})\right].

Risk Premia and Portfolio Demand

Multi-Factor Risk Premia

The SDF pricing condition \operatorname{E}_t[m_{t+1} r^{e,i}_{t+1}] = 0 and the covariance decomposition above imply \operatorname{E}_t\, r^{e,i}_{t+1} = \beta^{w,i}_t\,\lambda^w_t + \left(\beta^{z,i}_t\right)'\lambda^h_t, where the betas measure asset i’s exposure to each source of risk, \beta^{w,i}_t = \frac{\operatorname{Cov}_t(r^i_{t+1}, r^w_{t+1})}{\sigma^2_t(r^w_{t+1})}, \qquad \beta^{z,i}_t = \frac{\operatorname{Cov}_t(r^i_{t+1}, \Delta z_{t+1})}{\sigma^2_t(\Delta z_{t+1})}, and the risk prices reward each unit of exposure: \lambda^w_t = \beta\, rra_t(1-\delta_t)\,\sigma^2_t(r^w_{t+1}), \qquad \lambda^h_t = \beta\, h_t\,\sigma^2_t(\Delta z_{t+1}).

This is Merton’s Intertemporal CAPM (ICAPM) (Merton 1973): expected excess returns are driven by two types of betas. The first, \beta^{w,i}_t, measures the traditional covariance with the market portfolio (the “wealth beta”). The second, \beta^{z,i}_t, measures covariance with state variable innovations — assets that hedge against deteriorating investment opportunities earn lower expected returns because investors value them precisely when opportunities are scarce. In discrete-time empirical work, this state-variable perspective is closely related to the intertemporal pricing formulation in Campbell (1993).

Merton, Robert C. 1973. “An Intertemporal Capital Asset Pricing Model.” Econometrica 41 (5): 867–87.
Campbell, John Y. 1993. “Intertemporal Asset Pricing Without Consumption Data.” American Economic Review, 487–512.

Optimal Portfolio Demand

To find the optimal portfolio \pmb{\alpha}_t, note that \operatorname{Cov}_t(r^w_{t+1}, r^i_{t+1}) = \pmb{\Sigma}_t \pmb{\alpha}_t where \pmb{\Sigma}_t = \operatorname{Cov}_t(\mathbf{r}_{t+1}) is the conditional covariance matrix of risky returns. Let \pmb{\mu}^e_t = \operatorname{E}_t(\mathbf{r}^{\,e}_{t+1}) and \pmb{\Sigma}^{z,r}_t = \operatorname{Cov}_t(\Delta z_{t+1}, \mathbf{r}_{t+1}) (a matrix whose rows correspond to state variables and columns to assets). The expected excess return can be written in vector form as \pmb{\mu}^e_t = \beta\, rra_t(1-\delta_t)\,\pmb{\Sigma}_t \pmb{\alpha}_t + \beta\, \pmb{\Sigma}^{z,r\prime}_t h_t.

Solving for the portfolio weights gives \pmb{\alpha}_t = \frac{1}{\beta\, rra_t(1-\delta_t)}\left(\pmb{\Sigma}_t^{-1}\pmb{\mu}^e_t - \beta\,\pmb{\Sigma}_t^{-1}\pmb{\Sigma}^{z,r\prime}_t h_t\right).

This expression separates naturally into a mean-variance component and a hedging component. The first term, \frac{1}{\beta\, rra_t(1-\delta_t)}\pmb{\Sigma}_t^{-1}\pmb{\mu}^e_t, is the familiar tangency-portfolio demand, scaled inversely by risk aversion and by the share of wealth that is saved. It is exactly the term that would survive in a static model with a fixed investment opportunity set.

The second term, -\frac{1}{rra_t(1-\delta_t)}\pmb{\Sigma}_t^{-1}\pmb{\Sigma}^{z,r\prime}_t h_t, is the hedging demand. It reflects positions taken not because they have high expected returns, but because they help offset adverse movements in the state variables. The vector \pmb{\Sigma}_t^{-1}\pmb{\Sigma}^{z,r\prime}_t h_t identifies the portfolio that best spans innovations in investment opportunities, while the sign and magnitude of h_t determine how aggressively the investor wants that hedge. If h_t = \mathbf{0} — either because there are no state variables or because V_{Wz} = 0 — this term disappears and the standard mean-variance solution is recovered.