Derivation of the Formula for a 95% Prediction Interval for an AR(1) Forecast

Exploring the cinematic intuition of Derivation of the Formula for a 95% Prediction Interval for an AR(1) Forecast.

Visualizing...

Our institutional research engineers are currently mapping the formal proof for Derivation of the Formula for a 95% Prediction Interval for an AR(1) Forecast.

Apply for Institutional Early Access →

The Formal Theorem

Given a stationary AR(1) process Yt=c+ϕYt1+ϵt Y_t = c + \phi Y_{t-1} + \epsilon_t , where ϕ<1 |\phi| < 1 , c c is a constant, and ϵt \epsilon_t is an independent and identically distributed (i.i.d.) white noise process with E[ϵt]=0 E[\epsilon_t] = 0 and Var(ϵt)=σϵ2 Var(\epsilon_t) = \sigma_{\epsilon}^2 . Assuming ϵt \epsilon_t are normally distributed, the h h -step-ahead point forecast for Yt+h Y_{t+h} , based on observations up to time t t , is denoted by Y^t+ht \hat{Y}_{t+h|t} . The variance of the h h -step-ahead forecast error, et+h=Yt+hY^t+ht e_{t+h} = Y_{t+h} - \hat{Y}_{t+h|t} , is given by σh2=σϵ2j=0h1ϕ2j \sigma_h^2 = \sigma_{\epsilon}^2 \sum_{j=0}^{h-1} \phi^{2j} . A (1α)×100% (1-\alpha) \times 100\% prediction interval for Yt+h Y_{t+h} is:
Y^t+ht±zα/2σϵ2j=0h1ϕ2j \hat{Y}_{t+h|t} \pm z_{\alpha/2} \sqrt{\sigma_{\epsilon}^2 \sum_{j=0}^{h-1} \phi^{2j}}
For a 95% prediction interval, α=0.05 \alpha = 0.05 , and the critical value zα/2=z0.0251.96 z_{\alpha/2} = z_{0.025} \approx 1.96 .

Analytical Intuition.

Imagine you're a seasoned time-traveler, tasked with predicting the exact trajectory of a comet (our time series Yt Y_t ) known to follow a predictable orbital pattern but subject to unpredictable cosmic dust disturbances (the random shocks ϵt \epsilon_t ). You've observed its path perfectly up to a specific point in time, t t . Your mission is not just to estimate its future position h h days from now (your point forecast Y^t+ht \hat{Y}_{t+h|t} ), but to define a 'safe zone' – an ellipse within which you're 95% confident the comet will actually be found.
This 'safe zone' is our prediction interval. The width of this ellipse isn't static; it expands like a ripple in a cosmic pond. The further into the future you project (larger h h ), the more the uncertainty from those tiny cosmic dust disturbances accumulates. This cumulative uncertainty is precisely captured by the sum of squared ϕ \phi terms multiplied by σϵ2 \sigma_{\epsilon}^2 , representing the expanding 'cone of uncertainty'. The zα/2 z_{\alpha/2} factor is your 'confidence multiplier', defining how wide your safe zone needs to be to achieve that 95% certainty. It's about quantifying the unpredictable and providing a robust range for the future.
CAUTION

Institutional Warning.

A common pitfall is confusing a prediction interval (for a single future observation Yt+h Y_{t+h} ) with a confidence interval (for the conditional mean E[Yt+hFt] E[Y_{t+h}|\mathcal{F}_t] or a model parameter). Students also frequently omit or miscalculate the cumulative variance contribution from past innovations, especially for h>1 h > 1 .

Academic Inquiries.

01

Why is zα/2 z_{\alpha/2} used, and why is it approximately 1.96 for a 95% interval?

The zα/2 z_{\alpha/2} value comes from the standard normal distribution. When constructing a (1α)×100% (1-\alpha) \times 100\% prediction interval, we want to capture the central (1α) (1-\alpha) portion of the distribution. This means α/2 \alpha/2 probability is left in each tail. For a 95% interval, α=0.05 \alpha = 0.05 , so α/2=0.025 \alpha/2 = 0.025 . The value z0.025 z_{0.025} is the point such that P(Z>z0.025)=0.025 P(Z > z_{0.025}) = 0.025 for a standard normal random variable Z Z , which is approximately 1.96.

02

How does the autoregressive parameter ϕ \phi influence the width of the prediction interval?

The parameter ϕ \phi determines the persistence of the series. If ϕ \phi is close to 0, the process is closer to white noise, and the interval width quickly stabilizes to ±zα/2σϵ \pm z_{\alpha/2} \sigma_{\epsilon} . If ϕ |\phi| is large (closer to 1), the influence of past values is stronger, and the uncertainty accumulates more rapidly over time, leading to a much wider prediction interval for larger h h .

03

What happens to the prediction interval as h h approaches infinity for a stationary AR(1) process?

For a stationary AR(1) process (where ϕ<1 |\phi| < 1 ), as h h \to \infty , the point forecast Y^t+ht \hat{Y}_{t+h|t} converges to the unconditional mean of the process, E[Yt]=c/(1ϕ) E[Y_t] = c / (1-\phi) . The sum j=0h1ϕ2j \sum_{j=0}^{h-1} \phi^{2j} converges to 1/(1ϕ2) 1 / (1-\phi^2) . Thus, the forecast error variance converges to σϵ2/(1ϕ2) \sigma_{\epsilon}^2 / (1-\phi^2) , which is the unconditional variance of the AR(1) process, Var(Yt) Var(Y_t) . The prediction interval will stabilize around the unconditional mean with a width determined by the unconditional variance.

04

Does this formula account for the uncertainty in estimating model parameters c c , ϕ \phi , and σϵ2 \sigma_{\epsilon}^2 ?

No, this formula assumes that the model parameters c c , ϕ \phi , and σϵ2 \sigma_{\epsilon}^2 are known true values. In practice, these parameters are estimated from data, introducing an additional source of uncertainty. For finite samples, accounting for parameter estimation uncertainty would typically result in wider prediction intervals, often requiring the use of t-distributions instead of z-distributions, or more complex bootstrap methods, especially for smaller sample sizes.

Standardized References.

  • Definitive Institutional SourceShumway, R.H. & Stoffer, D.S., Time Series Analysis and Its Applications: With R Examples (4th ed.).

Institutional Citation

Reference this proof in your academic research or publications.

NICEFA Visual Mathematics. (2026). Derivation of the Formula for a 95% Prediction Interval for an AR(1) Forecast: Visual Proof & Intuition. Retrieved from https://nicefa.org/library/time-series-analysis/derivation-of-the-formula-for-a-95--prediction-interval-for-an-ar-1--forecast

Dominate the Logic.

"Abstract theory is just a movement we haven't seen yet."