STA457_Lecture11
STA457_Lecture11
Lecture 11
Lijia Wang
Last Time:
1 Forecasting
2 Estimation
Today:
1 Integrated ARMA (ARIMA) models
2 Building ARIMA models
µt = µt−1 + νt ,
∇xt = νt + ∇yt ,
is stationary.
∇d xt = (1 − B)d xt
where δ = µ(1 − ϕ1 − ϕ2 − · · · − ϕp ).
n
For example, if d = 1, given forecasts yn+m for m = 1, 2, . . ., we have
n n n
yn+m = xn+m − xn+m−1 , so that
n n n
xn+m = yn+m + xn+m−1
n
with initial condition xn+1 n
= yn+1 + xn .
xt = xt−1 + wt − λwt−1 ,
with |λ| < 1, for t = 1, 2, . . ., and x0 = 0. We could also include a drift
term in the formula.
If we write
yt = wt − λwt−1 ,
we may write the IMA(1,1) as P xt = xt−1 + yt . Because |λ| < 1, yt has an
invertible representation, yt = ∞ j
j=1 λ yt−j + wt , and substituting
yt = xt − xt−1 , we may write
∞
X
xt = (1 − λ)λj−1 xt−j + wt .
j=1
There are a few basic steps to fitting ARIMA models to time series data.
These steps involve:
1 Plotting the data and interpreting the plot.
2 Possibly transforming the data (e.g., log, first difference).
3 Identifying the dependence orders of the model (p, d, q).
4 Parameter estimation.
5 Diagnostics of residuals (interpretation, normality assumptions, ACF
graphs).
6 Model choice.
xt − x̂ t−1
et = q t ,
P̂tt−1
When reports of GNP and similar economic indicators are given, it is often
in growth rate (percent change) rather than in actual (or adjusted) values
that is of interest. The growth rate, say, xt = ∇log(yt ), is plotted, and it
appears to be a stable process.
Figure: Sample ACF and PACF of the GNP quarterly growth rate
Inspecting the sample ACF and PACF, we might feel that two models are
suitable for the data:
1 The ACF is cutting off at lag 2 and the PACF is tailing off. This
would suggest the GNP growth rate follows an MA(2) process, or log
GNP follows an ARIMA(0,1,2) model.
2 The ACF is tailing off and the PACF is cutting off at lag 1. This
suggests an AR(1) model for the growth rate, or ARIMA(1,1,0) for
log GNP.
Rather than focus on one model, we will fit both models.
x̂t = .008(.001) +.303(.065) ŵt−1 +.204(.064) ŵt−2 +ŵt with σ̂w = .0094
The figure shows the p-values associated with the Ljung-Box Q-statistic,
at lags H = 3 through H = 20 (with corresponding degrees of freedom
H − 2).