Consider aus_airpassengers, the total number of passengers (in millions) from Australian air carriers for the period 1970-2011.
Use ARIMA() to find an appropriate ARIMA model. What model was selected. Check that the residuals look like white noise. Plot forecasts for the next 10 periods.
aus_airpassengers |>autoplot(Passengers)
fit <- aus_airpassengers |>model(arima =ARIMA(Passengers))report(fit)
Both containing increasing trends, but the ARIMA(0,2,1) model has an implicit trend due to the double-differencing, while the ARIMA(0,1,0) with drift models the trend directly via the trend coefficient.
The intervals are narrower when there are fewer differences.
Plot forecasts from an ARIMA(2,1,2) model with drift and compare these to part b. Remove the constant and see what happens.
Warning: Model specification induces a quadratic or higher order polynomial trend.
This is generally discouraged, consider removing the constant or reducing the number of differences.
The forecast trend is now quadratic, and there is a warning that this is generally a bad idea.
fpp3 9.11, Ex8
For the United States GDP series (from global_economy):
If necessary, find a suitable Box-Cox transformation for the data;
Notice the effect of the transformation on the forecasts. Increase the forecast horizon to see what happens. Notice also the width of the prediction intervals.
The point forecasts are similar, however the ETS forecast intervals are much wider.
fpp3 9.11, Ex15
Consider the number of Snowshoe Hare furs traded by the Hudson Bay Company between 1845 and 1935 (data set pelt).
Produce a time plot of the time series.
pelt |>autoplot(Hare)
Assume you decide to fit the following model: \[ y_t = c + \phi_1 y_{t-1} + \phi_2 y_{t-2} + \phi_3 y_{t-3} + \phi_4 y_{t-4} + \varepsilon_t, \] where \(\varepsilon_t\) is a white noise series. What sort of ARIMA model is this (i.e., what are \(p\), \(d\), and \(q\))?
This is an ARIMA(4,0,0), hence \(p=4\), \(d=0\) and \(q=0\).
By examining the ACF and PACF of the data, explain why this model is appropriate.
pelt |>gg_tsdisplay(Hare, plot="partial")
fit <- pelt |>model(AR4 =ARIMA(Hare ~pdq(4,0,0)))fit |>gg_tsresiduals()
The significant spike at lag 4 of the PACF indicates an AR(4).
The residuals from this model are clearly whhite noise.
fpp3 9.11, Ex16
The population of Switzerland from 1960 to 2017 is in data set global_economy.
You decide to fit the following model to the series: \[y_t = c + y_{t-1} + \phi_1 (y_{t-1} - y_{t-2}) + \phi_2 (y_{t-2} - y_{t-3}) + \phi_3( y_{t-3} - y_{t-4}) + \varepsilon_t\] where \(y_t\) is the Population in year \(t\) and \(\varepsilon_t\) is a white noise series. What sort of ARIMA model is this (i.e., what are \(p\), \(d\), and \(q\))?
This is an ARIMA(3,1,0), hence \(p=3\), \(d=1\) and \(q=0\).
Explain why this model was chosen using the ACF and PACF of the differenced series.
The significant spike at lag 3 in the PACF, coupled with the exponential decay in the ACF, for the differenced series, signals an AR(3) for the differenced series.