A time series comprises observations taken at a sequence of time
points.
In most situations these time points will be (approximately) evenly
spaced. For example:
- Monthly rainfall figures
- Quarterly unemployment rates
Linear models can be applied to time series data, but typically it
will prove necessary to generalize the structure of the error terms from
the simple independent errors that we have assumed so far.
Tourism in Victoria
Data are number of room nights occupied in hotels, motels and
guesthouses in Victoria. Observations are monthly from January 1980 to
December 1994. Data source: Australian Bureau of Statistics.
Download Motel.csv
## Tourism <- read.csv(file = "Motel.csv", header = TRUE)
str(Tourism)
'data.frame': 180 obs. of 4 variables:
$ Year : int 1980 1980 1980 1980 1980 1980 1980 1980 1980 1980 ...
$ Month : int 1 2 3 4 5 6 7 8 9 10 ...
$ RoomNights: int 276986 260633 291551 275383 275302 231693 238829 274215 277808 299060 ...
$ AvePrice : num 27.7 28.7 28.6 28.3 28.7 ...
Tourism$Yr = Tourism$Year - 1979 + Tourism$Month/12
head(Tourism)
Year Month RoomNights AvePrice Yr
1 1980 1 276986 27.70 1.083333
2 1980 2 260633 28.67 1.166667
3 1980 3 291551 28.60 1.250000
4 1980 4 275383 28.34 1.333333
5 1980 5 275302 28.66 1.416667
6 1980 6 231693 28.57 1.500000
Note that Month
is coded 1, 2, ..., 12
and
is currently an integer valued variable.
Yr
is constructed to represent a “fractional year after
1979” (accounting for month).
We will use the average price variable in the practical exercise for
this week.
Tourism$NYear = 1979 + Tourism$Yr # to get nicely spaced points along x axis
plot(RoomNights ~ NYear, xlab = "Year", type = "l", data = Tourism)
Variation in a Time Series
Possible sources of variation in a time series are:
- Secular trend (or just trend): tendency of
the series to increase or decrease over a long period of time.
- Seasonal variation: describes fluctuations
that recur during specific parts of the year (e.g. quarterly or
monthly).
- Residual variation (or innovations): the
part of the variation which is not explained by long term trend or
seasonal effects.
- An additional cyclical source of variation
(corresponding to business cycles, for example) is sometimes
identified.
Modelling A Time Series
Time series data can be modelled using linear models (although there
are a number of alternative approaches).
- Long term trend can be modelled using polynomial regression.
- Seasonal effects can be represented by specifying the seasons
(e.g. months, quarters) as a factor in the model.
- Additional covariates can sometimes be incorporated in such models
(e.g. standard economic indicators may be included to help explain
variation in sales data)
Back to the Tourism Data
Model Fitting and ANOVA
Tourism$Month <- factor(Tourism$Month)
Tourism.lm <- lm(RoomNights ~ Yr + Month, data = Tourism)
anova(Tourism.lm)
Analysis of Variance Table
Response: RoomNights
Df Sum Sq Mean Sq F value Pr(>F)
Yr 1 5.4099e+11 5.4099e+11 2494.398 < 2.2e-16 ***
Month 11 1.5589e+11 1.4172e+10 65.346 < 2.2e-16 ***
Residuals 167 3.6219e+10 2.1688e+08
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Call:
lm(formula = RoomNights ~ Yr + Month, data = Tourism)
Residuals:
Min 1Q Median 3Q Max
-37721 -8900 -1826 7931 48624
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 261275.9 4321.6 60.458 < 2e-16 ***
Yr 12705.7 254.1 50.010 < 2e-16 ***
Month2 -30740.9 5377.5 -5.717 4.90e-08 ***
Month3 24115.9 5377.7 4.484 1.35e-05 ***
Month4 -1464.5 5377.9 -0.272 0.785712
Month5 -18682.5 5378.2 -3.474 0.000654 ***
Month6 -65076.5 5378.5 -12.099 < 2e-16 ***
Month7 -43764.3 5379.0 -8.136 8.89e-14 ***
Month8 -29006.2 5379.5 -5.392 2.35e-07 ***
Month9 -11274.0 5380.2 -2.095 0.037636 *
Month10 27159.2 5380.9 5.047 1.16e-06 ***
Month11 17231.1 5381.7 3.202 0.001635 **
Month12 -57892.7 5382.5 -10.756 < 2e-16 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 14730 on 167 degrees of freedom
Multiple R-squared: 0.9506, Adjusted R-squared: 0.947
F-statistic: 267.8 on 12 and 167 DF, p-value: < 2.2e-16
plot(RoomNights ~ NYear, xlab = "Year", data = Tourism)
lines(Tourism$NYear, fitted(Tourism.lm), col = "blue")
title("Fitted Linear Model")
More Model Fitting in R
Tourism.lm.2 <- lm(RoomNights ~ poly(Yr, 2) + Month, data = Tourism)
anova(Tourism.lm, Tourism.lm.2)
Analysis of Variance Table
Model 1: RoomNights ~ Yr + Month
Model 2: RoomNights ~ poly(Yr, 2) + Month
Res.Df RSS Df Sum of Sq F Pr(>F)
1 167 3.6219e+10
2 166 3.5762e+10 1 456501120 2.119 0.1474
No evidence that quadratic trend improves on linear
(P=0.1418).
Residuals Versus Time
plot(Tourism$Yr, resid(Tourism.lm), type = "l", xlab = "Year-1979")
The time plot of residuals for the linear trend model suggests that
there are extended periods when the residuals are almost all negative
(1986-1988) and extended periods where the residuals are almost all
positive (1989-1990).
Such behaviour should not be observed if the errors are
independent.
However, for time series data it is common for some residual
correlation between residuals to remain even when the trend and seasonal
variation has been removed.
This type of correlation in the sequence of residuals is usually
called autocorrelation.
Stationary Processes and Autocorrelation
Consider a random process in time: Zt where
t=1,2,… and Zt represents the value of the
process at time t.
This process is said to be (weakly) stationary if:
- E[Zt] and Var(Zt) do not
change with time t.
- The correlation Corr(Zt, Zt+k)
depends only on the time lag k.
It is common to model the residuals from a time series as a
stationary random process with zero mean.
For a stationary process, the autocorrelation
function (or ACF) is defined by \[\rho(k) = \mbox{Corr}(Z_t, Z_{t+k})\]
The ACF (autocorrelation against time lag) can be plotted in R using
the acf()
command.
Tourism Data: ACF Plot for Residuals
- The ACF plot indicates a correlation of about \(0.4\) at lag one. Hence consecutive
residuals are positively dependent.
- The dashed horizontal lines on the plot are a 95% confidence
interval under the assumption that the true autocorrelation is
zero.
- Any correlation lying within this confidence interval may be just
noise.
- Any correlation lying outside this confidence interval is probably
indicative of true serial dependence in the data.
For the tourism data it seems that there is serial dependence in the
data, since the correlations at lags 1, 2, 3 and 4 all extend beyond the
confidence interval bounds.
The existence of several significant correlations is common when
autocorrelation exists. If the correlation between the
ith and jth variables is high,
and so is the correlation between the jth and
kth variables, we ought to expect the correlation
between the ith and kth
variables to also be high. When we are thinking about time series, the
correlation for data from lags 0 and 1 is the same as the data from lags
1 and 2 because they are both sets formed by pairs of successive
observations. If observations are correlated with their preceding
observations, and those preceding observations are correlated with their
preceding observations, then there is likely to be a correlation between
observations and those that are back two time steps. This logic
continues to three, four, and greater time lags and is especially likely
when the correlation at the first lag is very, very high.
LS0tDQp0aXRsZTogIkxlY3R1cmUgMzQ6IEludHJvZHVjdGlvbiB0byBMaW5lYXIgTW9kZWxsaW5nIGZvciBUaW1lIFNlcmllcyINCnN1YnRpdGxlOiAxNjEuMjUxIFJlZ3Jlc3Npb24gTW9kZWxsaW5nDQphdXRob3I6ICJQcmVzZW50ZWQgYnkgSm9uYXRoYW4gTWFyc2hhbGwgPEouQy5tYXJzaGFsbEBtYXNzZXkuYWMubno+IiAgDQpkYXRlOiAiV2VlayAxMiBvZiBTZW1lc3RlciAyLCBgciBsdWJyaWRhdGU6OnllYXIobHVicmlkYXRlOjpub3coKSlgIg0Kb3V0cHV0Og0KICBodG1sX2RvY3VtZW50Og0KICAgIGNvZGVfZG93bmxvYWQ6IHRydWUNCiAgICB0aGVtZTogeWV0aQ0KICAgIGhpZ2hsaWdodDogdGFuZ28NCiAgaHRtbF9ub3RlYm9vazoNCiAgICBjb2RlX2Rvd25sb2FkOiB0cnVlDQogICAgdGhlbWU6IHlldGkNCiAgICBoaWdobGlnaHQ6IHRhbmdvDQogIGJlYW1lcl9wcmVzZW50YXRpb246IGRlZmF1bHQNCiAgd29yZF9kb2N1bWVudDogZGVmYXVsdA0KICBpb3NsaWRlc19wcmVzZW50YXRpb246IGRlZmF1bHQNCiAgc2xpZHlfcHJlc2VudGF0aW9uOiANCiAgICB0aGVtZTogeWV0aQ0KICAgIGhpZ2hsaWdodDogdGFuZ28NCiAgcGRmX2RvY3VtZW50OiBkZWZhdWx0DQotLS0NCg0KDQoNCg0KPCEtLS0gRGF0YSBpcyBvbg0KaHR0cHM6Ly9yLXJlc291cmNlcy5tYXNzZXkuYWMubnovZGF0YS8xNjEyNTEvDQotLS0+DQoNCmBgYHtyIHNldHVwLCBwdXJsPUZBTFNFLCBpbmNsdWRlPUZBTFNFfQ0KbGlicmFyeShrbml0cikNCm9wdHNfY2h1bmskc2V0KGRldj1jKCJwbmciLCAicGRmIikpDQpvcHRzX2NodW5rJHNldChmaWcuaGVpZ2h0PTYsIGZpZy53aWR0aD03LCBmaWcucGF0aD0iRmlndXJlcy8iLCBmaWcuYWx0PSJ1bmxhYmVsbGVkIikNCm9wdHNfY2h1bmskc2V0KGNvbW1lbnQ9IiIsIGZpZy5hbGlnbj0iY2VudGVyIiwgdGlkeT1UUlVFKQ0Kb3B0aW9ucyhrbml0ci5rYWJsZS5OQSA9ICcnKQ0KbGlicmFyeSh0aWR5dmVyc2UpDQpsaWJyYXJ5KGJyb29tKQ0KYGBgDQoNCg0KPCEtLS0gRG8gbm90IGVkaXQgYW55dGhpbmcgYWJvdmUgdGhpcyBsaW5lLiAtLS0+DQoNCg0KQSB0aW1lIHNlcmllcyBjb21wcmlzZXMgb2JzZXJ2YXRpb25zIHRha2VuIGF0IGEgc2VxdWVuY2Ugb2YgdGltZSBwb2ludHMuDQoNCkluIG1vc3Qgc2l0dWF0aW9ucyB0aGVzZSB0aW1lIHBvaW50cyB3aWxsIGJlIChhcHByb3hpbWF0ZWx5KSBldmVubHkgc3BhY2VkLiBGb3IgZXhhbXBsZToNCiAgICANCi0gTW9udGhseSByYWluZmFsbCBmaWd1cmVzDQotIFF1YXJ0ZXJseSB1bmVtcGxveW1lbnQgcmF0ZXMNCg0KTGluZWFyIG1vZGVscyBjYW4gYmUgYXBwbGllZCB0byB0aW1lIHNlcmllcyBkYXRhLCBidXQgdHlwaWNhbGx5IGl0DQogICAgd2lsbCBwcm92ZSBuZWNlc3NhcnkgdG8gZ2VuZXJhbGl6ZSB0aGUgc3RydWN0dXJlIG9mIHRoZSBlcnJvciB0ZXJtcw0KICAgIGZyb20gdGhlIHNpbXBsZSBpbmRlcGVuZGVudCBlcnJvcnMgdGhhdCB3ZSBoYXZlIGFzc3VtZWQgc28gZmFyLg0KDQojIyBUb3VyaXNtIGluIFZpY3RvcmlhDQoNCg0KDQoNCg0KRGF0YSBhcmUgbnVtYmVyIG9mIHJvb20gbmlnaHRzIG9jY3VwaWVkIGluIGhvdGVscywgbW90ZWxzIGFuZCBndWVzdGhvdXNlcyBpbiBWaWN0b3JpYS4NCk9ic2VydmF0aW9ucyBhcmUgbW9udGhseSBmcm9tIEphbnVhcnkgMTk4MCB0byBEZWNlbWJlciAxOTk0Lg0KRGF0YSBzb3VyY2U6IEF1c3RyYWxpYW4gQnVyZWF1IG9mIFN0YXRpc3RpY3MuDQoNCiFbQSBtb3RlbCByb29tLl0oLi4vZ3JhcGhpY3MvbW90ZWwuanBnKSBbXSgpDQoNCg0KYHIgeGZ1bjo6ZW1iZWRfZmlsZSgiLi4vLi4vZGF0YS9Nb3RlbC5jc3YiKWANCg0KDQpgYGB7ciBnZXRUb3VyaXNtRGF0YSwgZWNobz0tMSwgZXZhbD0tMn0NClRvdXJpc20gPC0gcmVhZC5jc3YoZmlsZT0iLi4vLi4vZGF0YS9Nb3RlbC5jc3YiLCBoZWFkZXI9VFJVRSkNClRvdXJpc20gPC0gcmVhZC5jc3YoZmlsZT0iTW90ZWwuY3N2IiwgaGVhZGVyPVRSVUUpDQpzdHIoVG91cmlzbSkNClRvdXJpc20kWXIgPSBUb3VyaXNtJFllYXIgLTE5NzkgKyBUb3VyaXNtJE1vbnRoLzEyDQpoZWFkKFRvdXJpc20pDQpgYGANCg0KTm90ZSB0aGF0IGBNb250aGAgaXMgY29kZWQgYDEsIDIsIC4uLiwgMTJgIGFuZCBpcyBjdXJyZW50bHkgYW4gaW50ZWdlciB2YWx1ZWQgdmFyaWFibGUuDQoNCmBZcmAgaXMgY29uc3RydWN0ZWQgdG8gcmVwcmVzZW50IGEgImZyYWN0aW9uYWwgeWVhciBhZnRlciAxOTc5IiAoYWNjb3VudGluZyBmb3INCiAgICBtb250aCkuIA0KDQoNCldlIHdpbGwgdXNlIHRoZSBhdmVyYWdlIHByaWNlIHZhcmlhYmxlIGluIHRoZSBwcmFjdGljYWwgZXhlcmNpc2UgZm9yIHRoaXMgd2Vlay4NCg0KYGBge3IgcGxvdFRvdXJpc20sIGZpZy5jYXA9IlRoZXJlIGlzIGNsZWFyIGV2aWRlbmNlIG9mIHVwd2FyZCB0cmVuZCBhbmQgc2Vhc29uYWwgKG1vbnRobHkpIHZhcmlhdGlvbi4ifQ0KVG91cmlzbSROWWVhciA9IDE5NzkgKyBUb3VyaXNtJFlyICMgdG8gZ2V0IG5pY2VseSBzcGFjZWQgcG9pbnRzIGFsb25nIHggYXhpcw0KcGxvdChSb29tTmlnaHRzfk5ZZWFyLCB4bGFiPSAiWWVhciIsIHR5cGU9ImwiLCBkYXRhPVRvdXJpc20pDQpgYGANCg0KDQojIyBWYXJpYXRpb24gaW4gYSBUaW1lIFNlcmllcw0KDQpQb3NzaWJsZSBzb3VyY2VzIG9mIHZhcmlhdGlvbiBpbiBhIHRpbWUgc2VyaWVzIGFyZToNCiAgICANCi0gKioqU2VjdWxhciB0cmVuZCoqKiAob3IganVzdCB0cmVuZCk6ICB0ZW5kZW5jeSBvZiB0aGUgc2VyaWVzIHRvIGluY3JlYXNlIG9yIGRlY3JlYXNlIG92ZXIgYSBsb25nDQogICAgICAgIHBlcmlvZCBvZiB0aW1lLg0KLSAqKipTZWFzb25hbCB2YXJpYXRpb24qKio6IGRlc2NyaWJlcyBmbHVjdHVhdGlvbnMgdGhhdA0KICAgICAgICByZWN1ciBkdXJpbmcgc3BlY2lmaWMgcGFydHMgb2YgdGhlIHllYXIgKGUuZy4gcXVhcnRlcmx5IG9yDQogICAgICAgIG1vbnRobHkpLg0KLSBSZXNpZHVhbCB2YXJpYXRpb24gKG9yICoqKmlubm92YXRpb25zKioqKTogdGhlIHBhcnQgb2YgdGhlIHZhcmlhdGlvbiB3aGljaCBpcyBub3QgZXhwbGFpbmVkIGJ5IGxvbmcgdGVybSAgICAgICAgdHJlbmQgb3Igc2Vhc29uYWwgZWZmZWN0cy4NCi0gQW4gYWRkaXRpb25hbCAqKipjeWNsaWNhbCoqKiBzb3VyY2Ugb2YgdmFyaWF0aW9uIChjb3JyZXNwb25kaW5nIHRvIGJ1c2luZXNzIGN5Y2xlcywgZm9yIGV4YW1wbGUpIGlzIHNvbWV0aW1lcw0KICAgIGlkZW50aWZpZWQuDQoNCiMjIE1vZGVsbGluZyBBIFRpbWUgU2VyaWVzDQoNClRpbWUgc2VyaWVzIGRhdGEgY2FuIGJlIG1vZGVsbGVkIHVzaW5nIGxpbmVhciBtb2RlbHMgKGFsdGhvdWdoIHRoZXJlDQogICAgYXJlIGEgbnVtYmVyIG9mIGFsdGVybmF0aXZlIGFwcHJvYWNoZXMpLg0KDQotIExvbmcgdGVybSB0cmVuZCBjYW4gYmUgbW9kZWxsZWQgdXNpbmcgcG9seW5vbWlhbCByZWdyZXNzaW9uLg0KLSBTZWFzb25hbCBlZmZlY3RzIGNhbiBiZSByZXByZXNlbnRlZCBieSBzcGVjaWZ5aW5nIHRoZSBzZWFzb25zIChlLmcuIG1vbnRocywgcXVhcnRlcnMpIGFzIGEgZmFjdG9yIGluIHRoZSBtb2RlbC4NCi0gQWRkaXRpb25hbCBjb3ZhcmlhdGVzIGNhbiBzb21ldGltZXMgYmUgaW5jb3Jwb3JhdGVkIGluIHN1Y2ggbW9kZWxzIChlLmcuIHN0YW5kYXJkIGVjb25vbWljIGluZGljYXRvcnMgbWF5IGJlIGluY2x1ZGVkIHRvIGhlbHAgZXhwbGFpbiAgIHZhcmlhdGlvbiBpbiBzYWxlcyBkYXRhKQ0KDQojIyBCYWNrIHRvIHRoZSBUb3VyaXNtIERhdGENCg0KIyMjIE1vZGVsIEZpdHRpbmcgYW5kIEFOT1ZBDQoNCmBgYHtyIFRvdXJpc20ubG19DQpUb3VyaXNtJE1vbnRoIDwtIGZhY3RvcihUb3VyaXNtJE1vbnRoKQ0KVG91cmlzbS5sbSA8LSBsbShSb29tTmlnaHRzfllyICsgTW9udGgsIGRhdGE9VG91cmlzbSkNCmFub3ZhKFRvdXJpc20ubG0pDQpgYGANCg0KDQpgYGB7ciBzdW1Ub3VyaXNtLmxtfQ0Kc3VtbWFyeShUb3VyaXNtLmxtKQ0KYGBgDQoNCg0KYGBge3IgRml0c1Bsb3RUb3VyaXNtLmxtfQ0KcGxvdChSb29tTmlnaHRzfk5ZZWFyLCB4bGFiPSJZZWFyIiwgZGF0YT1Ub3VyaXNtKQ0KbGluZXMoVG91cmlzbSROWWVhciwgZml0dGVkKFRvdXJpc20ubG0pLCBjb2w9ImJsdWUiKQ0KdGl0bGUoIkZpdHRlZCBMaW5lYXIgTW9kZWwiKQ0KYGBgDQoNCiMjIyBDb21tZW50cw0KDQpgWXJgICh3aGljaCBpcyBlc3NlbnRpYWxseSB0aW1lKSBpcyB0aGUgYXBwcm9wcmlhdGUNCiAgICBjb3ZhcmlhdGUgdG8gdHJhY2sgdHJlbmQgKG5vdCBgWWVhcmAsIHdoaWNoIHdvdWxkIGlnbm9yZQ0KICAgIHNlY3VsYXIgdHJlbmQgZHVyaW5nIGEgeWVhcikuDQoNCkl0IGlzIGltcG9ydGFudCB0byByZW1lbWJlciB0byBjb2RlIGBNb250aGAgYXMgYSBmYWN0b3Igc28gYXMNCiAgICB0byByZXByZXNlbnQgc2Vhc29uYWwgZWZmZWN0cy4NCg0KVGhlcmUgaXMgc3Ryb25nIGV2aWRlbmNlIG9mIHRyZW5kICgkUCA8IDIuMiBcdGltZXMgMTBeey0xNn0kIGZvcg0KICAgIGBZcmApIGFuZCBvZiBzZWFzb25hbGl0eSAoJFAgPCAyLjIgXHRpbWVzIDEwXnstMTZ9JCBmb3IgYE1vbnRoYCkgaW4gdGhlIGRhdGEuDQoNCkFzIG1pZ2h0IGJlIGV4cGVjdGVkLCByb29tIGJvb2tpbmdzIHRlbmQgdG8gYmUgbG93IGluIHRoZSB3aW50ZXINCiAgICBtb250aHMuIFRoZSBwYXR0ZXJuIG92ZXIgc3VtbWVyIGlzIGxlc3MgY2xlYXIuIFBlcmhhcHMgRGVjZW1iZXIgaXMgYQ0KICAgIGJhZCBtb250aCBiZWNhdXNlIG9mIHRoZSBDaHJpc3RtYXMgZWZmZWN0Pw0KDQpXZSBoYXZlIGFzc3VtZWQgYSBsaW5lYXIgc2VjdWxhciB0cmVuZC4gV291bGQgcXVhZHJhdGljIGJlIGJldHRlcj8NCg0KIyMjIE1vcmUgTW9kZWwgRml0dGluZyBpbiBSDQoNCmBgYHtyIFRvdXJpc20ubG0uMn0NClRvdXJpc20ubG0uMiA8LSBsbShSb29tTmlnaHRzfnBvbHkoWXIsMikrTW9udGgsIGRhdGE9VG91cmlzbSkNCmFub3ZhKFRvdXJpc20ubG0sIFRvdXJpc20ubG0uMikNCmBgYA0KDQpObyBldmlkZW5jZSB0aGF0IHF1YWRyYXRpYyB0cmVuZCBpbXByb3ZlcyBvbiBsaW5lYXIgKCpQPTAuMTQxOCopLg0KDQojIyMgUmVzaWR1YWxzIFZlcnN1cyBUaW1lDQoNCg0KYGBge3IgVG91cmlzbVBsb3R9DQpwbG90KFRvdXJpc20kWXIsIHJlc2lkKFRvdXJpc20ubG0pLCB0eXBlPSJsIiwgeGxhYj0iWWVhci0xOTc5IikNCmBgYA0KDQoNCg0KVGhlIHRpbWUgcGxvdCBvZiByZXNpZHVhbHMgZm9yIHRoZSBsaW5lYXIgdHJlbmQgbW9kZWwgc3VnZ2VzdHMgdGhhdA0KICAgIHRoZXJlIGFyZSBleHRlbmRlZCBwZXJpb2RzIHdoZW4gdGhlIHJlc2lkdWFscyBhcmUgYWxtb3N0IGFsbA0KICAgIG5lZ2F0aXZlICgxOTg2LTE5ODgpIGFuZCBleHRlbmRlZCBwZXJpb2RzIHdoZXJlIHRoZSByZXNpZHVhbHMgYXJlDQogICAgYWxtb3N0IGFsbCBwb3NpdGl2ZSAoMTk4OS0xOTkwKS4NCg0KU3VjaCBiZWhhdmlvdXIgc2hvdWxkIG5vdCBiZSBvYnNlcnZlZCBpZiB0aGUgZXJyb3JzIGFyZSBpbmRlcGVuZGVudC4NCg0KSG93ZXZlciwgZm9yIHRpbWUgc2VyaWVzIGRhdGEgaXQgaXMgY29tbW9uIGZvciBzb21lIHJlc2lkdWFsDQogICAgY29ycmVsYXRpb24gYmV0d2VlbiByZXNpZHVhbHMgdG8gcmVtYWluIGV2ZW4gd2hlbiB0aGUgdHJlbmQgYW5kDQogICAgc2Vhc29uYWwgdmFyaWF0aW9uIGhhcyBiZWVuIHJlbW92ZWQuDQoNClRoaXMgdHlwZSBvZiBjb3JyZWxhdGlvbiBpbiB0aGUgc2VxdWVuY2Ugb2YgcmVzaWR1YWxzIGlzIHVzdWFsbHkgICAgIGNhbGxlZCAqKmF1dG9jb3JyZWxhdGlvbioqLg0KDQojIyBTdGF0aW9uYXJ5IFByb2Nlc3NlcyBhbmQgQXV0b2NvcnJlbGF0aW9uDQoNCkNvbnNpZGVyIGEgcmFuZG9tIHByb2Nlc3MgaW4gdGltZTogKlp+dH4qIHdoZXJlICp0PTEsMiwuLi4qIGFuZCAqWn50fiogcmVwcmVzZW50cyB0aGUgdmFsdWUgb2YgdGhlIHByb2Nlc3MgYXQgdGltZSAqdCouDQoNClRoaXMgcHJvY2VzcyBpcyBzYWlkIHRvIGJlICh3ZWFrbHkpIHN0YXRpb25hcnkgaWY6DQogICAgDQotICpFW1p+dH5dKiBhbmQgKlZhcihafnR+KSogZG8gbm90IGNoYW5nZSB3aXRoIHRpbWUgICAqdCouDQotIFRoZSBjb3JyZWxhdGlvbiAqQ29ycihafnR+LCBafnQra34pKiBkZXBlbmRzIG9ubHkgb24gdGhlICAgICAgdGltZSBsYWcgKmsqLg0KDQpJdCBpcyBjb21tb24gdG8gbW9kZWwgdGhlIHJlc2lkdWFscyBmcm9tIGEgdGltZSBzZXJpZXMgYXMgYQ0KICAgIHN0YXRpb25hcnkgcmFuZG9tIHByb2Nlc3Mgd2l0aCB6ZXJvIG1lYW4uDQoNCkZvciBhIHN0YXRpb25hcnkgcHJvY2VzcywgdGhlICoqYXV0b2NvcnJlbGF0aW9uIGZ1bmN0aW9uKiogKG9yIEFDRikgaXMgZGVmaW5lZCBieQ0KICAgICQkXHJobyhrKSA9IFxtYm94e0NvcnJ9KFpfdCwgWl97dCtrfSkkJA0KDQpUaGUgQUNGIChhdXRvY29ycmVsYXRpb24gYWdhaW5zdCB0aW1lIGxhZykgY2FuIGJlIHBsb3R0ZWQgaW4gUiB1c2luZyAgICB0aGUgYGFjZigpYCBjb21tYW5kLg0KDQojIyMgVG91cmlzbSBEYXRhOiBBQ0YgUGxvdCBmb3IgUmVzaWR1YWxzDQoNCmBgYHtyIFRvdXJpc21BQ0Z9DQphY2YocmVzaWQoVG91cmlzbS5sbSkpDQpgYGANCg0KDQotIFRoZSBBQ0YgcGxvdCBpbmRpY2F0ZXMgYSBjb3JyZWxhdGlvbiBvZiBhYm91dCAkMC40JCBhdCBsYWcgb25lLg0KICAgIEhlbmNlIGNvbnNlY3V0aXZlIHJlc2lkdWFscyBhcmUgcG9zaXRpdmVseSBkZXBlbmRlbnQuDQotIFRoZSBkYXNoZWQgaG9yaXpvbnRhbCBsaW5lcyBvbiB0aGUgcGxvdCBhcmUgYSA5NSUgY29uZmlkZW5jZSBpbnRlcnZhbA0KICAgIHVuZGVyIHRoZSBhc3N1bXB0aW9uIHRoYXQgdGhlIHRydWUgYXV0b2NvcnJlbGF0aW9uIGlzIHplcm8uDQotIEFueSBjb3JyZWxhdGlvbiBseWluZyB3aXRoaW4gdGhpcyBjb25maWRlbmNlIGludGVydmFsIG1heSBiZSBqdXN0DQogICAgbm9pc2UuDQotIEFueSBjb3JyZWxhdGlvbiBseWluZyBvdXRzaWRlIHRoaXMgY29uZmlkZW5jZSBpbnRlcnZhbCBpcyBwcm9iYWJseQ0KICAgIGluZGljYXRpdmUgb2YgdHJ1ZSBzZXJpYWwgZGVwZW5kZW5jZSBpbiB0aGUgZGF0YS4NCg0KRm9yIHRoZSB0b3VyaXNtIGRhdGEgaXQgc2VlbXMgdGhhdCB0aGVyZSBpcyBzZXJpYWwgZGVwZW5kZW5jZSBpbiB0aGUNCiAgICBkYXRhLCBzaW5jZSB0aGUgY29ycmVsYXRpb25zIGF0IGxhZ3MgMSwgMiwgMyBhbmQgNCBhbGwgZXh0ZW5kIGJleW9uZA0KICAgIHRoZSBjb25maWRlbmNlIGludGVydmFsIGJvdW5kcy4NCg0KVGhlIGV4aXN0ZW5jZSBvZiBzZXZlcmFsIHNpZ25pZmljYW50IGNvcnJlbGF0aW9ucyBpcyBjb21tb24gd2hlbiBhdXRvY29ycmVsYXRpb24gZXhpc3RzLiBJZiB0aGUgY29ycmVsYXRpb24gYmV0d2VlbiAgdGhlICppKl50aF4gYW5kICpqKl50aF4gdmFyaWFibGVzIGlzIGhpZ2gsIGFuZCBzbyBpcyB0aGUgY29ycmVsYXRpb24gYmV0d2VlbiB0aGUgKmoqXnRoXiBhbmQgKmsqXnRoXiB2YXJpYWJsZXMsIHdlIG91Z2h0IHRvIGV4cGVjdCB0aGUgY29ycmVsYXRpb24gYmV0d2VlbiB0aGUgKmkqXnRoXiBhbmQgKmsqXnRoXiB2YXJpYWJsZXMgdG8gYWxzbyBiZSBoaWdoLiBXaGVuIHdlIGFyZSB0aGlua2luZyBhYm91dCB0aW1lIHNlcmllcywgdGhlIGNvcnJlbGF0aW9uIGZvciBkYXRhIGZyb20gbGFncyAwIGFuZCAxICBpcyB0aGUgc2FtZSBhcyB0aGUgZGF0YSBmcm9tIGxhZ3MgMSBhbmQgMiBiZWNhdXNlIHRoZXkgYXJlIGJvdGggc2V0cyBmb3JtZWQgYnkgcGFpcnMgb2Ygc3VjY2Vzc2l2ZSBvYnNlcnZhdGlvbnMuIElmIG9ic2VydmF0aW9ucyBhcmUgY29ycmVsYXRlZCB3aXRoIHRoZWlyIHByZWNlZGluZyBvYnNlcnZhdGlvbnMsIGFuZCB0aG9zZSBwcmVjZWRpbmcgb2JzZXJ2YXRpb25zIGFyZSBjb3JyZWxhdGVkIHdpdGggdGhlaXIgcHJlY2VkaW5nIG9ic2VydmF0aW9ucywgdGhlbiB0aGVyZSBpcyBsaWtlbHkgdG8gYmUgYSBjb3JyZWxhdGlvbiBiZXR3ZWVuIG9ic2VydmF0aW9ucyBhbmQgdGhvc2UgdGhhdCBhcmUgYmFjayB0d28gdGltZSBzdGVwcy4gVGhpcyBsb2dpYyBjb250aW51ZXMgdG8gdGhyZWUsIGZvdXIsIGFuZCBncmVhdGVyIHRpbWUgbGFncyBhbmQgaXMgZXNwZWNpYWxseSBsaWtlbHkgd2hlbiB0aGUgY29ycmVsYXRpb24gYXQgdGhlIGZpcnN0IGxhZyBpcyB2ZXJ5LCB2ZXJ5IGhpZ2guDQoNCg0K
Comments
Yr
(which is essentially time) is the appropriate covariate to track trend (notYear
, which would ignore secular trend during a year).It is important to remember to code
Month
as a factor so as to represent seasonal effects.There is strong evidence of trend (\(P < 2.2 \times 10^{-16}\) for
Yr
) and of seasonality (\(P < 2.2 \times 10^{-16}\) forMonth
) in the data.As might be expected, room bookings tend to be low in the winter months. The pattern over summer is less clear. Perhaps December is a bad month because of the Christmas effect?
We have assumed a linear secular trend. Would quadratic be better?