Drift in the ARIMA Model
Explanation
When analyzing time series, one often comes across a coefficient called Drift.
Of course, in the case above, the coefficient is too small relative to its standard error to matter. However, if you do have a significant coefficient and need to write it into a formula, you need to understand what drift is. Unfortunately, good explanations of what drift actually is are hard to find domestically, and without formulas it can only be described as something ‘similar to a constant term’.
If you are not interested in the formulas and drift is not crucial to your analysis, knowing it as a constant term or an average should suffice; in that case, see the second analysis below. Of course, if you are majoring in the field, it is strongly recommended to follow the derivation at least once.
Mathematical Explanation
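Let’s start with the ARIMA model $ARIMA(p,d,q)$. No, let’s start with something simpler without differencing, the ARMA model $ARMA(p,q)$:
$$ Y_t = \phi_1 Y_{t-1} + \cdots + \phi_p Y_{t-p} + e_t - \theta_1 e_{t-1} - \cdots - \theta_q e_{t-q} $$
To simplify, let’s use the following expressions involving the backshift operator $B$, where $B Y_t = Y_{t-1}$:
$$ \phi(B) = 1 - \phi_1 B - \cdots - \phi_p B^p , \qquad \theta(B) = 1 - \theta_1 B - \cdots - \theta_q B^q $$
Applying these expressions to $ARMA(p,q)$ results in
$$ \phi(B) Y_t = \theta(B) e_t $$
This is another representation of the ARMA model, to which we add a constant term $\mu$ as follows.
$$ \phi(B) \left( Y_t - \mu \right) = \theta(B) e_t \qquad (1) $$
Here, $\mu$ in equation (1) is what we refer to as the Drift. Looking at the formula, you will understand why the drift is described as a ‘constant term’ or an ‘average’: $Y_t - \mu$ is a zero-mean ARMA process, so $\mu$ is the level around which $Y_t$ fluctuates. Of course, this is the ARMA model, and in the ARIMA model with differencing included, it appears differently. For convenience, consider only $ARIMA(p,1,q)$, where the difference is taken $d=1$ time.
$$ \phi(B) (1-B) \left( Y_t - \mu_0 - \mu_1 t \right) = \theta(B) e_t \qquad (2) $$
In equation (2), $\mu_0$ is called the Intercept, and $\mu_1$ is called the Drift. Dissecting the difference, with $\nabla = 1 - B$, reveals
$$ (1-B) \left( Y_t - \mu_0 - \mu_1 t \right) = \left( Y_t - Y_{t-1} \right) - \left( \mu_0 - \mu_0 \right) - \mu_1 \left( t - (t-1) \right) = \nabla Y_t - \mu_1 $$
so equation (2) becomes $\phi(B) \left( \nabla Y_t - \mu_1 \right) = \theta(B) e_t$.
Eventually, $\mu_1$ serves the same function for $\nabla Y_t$ as $\mu$ does for $Y_t$ in (1), therefore it’s logical to call it Drift. Although analyses requiring more than two differences are rare, this method can also be generalized for natural numbers $d$. [ NOTE: People with mathematical intuition might notice that taking a difference makes the constant term $\mu_0$ in (2) disappear, reminiscent of differentiation. ]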
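The disappearance of the intercept under differencing can be checked numerically. The following is a minimal sketch in base R, assuming a hypothetical deterministic trend with intercept mu0, slope mu1, and (for the twice-differenced case) a quadratic coefficient mu2; these names and values are illustrative only and do not come from the analysis in this post.

```r
# Hypothetical trend coefficients, chosen only for illustration
mu0 <- 5      # intercept
mu1 <- 0.3    # slope (plays the role of the drift when d = 1)
mu2 <- 0.1    # quadratic coefficient (relevant when d = 2)
t <- 1:10

# One difference of a linear trend: the intercept cancels, the slope remains
linear_trend <- mu0 + mu1 * t
d1 <- diff(linear_trend)
print(d1)     # every element equals mu1, up to floating-point error

# Two differences of a quadratic trend: only the constant 2 * mu2 remains
quadratic_trend <- mu0 + mu1 * t + mu2 * t^2
d2 <- diff(quadratic_trend, differences = 2)
print(d2)     # every element equals 2 * mu2, up to floating-point error
```

The factor 2 in the second case is one reason some formulations scale the trend term by 1/d!, so that the constant left after d differences is exactly the drift.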
Practice
Returning to the analysis. As you can see, or as expected, the model fitted to the original data and the model fitted to the differenced data are naturally the same. The only difference is that the coefficient labeled ‘drift’ in the first output is labeled ‘mean’ in the second. It would be better if you understood the formula, but even if you didn’t, you can guess that drift is something ‘similar to a constant term or an average’.
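The same correspondence can be reproduced on simulated data without the forecast package. The sketch below assumes that a drift term is equivalent to regressing on a time index, and uses base R's arima() on a hypothetical random walk with drift; true_drift, time_index, and the seed are illustrative choices, not part of the original analysis.

```r
set.seed(42)                        # arbitrary seed, for reproducibility only
n <- 300
true_drift <- 0.5                   # hypothetical drift of the simulated series
y <- cumsum(true_drift + rnorm(n))  # random walk with drift
time_index <- seq_along(y)

# ARIMA(0,1,0) on the original series, with the time index as a regressor:
# its coefficient plays the role of the drift
fit_level <- arima(y, order = c(0, 1, 0), xreg = time_index)

# ARMA(0,0) with a mean on the differenced series
fit_diff <- arima(diff(y), order = c(0, 0, 0), include.mean = TRUE)

drift_hat <- unname(coef(fit_level))[1]
mean_hat  <- unname(coef(fit_diff))[1]
print(c(drift = drift_hat, mean = mean_hat))
```

Both estimates should agree with mean(diff(y)) up to optimizer tolerance, which is the sense in which ‘drift’ and ‘mean’ are the same quantity under different names.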
Code
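library(TSA)       # provides the oil.price data set
library(forecast)  # provides auto.arima()
data("oil.price")
# Fit to the original series: the constant is reported as 'drift'
out <- auto.arima(oil.price); summary(out)
# Fit to the differenced series: the same constant is reported as 'mean'
out_diff <- auto.arima(diff(oil.price)); summary(out_diff)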