logo

Definition of Variogram 📂Statistical Analysis

Definition of Variogram

Definition 1

In a fixed subset DRrD \subset \mathbb{R}^{r} of Euclidean space , consider a space process {Y(s)}sD\left\{ Y(s) \right\}_{s \in D} which is a set of random variables Y(s):ΩR1Y(s) : \Omega \to \mathbb{R}^{1} and a direction vector hRr\mathbf{h} \in \mathbb{R}^{r}. Specifically, represent nNn \in \mathbb{N} sites as {s1,,sn}D\left\{ s_{1} , \cdots , s_{n} \right\} \subset D, and assume that Y(s)Y(s) has variance existing for all sDs \in D. The following defined 2γ(h)2 \gamma ( \mathbf{h} ) is called a Variogram. 2γ(h):=E[Y(s+h)Y(s)]2 2 \gamma ( \mathbf{h} ) := E \left[ Y \left( s + \mathbf{h} \right) - Y(s) \right]^{2} Especially, half of the variogram γ(h)\gamma ( \mathbf{h} ) is called a Semivariogram.

Explanation

Definition of Regular Spatial Process:

  1. If in all sDs \in D, μ(s)\mu (s) is a constant function μ(s):=μ\mu (s) := \mu and both belong to DD for all h\mathbf{h} such that the covariance is expressed as a function of h\mathbf{h} only C:RrRC : \mathbb{R}^{r} \to \mathbb{R}, regardless of ss, through some function CC then {Y(s)}\left\{ Y(s) \right\} is said to have Weak Stationarity. Cov(Y(s),Y(s+h))=C(h) \operatorname{Cov} \left( Y (s) , Y \left( s + \mathbf{h} \right) \right) = C \left( \mathbf{h} \right) Here, CC is called the Covariance Function or Covariogram.
  2. If the mean of [Y(s+h)Y(s)]\left[ Y \left( s + \mathbf{h} \right) - Y(s) \right] is 00 and the variance depends only on h\mathbf{h} then {Y(s)}\left\{ Y(s) \right\} is said to have Intrinsic Stationarity. E[Y(s+h)Y(s)]=0Var[Y(s+h)Y(s)]=2γ(h) \begin{align*} E \left[ Y \left( s + \mathbf{h} \right) - Y(s) \right] =& 0 \\ \Var \left[ Y \left( s + \mathbf{h} \right) - Y(s) \right] =& 2 \gamma ( \mathbf{h} ) \end{align*}

Intrinsic Stationarity

Just seeing from the definition, although the variogram 2γ(h)=E[Y(s+h)Y(s)]22 \gamma ( \mathbf{h} ) = E \left[ Y \left( s + \mathbf{h} \right) - Y(s) \right]^{2} is a function that depends also on ss, it’s usually assumed that the given space process is intrinsically stationary. Conversely, because the definition of intrinsic stationarity itself doesn’t depend on ss, these two cannot be thought of separately.

Weak Stationarity

It is natural to call C(h)C \left( \mathbf{h} \right) the covariance function in the definition of weak stationarity, and the reason why it is specifically called a Covariogram despite it can be defined alone without γ\gamma is due to the following relationship.

Theorem

For a weakly stationary spatial process {Y(s)}sD\left\{ Y (s) \right\}_{s \in D}, the semivariogram γ(h)\gamma \left( \mathbf{h} \right) and the covariogram C(h)C \left( \mathbf{h} \right) satisfy the following. VarY=γ(h)+C(h) \Var Y = \gamma \left( \mathbf{h} \right) + C \left( \mathbf{h} \right)

Proof

Following the weak stationarity of the space process {Y}\left\{ Y \right\}, by substituting the zero vector into the direction vector hRr\mathbf{h} \in \mathbb{R}^{r} in Cov(Y(s),Y(s+h))=C(h)\operatorname{Cov} \left( Y (s) , Y \left( s + \mathbf{h} \right) \right) = C \left( \mathbf{h} \right), we obtain the following. C(0)=Cov(Y(s),Y(s))=VarY(s) C \left( \mathbf{0} \right) = \operatorname{Cov} \left( Y (s) , Y (s) \right) = \Var Y (s)

Relationship of Stationarity: A strongly stationary spatial process is a weakly stationary spatial process, and a weakly stationary spatial process is intrinsic. Strong    Weak    Intrinsic \text{Strong} \implies \text{Weak} \implies \text{Intrinsic}

Meanwhile, since a weakly stationary spatial process is intrinsically stationary, for all hRr\mathbf{h} \in \mathbb{R}^{r} Var[Y(s+h)Y(s)]=2γ(h) \Var \left[ Y \left( s + \mathbf{h} \right) - Y(s) \right] = 2 \gamma ( \mathbf{h} ) holds. If we unravel this backwards, 2γ(h)=Var[Y(s+h)Y(s)]=Var[Y(s+h)]+Var[Y(s)]2Cov[Y(s+h),Y(s)]=Cov[Y(s+h),Y(s+h)]+Cov[Y(s),Y(s)]2Cov[Y(s+h),Y(s)]=C(0)+C(0)2C(h)=2[C(0)C(h)]=2[VarYC(h)] \begin{align*} & 2 \gamma \left( \mathbf{h} \right) \\ =& \Var \left[ Y \left( s + \mathbf{h} \right) - Y (s) \right] \\ =& \Var \left[ Y \left( s + \mathbf{h} \right) \right] + \Var \left[ Y (s) \right] - 2 \operatorname{Cov} \left[ Y \left( s + \mathbf{h} \right) , Y (s) \right] \\ =& \operatorname{Cov} \left[ Y \left( s + \mathbf{h} \right) , Y \left( s + \mathbf{h} \right) \right] + \operatorname{Cov} \left[ Y (s) , Y (s) \right] - 2 \operatorname{Cov} \left[ Y \left( s + \mathbf{h} \right) , Y (s) \right] \\ =& C ( \mathbf{0} ) + C ( \mathbf{0} ) - 2 C ( \mathbf{h} ) \\ =& 2 \left[ C ( \mathbf{0} ) - C ( \mathbf{h} ) \right] \\ =& 2 \left[ \Var Y - C ( \mathbf{h} ) \right] \end{align*} thus, we obtain the following equality. γ(h)=VarYC(h) \gamma \left( \mathbf{h} \right) = \Var Y - C \left( \mathbf{h} \right)

See Also

  • Isotropic Variogram: When the variogram does not depend on direction and only on distance, we say the variogram is isotropic.
  • Semivariogram Models: When the semivariogram is isotropic, plotting the scatter diagram with x-axis as d:=hd := \left\| \mathbf{h} \right\| and y-axis as γ(h)\gamma (h) and fitting it to a specific model can give a sense of how variance changes with distance. It’s from this graphical examination that 2γ2 \gamma and CC are called Variograms.
  • Empirical Variogram γ\gamma^{\ast}: In actual data, there may not be many observations that exactly match h\mathbf{h}. Before analysis, it’s advisable to look into whether the data meets certain assumptions through γ\gamma^{\ast}.

  1. Banerjee. (2015). Hierarchical Modeling and Analysis for Spatial Data(2nd Edition): p24. ↩︎