logo

Definition of Variance in Basic Statistics 📂Data Science

Definition of Variance in Basic Statistics

Definition 1

Let’s assume we are given nn quantitative data.

  1. The difference (xxi)\left( \overline{x} - x_{i} \right) between the sample mean x\overline{x} and the data is called the deviation.
  2. The value s2s^{2}, which is the sum of the squares of deviations divided by n1n-1, is known as the variance of a sample. s2:=(xix)2n1 s^{2} := {{ \sum \left( x_{i} - \overline{x} \right)^{2} } \over { n-1 }} Taking the square root of the sample variance s=s2s = \sqrt{s^{2}} is referred to as the standard deviation.

Description

Dispersion refers to how spread out the data is, and it is also known as variability or spread. Variance is a measure of this dispersion and is considered the second most important statistic after the mean.

See Also

When first encountering statistics, one might wonder why calculations involve squaring and why dividing by n1n-1 instead of nn; these points can seem troublesome. As one majors in statistics and advances in their studies, they will encounter mathematical theories (which are by no means easy) that provide answers to these questions. If you are a freshman, it’s okay to just accept it for now.


  1. Mendenhall. (2012). Introduction to Probability and Statistics (13th Edition): p60~63. ↩︎