Variance and standard deviation (ungrouped data) Introduction In this leaflet we introduce variance and standard deviation as measures of spread. We can evaluate the variance of a set of data from the mean that is, how far the observations deviate from the mean. This deviation can be both positive and negative, so we need to square these values to ensure positive and negative values do not simply cancel each other out when we add up all the deviations. Variance The variance of a set of values, which we denote by σ 2 , is defined as σ 2 = ∑ (x − ¯ x) 2 n where ¯ x is the mean, n is the number of data values, and x stands for each data value in turn. Recall that ∑ x, for example, means add up all the values of x. Similarly, ∑ (x − ¯ x) 2 means subtract the mean from each data value, square, and finally add up the resulting values. (If necessary revise the leaflet Sigma Notation). An alternative, yet equivalent formula, which is often easier to use is σ 2 = ∑ x 2 n − ¯ x 2 Worked example Find the variance of 6, 7, 10, 11, 11, 13, 16, 18, 25. Firstly we find the mean, ¯ x = ∑ x n = 117 9 = 13. Method 1: σ 2 = ∑ (x − ¯ x) 2 n It is helpful to show the calculation in a table: x 6 7 10 11 11 13 16 18 25 Total x − ¯ x −7 −6 −3 −2 −2 0 3 5 12 (x − ¯ x) 2 49 36 9 4 4 0 9 25 144 280 (x − ¯ x) 2 = 280 σ 2 = ∑ (x − ¯ x) 2 n = 280 9 = 31.11 (2dp) business www.mathcentre.ac.uk c mathcentre June 9, 2003