大象传媒

Standard deviation

Standard deviation is an important measure of spread or dispersion.

It tells us how far, on average the results are from the mean.

Therefore if the standard deviation is small, then this tells us that the results are close to the mean, whereas if the standard deviation is large, then the results are more spread out.

For example, the following two data sets are significantly different in nature and yet have the same mean, median and range.

Some sort of numerical measure which distinguishes between them would be useful.

Data set 1171215202228
Data set 21151515151628
Data set 1
1
7
12
15
20
22
28
Data set 2
1
15
15
15
15
16
28

Most of the results in data set 2 are close to the mean, whereas the results in data set 1 are further from the mean in comparison.

This suggests that the standard deviation is smaller in data set 2 than data set 1.

When comparing distributions, it is better to use a measure of spread or dispersion (such as standard deviation or semi-interquartile range) in addition to a measure of central tendency (such as mean, median or mode).

There are two formulae for calculating the standard deviation, however the most commonly used formula to calculate the standard deviation is:

\(SD = \sqrt {\frac{{\sum {{(X - \bar X)}^2}}}{{n - 1}}}\)

Where \(\sum\) means 'sum of'

\({\bar X}\) is the 'mean'

\(n\) is the number of data in the sample

Use this information to try the example questions below.

Question

Find the mean and standard deviation of the following numbers: \(4,\,7,\,9,\,11,\,13,\,15,\,18\)

Question

The weights in kilograms of seven women are shown below:

\(52,\,41,\,58,\,63,\,49,\,50,\,72\)

Calculate the mean and standard deviation of these weights.

(b) The mean and standard deviation of a group of men were found to be 60 and 5.5 respectively.

Make two statements comparing the group of men with the group of women.