Table of contents
Upskilling Made Easy.
Understanding Measures of Dispersion in Statistics
Published 08 May 2025
1.7K+
5 sec read
While measures of central tendency provide valuable information about the central location of a dataset, measures of dispersion (or variability) help to understand how spread out the values are. These measures quantify the extent to which data points differ from each other and from the mean of the dataset. Understanding dispersion is essential for statistical analysis, as it offers insights into the consistency and reliability of data.
This blog will explore various measures of dispersion, including range, variance, standard deviation, interquartile range (IQR), and mean absolute deviation (MAD), and detail their calculations, interpretations, and applications.
The range is the simplest measure of dispersion. It represents the difference between the maximum and minimum values in a dataset.
Range = Maximum Value - Minimum Value
For the dataset: 4, 8, 6, 5, 3, the maximum value is 8, and the minimum value is 3. Therefore:
Range = 8 - 3 = 5
Variance measures the average of the squared differences from the mean. It quantifies how much the data values vary from the mean, providing a deeper understanding of data spread.
Var = σ2 = ∑ (x_i − x̅)2 / n
Where:
For the same dataset: 4, 8, 6, 5, 3:
Calculate the mean:
x̅ = 4 + 8 + 6 + 5 + 35 = 5.2
Calculate variance:
Var = ((4-5.2)^2 + (8-5.2)^2 + (6-5.2)^2 + (5-5.2)^2 + (3-5.2)^2) / 5
This results in a variance around ( 2.56 ).
The standard deviation (SD) is the square root of the variance. It provides a measure of dispersion in the same units as the original data, making it easier to interpret.
For a sample:
s = sqrt(var)
Continuing from the previous example, the standard deviation would be calculated as follows:
s = sqrt(2.56) = 1.6
The interquartile range represents the range of the middle 50% of a dataset and is calculated as the difference between the third quartile (Q3) and the first quartile (Q1).
IQR = Q3 - Q1
For the dataset 4, 8, 6, 5, 3:
Q1 (25th percentile) = 4 and Q3 (75th percentile) = 6.
IQR = 6 - 4 = 2
Mean Absolute Deviation measures the average of the absolute differences from the mean. It provides another perspective on the dispersion of the data.
MAD = ∑ |x_i - x̅|
Where:
Using the dataset 4, 8, 6, 5, 3:
Calculate the mean:
x̅ = 4 + 8 + 6 + 5 + 35 = 5.2
Calculate the absolute deviations:
Calculate MAD:
MAD = 1.2 + 2.8 + 0.8 + 0.2 + 2.25 = 7.25 = 1.44
Measure | Formula | Sensitivity to Outliers | Interpretability | ||
---|---|---|---|---|---|
Range | Max - Min | High | Simple measure, easy to understand | ||
Variance | (1n-1 sum (x_i - barx)^2) | High | Less intuitive due to squared units | ||
Standard Deviation | (sqrtVariance) | High | Intuitive; same units as data | ||
Interquartile Range (IQR) | (Q3 - Q1) | Low | Robust against outliers | ||
Mean Absolute Deviation (MAD) | (1n sum | x_i - barx | ) | Moderate | Easy to interpret; original units |
Measures of dispersion are fundamental to statistical analysis, enabling researchers to understand the spread and variability of their data. Whether utilizing the range, variance, standard deviation, interquartile range, or mean absolute deviation, each measure offers unique insights into the distribution and characteristics of datasets. By mastering these measures, analysts can better interpret data and make informed decisions.
Happy analyzing!