Standard deviation is the most commonly used measure of variation, which describes how spread out the data is.
Standard deviation (σ) measures how far a 'typical' observation is from the average of the data (μ).
Standard deviation is important for many statistical methods.
Here is a histogram of the age of all 934 Nobel Prize winners up to the year 2020, showing standard deviations:
Each dotted line in the histogram shows a shift of one extra standard deviation.
If the data is normally distributed:
Note: A normal distribution has a "bell" shape and spreads out equally on both sides.
The standard deviation can easily be calculated with many programming languages.
Using software and programming to calculate statistics is more common for bigger sets of data, as calculating by hand becomes difficult.
With Python use the NumPy library std()
method to find the standard deviation of the values 4,11,7,14:
import numpy
values = [4,11,7,14]
x = numpy.std(values)
print(x)
Use an R formula to find the standard deviation of the values 4,11,7,14:
values <- c(4,7,11,14)
sqrt(mean((values-mean(values))^2))
With Python use the NumPy library std()
method to find the sample standard deviation of the values 4,11,7,14:
import numpy
values = [4,11,7,14]
x = numpy.std(values, ddof=1)
print(x)
Use the R sd()
function to find the sample standard deviation of the values 4,11,7,14:
values <- c(4,7,11,14)
sd(values)