Quartiles and percentiles are measures of variation, which describes how spread out the data is.
Quartiles and percentiles are both types of quantiles.
Quartiles are values that separate the data into four equal parts.
Here is a histogram of the age of all 934 Nobel Prize winners up to the year 2020, showing the quartiles:
The quartiles (Q0,Q1,Q2,Q3,Q4) are the values that separate each quarter.
Between Q0 and Q1 are the 25% lowest values in the data. Between Q1 and Q2 are the next 25%. And so on.
Quartiles can easily be found with many programming languages.
Using software and programming to calculate statistics is more common for bigger sets of data, as finding it manually becomes difficult.
With Python use the NumPy library quantile()
method to find the quartiles of the values 13, 21, 21, 40, 42, 48, 55, 72:
import numpy
values = [13,21,21,40,42,48,55,72]
x = numpy.quantile(values, [0,0.25,0.5,0.75,1])
print(x)
values <- c(13,21,21,40,42,48,55,72)
quantile(values)
Percentiles are values that separate the data into 100 equal parts.
For example, The 95th percentile separates the lowest 95% of the values from the top 5%
The 25th percentile (P25%) is the same as the first quartile (Q1).
The 50th percentile (P50%) is the same as the second quartile (Q2) and the median.
The 75th percentile (P75%) is the same as the third quartile (Q3)
Percentiles can easily be found with many programming languages.
Using software and programming to calculate statistics is more common for bigger sets of data, as finding it manually becomes difficult.
With Python use the NumPy library percentile()
method to find the 65th percentile of the values 13, 21, 21, 40, 42, 48, 55, 72:
import numpy
values = [13,21,21,40,42,48,55,72]
x = numpy.percentile(values, 65)
print(x)
Use the R quantile()
function to find the 65th percentile (0.65) of the values 13, 21, 21, 40, 42, 48, 55, 72:
values <- c(13,21,21,40,42,48,55,72)
quantile(values, 0.65)