Box plots

Box plots provide a five-number summary for a data set. They include:

  • the median, which has equal numbers of ordered observations on each side of it
  • the first and third quartiles divide the halves of the ordered data set into halves again
  • the overall range of the data from minimum to maximum provides information about the spread of the whole data set.

The interquartile range (IQR) represents the middle 50% of the data. The IQR extends from the first quartile to the third quartile.

 

Box plot labelled with the words ‘minimum’, ‘first quartile’, ‘median’, ‘third quartile’ and ‘maximum’. Another plot below with five-number summary of 5, 7, 17, 32 and 55.

Structure of a box plot.

 

Box plots are most useful for representing the spread of large sets of data, but provide no information on the number of observations represented. They can also be used to compare multiple data sets.

The article Box Plots in the Australian Curriculum provides an introduction to box plots and to contexts within which their use can be meaningful.

Predicting AFL winners

This investigation uses side-by-side box plots to predict Australian Football League winners based on their past scoring performances.

Curriculum links

Year 10: Compare shapes of box plots to corresponding histograms and dot plots

Year 10: Construct and interpret box plots and use them to compare data sets

Source