Suppose lots of students went into do an exam. The more the better. On a national scale, if all the results were taken and plotted onto a graph, some students will do poorly, some will do very well and the majority will lie in between the two extremes.
By human nature, the graph plotted will be a normal distribution graph, it will look like a bell shape design. The highest point of the graph will be the mean, and at this point when a line is drawn down vertically to cut the x-axis, it will divide the graph in two equal parts.
This kind of structure is what's known as a normal distribution curve and it is symmetrical about it's centre line which coincides with the mean (x bar) of the observations.
The standard deviation is a measure of how this data is spread out.
One standard deviation of both sides of the mean (x bar ±σ) will contain 68% of the data.
Two standard deviation of both sides of the mean (x bar ±2σ) will contain 95% of the data.
Three standard deviation of both sides of the mean (x bar ±3σ) will contain 99∙7% of the data.
These values apply for a normal distribution. So the standard distribution is a measure of the spread of data from the mean.