Standard Deviation
Home > Lessons > Standard Deviation
Search | Updated March 21st, 2018
Introduction

    Collecting data and analyzing it is how we come to understand situations.

    Nurses collect height, weight, pulse, temperature, and blood pressure to help measure a patient's health.

    Quality control specialists examine various attributes of items from an assembly line (like dimensions, color, texture, or taste if dealing with food) to determine if the items are being manufactured correctly.

    Teachers review classroom participation, assignment completion, and quiz and test scores to see if students are learning. Schools look over a variety of data, too, like student behavior, graduation rates, and attendance and search for patterns.

    We will see how calculating and utilizing standard deviation helps us to understand the data we collect to determine if it is extreme or not.


    There are two formulas to calculate standard deviation. If we have all the data in a set, we use this formula:

standards deviation formula

...where 'xi' is each value in our set and 'μ' is the mean of the set.

    If we have a sample of the data in a set, we use this formula:

sample standards deviation formula

...where 's' is the sample standard deviation and 'x-bar' is the sample mean.


    If we are given a set of data, we have to first calculate its mean. Let us use this sample data for the purpose of calculating mean and then the sample standard deviation.

frequency table of data

    From our knowledge of Frequency Tables, we know this data can be written as a set of data, like so.

data in set form

    The mean is simple to calculate. Here it is shown:

calculating the mean of the data

    Adding the numerator and cleaning up, we get this:

the mean of the data

    Now, we have to subtract the mean from each value in our set, like so.

sample standards deviation calculation subtraction

    Our next step is to square these differences.

sample standards deviation calculation squaring

    The sigma in the formula means to add the numbers. So, here is the sum.

sample standards deviation calculation sum of squares

    The formula tells us to divide this sum by one less than the number of data values we have. We have seven data values; so, we will divide by six.

sample standards deviation calculation divide one less than n

    The last step demands that we take the square root.

sample standards deviation calculation square root

    If we round this to the nearest hundredth, we get...

sample standards deviation calculation round hundredths



    It is absolutely true that no one uses formulas when calculating standard deviation. Instead, either an advanced calculator or a software product is used.

    Graphing calculators have a built in method for calculating standard deviation and sample deviation. This is true for other calculator manufacturers, which includes Casio and Hewlett Packard. However, the explanation below will use screenshots and explanations using the TI-Nspire calculator.

    Let us get started by using the same data presented in the last section. We have.

frequency table of data

    To begin, we have to get a new document. So, we have to press the home button to get a new document.

ti-nspire home button

    Then, we select "Lists and Spreadsheet."

ti-nspire new document lists & spreadsheet

    Next, we plug in the data we have in the same two-column format.

ti-nspire lists & spreadsheet two columns frequency table

    It is recommended (but optional) that we label the columns. Here, the top of column 'a' is labeled 'num' and the top of column 'b' is labeled 'freq.'

ti-nspire lists & spreadsheet label column

    Now, we want the calculator to determine the mean and sample standard deviation. We press Menu, then Statistics and Stat Calculations, as shown here:

ti-nspire statistics stat calculations

    Then, select One-Variable Statistics, like so.

ti-nspire stat calculations one-variable statistics

    We now have to set the next menu. The 'x1 list' is asking for the column that contains the numbers in our data set. We can leave it as a[] to indicate column 'a' or we can click on 'num' from the dropdown menu, as shown here:

ti-nspire one-variable statistics settings

    Next, click on the dropdown menu from the line marked 'Frequency List' to select either 'freq' or we can type in b[] to indicate column 'b.'

ti-nspire one-variable statistics frequency list setting

    In the line marked '1st Result Column,' we have to indicate where we want the data printed. Since columns 'a' and 'b' are already occupied, make sure to type in c[] to indicate column 'c.' Here is a screenshot.

ti-nspire one-variable statistics 1st result column

    This next screen contains what we were looking for, namely the mean (labeled as x-bar) and the sample standard deviation (labeled sx), which has been highlighted below.

ti-nspire stat calculations one-variable statistics results screen

    If we had all the data in the set and we were looking for the standard deviation (not the sample standard deviation), we would use the σx-value instead, which is highlighted in this menu.

ti-nspire one-variable statistics sigma standard deviation



    Standard deviation measures how much spread there is to a data set. If a data set has a high standard deviation, the data spreads far to the left-right. If a data set has a small standard deviation, the data is more horizontally compact.

    This can be best demonstrated by comparing two data sets that have different standard deviations.

    The histogram below is the result of rolling two 6-sided dice 100 times and recording the sum of the two dice. [See our Dice Roller: Mean, Standard Deviation interactive quizmaster to roll your own dice]. The mean is 6.51 and the standard deviation is 2.39.

Sum of Two 6-Sided Dice, 100 Rolls
rolling two 6-sided dice sum histogram


    The histogram below is similar to the situation above, but instead two 10-sided dice were rolled 100 times and the sum of the dice was recorded. [The Dice Roller: Mean, Standard Deviation was used again.] The mean is 10.89 and the standard deviation is 3.92.

Sum of Two 10-Sided Dice, 100 Rolls
rolling two 6-sided dice sum histogram


    The scales are both set to the same values to make the comparison possible. Notice that the graph with a higher standard deviation is wider than the graph with the lower standard deviation.

    This is evidence to support the claim that standard deviation informs us of the amount of horizontal spread in the data.

    Also, numbers that are far away from the mean of a data set are numbers that become more rare. Look at the next section to see how plotting mean and standard deviation on a number line can be used to describe common values versus uncommon values in a data set.


    This section will teach you how to use a horizontal graph with mean and standard deviation to determine the rarity of numbers within a set.

Example 1: It has been said male heights have a mean of 69.1 inches and the standard deviation is 2.9. Plot this information linearly in a horizontal format.

    The line graph below has the mean in the middle (marked with a red line segment). The number to the right of the mean is one standard deviation greater than the mean or 69.1 + 2.9 = 72. Similarly, the number to the left of the mean is one standard deviation lower than the mean or 69.1 - 2.9 = 66.2.

male height one standard deviation mean

    We can keep adding standard deviations (+2.9) as we move to the right and subtract standard deviations as we move to the left (-2.9). The horizontal graph below shows one more standard deviation right and left.

male height two standard deviations mean

    With the horizontal graph below, one more standard deviation right and left is shown.

male height three standard deviation mean

    Numbers that are several standard deviations from the mean are rare occurrences. For instance, being 6 feet tall (72 inches or one standard deviation to the right of the mean) is not uncommon.

    However, males who are more than two standard deviations to the right of the mean is uncommon. In fact, about 2.5% of the male population has a height greater than two standard deviations from the mean. Heights three or more standard deviations to the right is rarer still. Less than 1% of the male population is that tall!

Example 2: If a soda machine dispenses large sodas with a mean of 12 ounces and a standard deviation of 0.07 ounces, create a horizontal graph that includes three standard deviations.

    In general, we must place the mean at the center of the line graph (because the mean represents the center of the data) and we keep adding standard deviations to the right. We keep subtracting standard deviations to the left. So, in general the graph should look like this:

three standard deviations mean mu sigma

    To address the soda machine, we will place 12 at the center. To the right, we must place three standard deviations. This is what the math looks like.

12 + 1(0.07) = 12.07
12 + 2(0.07) = 12.14
12 + 3(0.07) = 12.21

    To the left of the mean, we subtract standard deviations, like so.

12 - 1(0.07) = 11.93
12 - 2(0.07) = 11.86
12 - 3(0.07) = 11.79

    With all the numbers in place, this is our line graph to three standard deviations.

three standard deviations soda machine output

    uiz: Bell Curve: Standard Deviation


    Try this quiz, which can help you learn the content above.

    uiz: Bell Curve: Standard Deviation


    Try this lesson and quizmaster, which are related to the sections above.

    esson: Frequency Tables
    esson: Bell Curve: 68-95-99 Rule
    esson: Margin of Error and Confidence Intervals
    esson: Margin of Error & Confidence Intervals by Formula

    uiz: Dice Roller: Mean, Standard Deviation