Standard Deviation Formula, Statistics, Variance, Sample and Population Mean

00:10:21
https://www.youtube.com/watch?v=Uk98hiMQgN0

Summary

TLDRThis video provides an educational overview on how to calculate standard deviation, a statistical measure of the dispersion of data points in a set. The tutorial describes formulas for both population and sample standard deviations, emphasizing the difference in denominator: 'n' for population and 'n-1' for sample. Using examples like data sets (3, 5, 7) and (4, 5, 6), the video demonstrates calculating mean first, followed by each data point's squared deviation from the mean, and finally computing the square root of the averaged squared deviations. By comparing these data sets, the video illustrates how more scattered numbers yield a higher standard deviation. Additionally, the video explains that variance, another measure of dispersion, is the square of the standard deviation. The presenter also invites viewers to explore more educational content on their channel, covering mathematics to science disciplines.

Takeaways

  • 📊 Introduction to standard deviation and its formulas for population and sample data.
  • 📐 The difference between population (using 'n') and sample (using 'n-1') standard deviation formulas.
  • 🧮 Step-by-step calculation of standard deviation using example datasets.
  • 📉 Understanding that greater data spread leads to higher standard deviation.
  • 🔍 Variance is derived by squaring the standard deviation.
  • 👥 Comparison of different datasets to illustrate concept application.
  • ↔️ Description of calculating the mean as an initial step in finding standard deviation.
  • 🎓 Explanation of sigma and mu symbols in the context of statistics.
  • 🔢 Problem-solving examples with numbers like (3, 5, 7) vs. (4, 5, 6).
  • 📚 Invitation to explore further educational content in math and science.

Timeline

  • 00:00:00 - 00:05:00

    The video explains how to calculate the standard deviation for a set of numbers, starting with two key formulas: the population standard deviation and the sample standard deviation. The population standard deviation, represented by the Greek letter sigma, involves finding the differences between each data point and the population mean (mu), squaring these differences, summing them, dividing by the number of data points (n), and then taking the square root. The sample standard deviation is similar but uses the sample mean and divides by n-1 instead of n. An example is provided with two sets of numbers, 4, 5, 6 and 3, 5, 7, to compare their standard deviations. The video suggests that the second set has a higher standard deviation because its numbers are more spread out.

  • 00:05:00 - 00:10:21

    The video continues with a detailed calculation of the standard deviation for the numbers 3, 5, 7, using the population standard deviation formula. The mean is calculated first, which is 5. The differences from the mean are calculated (3-5, 5-5, 7-5), squared, summed, and divided by the number of data points, followed by taking the square root to find the standard deviation, approximately 1.63. The same process is repeated for the numbers 4, 5, 6, with a calculated standard deviation of approximately 0.816, confirming it's lower due to less spread. The variance is addressed next, noted to be the square of the standard deviation, exemplified by squaring 1.63 to obtain approximately 2.66. The video concludes by summarizing that the methods for population and sample standard deviations mainly differ in the divisor used (n or n-1) and briefly discusses variance calculation.

Mind Map

Video Q&A

  • What is the formula for population standard deviation?

    The formula for population standard deviation is the square root of the sum of squared differences between each data point and the population mean, divided by the number of data points.

  • What is the difference between population and sample standard deviation formulas?

    The population standard deviation uses the actual number of data points, while the sample standard deviation divides by the number of data points minus one.

  • How do you find the mean of a data set?

    The mean is found by adding all the numbers in the data set and dividing by the number of data points.

  • Why does standard deviation measure dispersion?

    Standard deviation measures how spread out the numbers in a data set are, indicating the degree of variation from the mean.

  • What is the importance of understanding standard deviation?

    Standard deviation provides insight into the variability and consistency within a data set, making it useful for assessing risk and making comparisons.

  • Can you calculate variance from the standard deviation?

    Yes, variance is the standard deviation squared.

  • What is an example of calculating standard deviation in the video?

    The video shows the calculation of standard deviation for two sets of numbers: 3, 5, and 7, and 4, 5, and 6, using the population standard deviation formula.

  • What kind of subjects does the video creator offer?

    The video creator offers subjects in algebra, trade, pre-calculus, calculus, chemistry, and physics.

  • What is the approximate standard deviation for numbers 3, 5, and 7?

    The approximate standard deviation for numbers 3, 5, and 7 is 1.63.

  • How is variance calculated in the video?

    Variance is calculated by squaring the standard deviation, which is the sum of squared differences from the mean divided by the number of data points.

View more video summaries

Get instant access to free YouTube video summaries powered by AI!
Subtitles
en
Auto Scroll:
  • 00:00:01
    in this video we're going to calculate
  • 00:00:03
    the standard deviation of a set of
  • 00:00:05
    numbers
  • 00:00:06
    now there's two formulas you need to be
  • 00:00:07
    aware of the first one
  • 00:00:10
    is the population
  • 00:00:11
    standard deviation
  • 00:00:17
    now this formula
  • 00:00:19
    is represented by the letter sigma
  • 00:00:21
    that's the standard deviation
  • 00:00:24
    it's equal to the sum
  • 00:00:26
    of all the differences between
  • 00:00:28
    every point
  • 00:00:30
    in the data set and the population mean
  • 00:00:33
    the population mean is mu
  • 00:00:36
    which is this symbol here
  • 00:00:41
    and then you need to square it
  • 00:00:43
    divided by
  • 00:00:45
    n which is
  • 00:00:46
    all of the
  • 00:00:48
    numbers in the set
  • 00:00:49
    and then you got to take the square root
  • 00:00:51
    of the whole result
  • 00:00:52
    so that's the population standard
  • 00:00:54
    deviation
  • 00:00:56
    the next formula is the sample standard
  • 00:00:59
    deviation
  • 00:01:01
    so let's say if
  • 00:01:03
    you have
  • 00:01:05
    just a sample of a population not the
  • 00:01:07
    entire population
  • 00:01:09
    if you just have a sample data out of
  • 00:01:11
    the entire data
  • 00:01:13
    then you want to use this formula
  • 00:01:16
    s which is the standard deviation
  • 00:01:19
    is equal to sigma
  • 00:01:22
    the sum of all of the
  • 00:01:24
    differences between every point
  • 00:01:27
    and the mean that's the sample mean
  • 00:01:30
    in the other equation we had the
  • 00:01:32
    population mean represented by mu
  • 00:01:35
    but this is the sample mean which is
  • 00:01:37
    basically the average of all the data
  • 00:01:39
    points in the set
  • 00:01:41
    and then you have to square it but it's
  • 00:01:43
    going to be divided by n
  • 00:01:45
    minus 1 as opposed to n
  • 00:01:49
    and so that's how you calculate the
  • 00:01:50
    standard deviation of the sample
  • 00:01:55
    now let's work on an example
  • 00:01:58
    let's say if we have two set of numbers
  • 00:02:00
    four five and six
  • 00:02:03
    and also three
  • 00:02:05
    five and seven
  • 00:02:07
    which one has a greater standard
  • 00:02:09
    deviation let's use the population
  • 00:02:11
    standard deviation formula
  • 00:02:13
    but if we had to guess
  • 00:02:15
    which set of numbers
  • 00:02:17
    has the greater standard deviation is it
  • 00:02:19
    the one on the left or the one on the
  • 00:02:21
    right
  • 00:02:23
    what would you say
  • 00:02:24
    we need to understand the basic idea
  • 00:02:26
    of standard deviation you need to know
  • 00:02:28
    what it measures
  • 00:02:29
    standard deviation tells you
  • 00:02:31
    how far apart the numbers
  • 00:02:33
    are related to each other
  • 00:02:35
    so the more spread out they are the
  • 00:02:37
    greater the standard deviation
  • 00:02:40
    four five and six are closer to each
  • 00:02:42
    other than three five and seven
  • 00:02:44
    and you could tell if you plot them on a
  • 00:02:45
    number line
  • 00:02:47
    let's put five in the middle
  • 00:02:56
    so 4 5 and 6
  • 00:02:58
    here they are on a number line
  • 00:03:01
    now in contrast
  • 00:03:02
    let's put the same numbers
  • 00:03:05
    on this number line
  • 00:03:13
    we're going to have 3
  • 00:03:15
    5 and 7.
  • 00:03:17
    so if you look at the the red points
  • 00:03:20
    the points in red are further apart
  • 00:03:23
    the points in blue they're very close
  • 00:03:25
    together
  • 00:03:26
    so therefore
  • 00:03:27
    four five and six
  • 00:03:28
    has a lower standard deviation in three
  • 00:03:30
    five and seven
  • 00:03:33
    so sigma is low
  • 00:03:36
    and here the sigma value is high
  • 00:03:42
    now go ahead and calculate the
  • 00:03:45
    standard deviation
  • 00:03:48
    for this
  • 00:03:53
    for the set of numbers three five and
  • 00:03:54
    seven
  • 00:03:55
    so what's the first thing that we should
  • 00:03:57
    do
  • 00:03:59
    the first thing that we should do is
  • 00:04:00
    calculate the mean
  • 00:04:03
    to find the mean
  • 00:04:04
    it's going to be the sum
  • 00:04:06
    of all the numbers divided by 3.
  • 00:04:09
    now because the three numbers are evenly
  • 00:04:11
    spaced apart the mean is going gonna be
  • 00:04:13
    the middle number five
  • 00:04:15
    three plus five is eight eight plus
  • 00:04:17
    seven
  • 00:04:18
    is fifteen
  • 00:04:19
    fifteen divided by three is five so
  • 00:04:22
    that's the mean
  • 00:04:27
    now what should we do next now that we
  • 00:04:28
    have the mean
  • 00:04:32
    now think of the formula
  • 00:04:35
    it's going to be a sigma
  • 00:04:38
    of every point minus the
  • 00:04:40
    mean squared
  • 00:04:42
    divided by
  • 00:04:44
    n
  • 00:04:45
    and then all of this is within the
  • 00:04:47
    square root
  • 00:04:49
    so here's how to use the equation first
  • 00:04:52
    we're going to use the first point 3
  • 00:04:55
    subtract it by the mean and then squared
  • 00:04:58
    next we're going to take the second
  • 00:04:59
    point 5
  • 00:05:00
    subtract it from the mean squared
  • 00:05:02
    and then it's going to be 7
  • 00:05:04
    minus 5
  • 00:05:06
    squared
  • 00:05:08
    so each of these three points
  • 00:05:10
    you're going to plug into x sub i
  • 00:05:13
    and then you're going to square the
  • 00:05:14
    differences between each of those values
  • 00:05:17
    and the sigma represents sum so you're
  • 00:05:19
    going to add every difference
  • 00:05:21
    that you get
  • 00:05:23
    or you can add the square of every
  • 00:05:25
    difference that you get
  • 00:05:26
    and now let's divide it by n
  • 00:05:29
    so n is the number of numbers that we
  • 00:05:31
    have in this set there are three numbers
  • 00:05:33
    inside
  • 00:05:34
    so n is 3
  • 00:05:35
    and then we're going to take the square
  • 00:05:36
    root of the entire thing
  • 00:05:39
    three minus five is negative two
  • 00:05:42
    negative two squared is four
  • 00:05:44
    five minus five is zero
  • 00:05:47
    seven minus five is two two squared is
  • 00:05:49
    four
  • 00:05:52
    four plus four is eight
  • 00:05:54
    so we have the square root of eight
  • 00:05:56
    divided by three
  • 00:05:58
    and at this point we're going to use the
  • 00:06:00
    calculator
  • 00:06:05
    8 divided by 3 is about 2.67 and if you
  • 00:06:08
    take the square root of that
  • 00:06:10
    you're going to get 1.63
  • 00:06:14
    so that's the standard deviation
  • 00:06:17
    for 3 5 and 7.
  • 00:06:20
    now let's calculate the standard
  • 00:06:22
    deviation
  • 00:06:23
    for the other set of numbers four five
  • 00:06:26
    and six so why don't you go ahead and
  • 00:06:28
    pause the video
  • 00:06:29
    and try this example
  • 00:06:31
    calculate the standard deviation using
  • 00:06:33
    the same formula
  • 00:06:35
    so let's go ahead and begin let's
  • 00:06:36
    calculate the population mean
  • 00:06:39
    it's going to be 4 plus 5 plus 6 divided
  • 00:06:43
    by
  • 00:06:44
    the number of numbers that we have which
  • 00:06:45
    is
  • 00:06:46
    3. four plus six is ten ten plus five is
  • 00:06:50
    fifteen and we know that fifteen divided
  • 00:06:52
    by three is five
  • 00:06:54
    so once again any time the numbers are
  • 00:06:56
    evenly spread apart the mean is going to
  • 00:06:58
    be the middle number
  • 00:07:02
    so now
  • 00:07:03
    we can calculate the standard deviation
  • 00:07:06
    so sigma
  • 00:07:07
    is going to equal the square root but
  • 00:07:10
    before we do that let's calculate
  • 00:07:12
    the differences
  • 00:07:14
    so the first difference that we have the
  • 00:07:16
    first number is going to be 4
  • 00:07:18
    and we're going to subtract it from the
  • 00:07:19
    mean
  • 00:07:20
    and then square it the next number
  • 00:07:22
    is 5
  • 00:07:24
    subtract it from the mean
  • 00:07:26
    and then square it
  • 00:07:27
    and then after that the last number is
  • 00:07:28
    six
  • 00:07:29
    this is going to be
  • 00:07:31
    six minus five squared
  • 00:07:34
    now it's divided by n
  • 00:07:36
    and let's not forget to take the square
  • 00:07:38
    root of the entire thing
  • 00:07:41
    four minus five is negative one negative
  • 00:07:44
    one squared is simply one
  • 00:07:46
    five minus five is zero
  • 00:07:48
    six minus five is one
  • 00:07:51
    and it's all divided by three one plus
  • 00:07:53
    one is two
  • 00:07:55
    so we have the square root of two
  • 00:07:57
    divided by 3.
  • 00:08:00
    now 2 divided by 3 as a decimal is about
  • 00:08:02
    0.67
  • 00:08:03
    and the square root of 0.67
  • 00:08:07
    is 0.816
  • 00:08:12
    so as you can see the standard deviation
  • 00:08:14
    is less because
  • 00:08:16
    these numbers are closer to each other
  • 00:08:19
    they're not far apart from the mean
  • 00:08:21
    in the other example
  • 00:08:22
    three five and seven
  • 00:08:24
    they're further apart from the mean
  • 00:08:26
    which is five
  • 00:08:27
    three is two units away from five
  • 00:08:30
    four
  • 00:08:31
    is only one unit away from five and
  • 00:08:33
    that's why the standard deviation is so
  • 00:08:35
    much less
  • 00:08:37
    now let's go back to the first example
  • 00:08:39
    we said that the population standard
  • 00:08:41
    deviation
  • 00:08:42
    is approximately 1.63
  • 00:08:45
    so given this information
  • 00:08:47
    how can you calculate
  • 00:08:49
    the variance
  • 00:08:52
    v a r i a-n-c-e
  • 00:08:56
    how can we find the variance
  • 00:08:58
    the variance is simply the square
  • 00:09:00
    of the standard deviation
  • 00:09:03
    so 1.63 squared
  • 00:09:06
    is equal to now keep in mind this is a
  • 00:09:09
    rounded answer
  • 00:09:10
    i don't remember what the exact answer
  • 00:09:12
    was but
  • 00:09:14
    once you square it it's about 2.66
  • 00:09:16
    so that's how you can calculate the
  • 00:09:18
    variance
  • 00:09:20
    the formula for variance is basically
  • 00:09:23
    the sum
  • 00:09:24
    of all the square differences
  • 00:09:26
    between every point and the population
  • 00:09:28
    mean
  • 00:09:29
    divided by n
  • 00:09:30
    it's basically the same formula without
  • 00:09:32
    the square root symbol
  • 00:09:36
    well that's it for this video so now you
  • 00:09:38
    know how to calculate
  • 00:09:40
    the population standard deviation and
  • 00:09:42
    also the sample standard deviation even
  • 00:09:44
    though we did just one of them the
  • 00:09:46
    process is the same
  • 00:09:48
    of finding the other one
  • 00:09:50
    the only difference is you have n minus
  • 00:09:52
    one instead of n
  • 00:09:54
    you also know how to calculate the
  • 00:09:55
    variance as well
  • 00:09:57
    so that concludes this video by the way
  • 00:10:00
    if you want to find more of my videos
  • 00:10:01
    you can check out my channel or visit my
  • 00:10:03
    website video.tutor.net and you can find
  • 00:10:06
    playlists on algebra trade pre-calculus
  • 00:10:09
    calculus
  • 00:10:11
    chemistry and physics
  • 00:10:13
    so those are the subjects that i
  • 00:10:14
    currently offer right now
  • 00:10:16
    and if you're interested just feel free
  • 00:10:18
    to check that out
  • 00:10:19
    so thanks again for watching
Tags
  • standard deviation
  • variance
  • population mean
  • sample mean
  • data dispersion
  • calculations
  • statistics tutorial
  • educational video
  • math concepts
  • examples