00:00:00
well look at that we're already to
00:00:02
chapter 12
00:00:03
and we're only doing one section so this
00:00:05
is actually the last section
00:00:07
for the whole semester so this is on
00:00:10
anova which stands for analysis
00:00:13
of variance and remember variance is
00:00:15
just the standard deviation squared
00:00:18
so in the past we were able to use the t
00:00:22
test to taste the claim about two means
00:00:26
but what if it wasn't just two means
00:00:28
what if you had three or more groups
00:00:30
and you wanted to see if the mean was
00:00:32
equal for those three groups
00:00:35
so the claim could look something like
00:00:36
this mean one equals mean
00:00:38
two equals mean three and you could even
00:00:40
have four
00:00:41
five six seven you could do a whole
00:00:43
bunch of groups all at the same time
00:00:45
with one simple test
00:00:49
so for each group you have to find the
00:00:52
mean and standard deviation
00:00:55
and then we're going to see how much
00:00:57
variation there is in the three means
00:00:59
in other words if the three groups are
00:01:01
the same
00:01:03
an average of five an average of five an
00:01:05
average of five point one
00:01:07
they're all pretty much the same then
00:01:08
the variance of those means is going to
00:01:10
be very very small
00:01:13
so we're going to use what's called the
00:01:14
f test and the critical value is going
00:01:17
to be in table a5
00:01:19
now for this f test and for this anova
00:01:21
test you can actually use the calculator
00:01:23
but i'm going to do a simple example
00:01:25
using the formula to show you the idea
00:01:27
of where it came from
00:01:29
and then after that i'll show you how to
00:01:30
use the calculator
00:01:32
all right here we go so use anova to
00:01:35
test the claim of equal means
00:01:37
here's these three groups so perhaps
00:01:39
this is
00:01:40
three different classes trigonometry
00:01:43
statistics and calculus something like
00:01:46
that and then these are the ages of the
00:01:48
students
00:01:49
so just by looking at them does it look
00:01:51
like there's a difference
00:01:53
well to me these people look pretty
00:01:56
young
00:01:57
these people are in their early 20s and
00:02:00
then these people
00:02:01
are in their late 20s to early 30s so
00:02:04
group three definitely looks
00:02:05
older so i think that at the end i'm
00:02:08
gonna get reject
00:02:09
they are not the same okay so
00:02:12
first find the mean and standard
00:02:14
deviation each group has five people
00:02:17
sure enough for these people they're
00:02:18
younger 19.8
00:02:20
for these people they're older 29.2 and
00:02:23
we also need the standard deviation for
00:02:25
each group
00:02:29
okay so that's pretty much what i just
00:02:33
said
00:02:34
so here's the same data and now find the
00:02:37
overall mean
00:02:38
so what is the average of the three
00:02:40
averages so just add up these three
00:02:42
averages divide by three and the overall
00:02:45
average is 23.867
00:02:52
so how much do the means vary from
00:02:55
23.867
00:02:57
so like i said we can use the calculator
00:02:59
i'll show you
00:03:00
very easy very fast using the calculator
00:03:03
but this is what the formula is
00:03:04
basically saying
00:03:05
how much does this group vary from the
00:03:09
average how much does this group vary
00:03:10
from the average and how much does this
00:03:12
group vary from the average
00:03:14
since i'm not taking the square root of
00:03:16
it this is the variance called the
00:03:17
variance of the means
00:03:19
so that's a 23.29
00:03:23
now also we need to pay attention to how
00:03:25
much each individual group
00:03:27
varies or how much variance there is for
00:03:29
each one
00:03:30
so variance just means s squared so you
00:03:33
square each one
00:03:34
add them up and divide by 3.
00:03:39
now the formula for the test statistic
00:03:42
f is going to be n the sample size
00:03:46
times the variance of the means that's
00:03:49
this one
00:03:50
divided by which called this is called
00:03:53
the pooled variance which is this 4.897
00:03:57
or the mean of the variances
00:04:02
so just reiterating the top one is going
00:04:04
to be 23.92
00:04:06
and the bottom one is going to be 4.897
00:04:11
now when you divide those the test
00:04:14
statistic turns out to be 23.78
00:04:19
and then use table a5 it's a right till
00:04:22
only test
00:04:25
and degrees of freedom number one is
00:04:28
the number of categories so there is
00:04:30
three groups so k equals three
00:04:33
so categories minus one that's degrees
00:04:35
of freedom number one
00:04:37
degrees of freedom number two will be
00:04:38
that three times n minus one
00:04:41
so this was five people in each group so
00:04:43
this would be three
00:04:45
times five minus one
00:04:48
so there are the two degrees of freedom
00:04:50
a two and a twelve
00:04:53
so on table 85 i go to 2 for the top one
00:04:58
12 for the side one so it should be this
00:05:00
one right here
00:05:01
let's see if the magical rectangle
00:05:02
agrees ah yes
00:05:05
so the critical value is 3.8853
00:05:10
so i'm using 95 percent confidence
00:05:12
that's why this
00:05:13
right here is a 0.05 or right here it
00:05:16
says .05 in the right tail because it's
00:05:18
a right till only
00:05:20
and we're using 95 confidence
00:05:24
so then you draw the little picture and
00:05:27
the critical value is 3.8853
00:05:33
the test statistic was 23.78
00:05:36
which goes way past three point eight
00:05:39
eight five three the critical value that
00:05:41
means reject the claim
00:05:43
they are not the same so
00:05:46
in intuitively
00:05:49
now just by looking at the numbers at
00:05:51
the beginning i was trying to get all
00:05:52
fancy sorry about that
00:05:54
um from the beginning i could see group
00:05:56
one was younger
00:05:57
group three was older now i have actual
00:06:00
proof
00:06:00
the means are not equal because group
00:06:03
one is younger
00:06:04
group three is older okay now how to use
00:06:07
the calculator
00:06:10
that's right you don't have to use the
00:06:12
formula
00:06:14
so put the data into list one list two
00:06:17
list three
00:06:18
and of course if you have more than
00:06:19
three you can go ahead and use list four
00:06:21
list five list six
00:06:24
so there are my numbers i typed them
00:06:25
into the good old ti 84
00:06:27
list one list two list three then you go
00:06:30
to stat
00:06:32
tests and go down to the bottom
00:06:35
so the one we've used so much
00:06:39
stat and then tests
00:06:42
and you can scroll down to the bottom
00:06:44
but because it's the bottom you could
00:06:45
also scroll
00:06:46
up and then it says anova right there
00:06:50
i don't know if you can see it but i'll
00:06:51
show you in just a second
00:06:53
so it will say anova and then you just
00:06:56
have to put the list make sure you put a
00:06:58
comma
00:06:58
in between so it has to be list one
00:07:00
comma list two comma list three
00:07:04
and then hit enter
00:07:08
now the test statistic right here
00:07:11
i think this is even a little more
00:07:13
accurate than i had because mine had a
00:07:15
little bit of rounding error
00:07:17
because i was working with three
00:07:18
decimals and this is working with even
00:07:20
more decimals so this
00:07:21
is not only faster easier funner
00:07:24
but it's more accurate as well so that's
00:07:27
the test statistic
00:07:28
f and then for the degrees of freedom
00:07:31
you don't even have to do the formula
00:07:32
for that
00:07:33
because this is degrees of freedom
00:07:34
number one
00:07:36
and then there's this little arrow
00:07:37
saying scroll down so if you hit the
00:07:39
down button
00:07:42
it will then tell you degrees of freedom
00:07:44
number one
00:07:45
is a two and the second degrees of
00:07:47
freedom is a twelve
00:07:49
you can just ignore these words factor
00:07:52
and error
00:07:53
and ignore the ss and the ms all you
00:07:55
need is degrees of freedom number one
00:07:57
and degrees of freedom number two and
00:08:01
that's it