00:00:00
Today I'd like to share with you three fake proofs in increasing order of subtlety,
00:00:04
and then discuss what each one of them has to tell us about math.
00:00:11
The first proof is for a formula for the surface area of a sphere,
00:00:14
and the way that it starts is to subdivide that sphere into vertical slices,
00:00:19
the way you might chop up an orange or paint a beach ball.
00:00:22
We then unravel all of those wedge slices from the northern hemisphere,
00:00:26
so that they poke up like this, and then symmetrically unravel all of those from the
00:00:30
southern hemisphere below, and now interlace those pieces to get a shape whose area we
00:00:35
want to figure out.
00:00:36
The base of this shape came from the circumference of the sphere,
00:00:40
it's an unraveled equator, so its length is 2 pi times the radius of the sphere,
00:00:45
and then the other side of this shape came from the height of one of these wedges,
00:00:49
which is a quarter of a walk around the sphere,
00:00:52
and so it has a length of pi halves times r.
00:00:55
The idea is that this is only an approximation,
00:00:57
the edges might not be perfectly straight, but if we think of the limit as we do finer
00:01:02
and finer slices of the sphere, this shape whose area we want to know gets closer to
00:01:07
being a perfect rectangle, one whose area will be pi halves r times 2 pi r,
00:01:11
or in other words pi squared times r squared.
00:01:15
The proof is elegant, it translates a hard problem into a situation that's easier to
00:01:19
understand, it has that element of surprise while still being intuitive, its only fault,
00:01:24
really, is that it's completely wrong, the true surface area of a sphere is 4 pi r
00:01:29
squared.
00:01:30
I originally saw this example thanks to Henry Reich, and to be fair,
00:01:34
it's not necessarily inconsistent with the 4 pi r squared formula,
00:01:37
just so long as pi is equal to 4.
00:01:40
For the next proof I'd like to show you a simple
00:01:42
argument for the fact that pi is equal to 4.
00:01:45
We start off with a circle, say with radius 1,
00:01:48
and we ask how can we figure out its circumference, after all,
00:01:51
pi is by definition the ratio of this circumference to the diameter of the circle.
00:01:56
We start off by drawing the square whose side lengths are all tangent to that circle.
00:02:00
It's not too hard to see that the perimeter of this square is 8.
00:02:04
Then, and some of you may have seen this before, it's a kind of classic argument,
00:02:08
the argument proceeds by producing a sequence of curves,
00:02:10
all of whom also have this perimeter of 8, but which more and more closely
00:02:14
approximate the circle.
00:02:15
But the full nuance of this example is not always emphasized.
00:02:19
First of all, just to make things crystal clear,
00:02:21
the way each of these iterations works is to fold in each of the corners of
00:02:24
the previous shape so that they just barely kiss the circle,
00:02:27
and you can take a moment to convince yourself that in each region where a fold happened,
00:02:31
the perimeter doesn't change.
00:02:33
For example, in the upper right here, instead of walking up and then left,
00:02:36
the new curve goes left and then up.
00:02:39
And something similar is true at all of the folds of all of the different iterations.
00:02:42
Wherever the previous iteration went direction A then direction B,
00:02:46
the new iteration goes direction B then direction A, but no length is lost or gained.
00:02:51
Some of you might say, well obviously this isn't going to give the true perimeter of the
00:02:55
circle, because no matter how many iterations you do, when you zoom in,
00:02:58
it remains jagged, it's not a smooth curve, you're taking these very inefficient steps
00:03:02
along the circle.
00:03:03
While that is true, and ultimately the reason things are wrong,
00:03:06
if you want to appreciate the lesson this example is teaching us,
00:03:09
the claim of the example is not that any one of these approximations equals the curve,
00:03:13
it's that the limit of all of the approximations equals our circle.
00:03:17
And to appreciate the lesson that this example teaches us,
00:03:20
it's worth taking a moment to be a little more mathematically
00:03:23
precise about what I mean by the limit of a sequence of curves.
00:03:27
Let's say we describe the very first shape, this square,
00:03:30
as a parametric function, something that has an input t and it outputs
00:03:34
a point in 2d space, so that as t ranges from 0 to 1, it traces that square.
00:03:39
I'll call that function c0, and likewise we can parameterize the next
00:03:43
iteration with a function I'll call c1, as the parameter t ranges from 0 up to 1,
00:03:48
the output of this function traces along that curve.
00:03:52
This is just so that we can think of these shapes as instead being functions.
00:03:56
Now I want you to consider a particular value of t, maybe 0.2,
00:04:00
and then consider the sequence of points that you get by evaluating
00:04:04
the sequence of functions we have at this particular point.
00:04:07
Now I want you to consider the limit as n approaches infinity of c sub n of 0.2.
00:04:14
This limit is a well-defined point in 2d space, in fact that point sits on the circle.
00:04:20
And there's nothing specific about 0.2, we could do this limiting process for
00:04:25
any input t, and so I can define a new function that I'll call c infinity,
00:04:29
which by definition at any input t is whatever this limiting value for all the curves is.
00:04:34
So here's the point, that limiting function c infinity is the circle,
00:04:39
it's not an approximation of the circle, it's not some jagged version of the circle,
00:04:44
it is the genuine smooth circular curve whose perimeter we want to know.
00:04:49
And what's also true is that the limit of the lengths of all of our curves really is 8,
00:04:54
because each individual curve really does have a perimeter of 8.
00:04:59
And there are all sorts of examples throughout calculus when
00:05:02
we talk about approximating one thing we want to know as a
00:05:05
limit of a bunch of other things that are easier to understand.
00:05:08
So the question at the heart here is why exactly
00:05:11
is it not okay to do that in this example?
00:05:16
And maybe at this point you step back and say, you know,
00:05:19
it's just not enough for things to look the same, this is why we need rigor,
00:05:23
it's why we need proofs, it's why since the days of Euclid mathematicians
00:05:27
have followed in his footsteps and deduced truths step by step from axioms forward.
00:05:31
But for this last example I would like to do something that
00:05:34
doesn't lean as hard on visual intuition and instead give a
00:05:38
Euclid style proof for the claim that all triangles are isosceles.
00:05:42
The way this will work is we'll take any particular triangle and make no assumptions
00:05:47
about it, I'll label its vertices a, b, and c,
00:05:50
and what I would like to prove for you is that the side length a,
00:05:53
b is necessarily equal to the side length a, c.
00:05:57
Now, to be clear, the result is obviously false,
00:05:59
just in the diagram I've drawn you can visually see that these lengths are
00:06:03
not equal to each other.
00:06:05
But I challenge you to see if you can identify
00:06:07
what's wrong about the proof I'm about to show you.
00:06:10
Honestly, it's very subtle and three gold stars for anyone who can identify it.
00:06:15
The first thing I'll do is draw the perpendicular bisector, the line bc,
00:06:19
so that means this angle here is 90 degrees and this length is by definition
00:06:23
the same as this length, and we'll label that intersection point d.
00:06:28
And then next I will draw the angle bisector at a,
00:06:31
which means by definition this little angle here is the same as this little angle here,
00:06:36
I'll label both of them alpha, and we'll say that the point where these two
00:06:40
intersect is p.
00:06:41
And now, like a lot of Euclid style proofs, we're just going to draw some new lines,
00:06:45
figure out what things must be equal, and get some conclusions.
00:06:48
For instance, let's draw the line from p which is perpendicular to the side length ac,
00:06:53
and we'll label that intersection point e.
00:06:57
And likewise, we'll draw the line from p down to the other side length ac,
00:07:01
again it's perpendicular, and we'll label that intersection point f.
00:07:06
My first claim is that this triangle here, which is afp,
00:07:10
is the same, or at least congruent, to this triangle over here, aep.
00:07:16
Essentially this follows from symmetry across that angle bisector.
00:07:20
More specifically we can say they share a side length,
00:07:22
and then they both have an angle alpha, and both have an angle 90 degrees.
00:07:26
So it follows by the side angle angle congruence relation.
00:07:29
Maybe my drawing is a little bit sloppy, but the
00:07:31
logic helps us see that they do have to be the same.
00:07:34
Next I'll draw a line from p down to b, and then from p down to c,
00:07:38
and I claim that this triangle here is congruent to its reflection across that
00:07:44
perpendicular bisector.
00:07:46
Again the symmetry maybe helps make this clear,
00:07:48
but more rigorously they both have the same base, they both have a 90 degree angle,
00:07:53
and they both have the same height, so it follows by the side angle side relation.
00:07:57
So based on that first pair of triangles I'm going to mark this side length here
00:08:00
as being the same as this side length here, marking them with double tick marks.
00:08:04
And based on the second triangle relation I'll mark this side length here
00:08:09
as the same as this line over here, marking them with triple tick marks.
00:08:13
And so from that we have two more triangles that need to be the same,
00:08:17
namely this one over here, and the one with corresponding two side lengths over here.
00:08:21
And the reasoning here is they both have that triple ticked side,
00:08:25
a double ticked side, and they're both 90 degree triangles.
00:08:28
So this follows by the side side angle congruence relation.
00:08:32
And all of those are valid congruence relations,
00:08:34
I'm not pulling the wool over your eyes with one of those,
00:08:37
and all of this will basically be enough to show us why AB has to be the same as BC.
00:08:42
That first pair of triangles implies that the length AF is the same as the length AE,
00:08:47
those are corresponding sides to each other, I'll just color them in red here,
00:08:52
and then that last triangle relation guarantees for us that the side
00:08:56
FB is going to be the same as the side EC.
00:08:59
I'll kind of color both of those in blue.
00:09:01
And finally the result we want basically comes from adding up these two equations.
00:09:06
The length AF plus FB is clearly the same as the total length AB,
00:09:11
and likewise the length AE plus EC is the same as the total length AC.
00:09:17
So all in all the side length AB has to be the same as the side length AC,
00:09:21
and because we made no assumptions about the triangle this implies that any triangle
00:09:26
is isosceles.
00:09:27
Actually for that matter since we made no assumptions about the
00:09:30
specific two sides we chose, it implies that any triangle is equilateral.
00:09:35
So this leaves us somewhat disturbingly with three different possibilities.
00:09:39
All triangles really are equilateral, that's just the truth of the universe,
00:09:43
or you can use Euclid style reasoning to derive false results,
00:09:46
or there's something wrong in the proof.
00:09:49
But if there is, where exactly is it?
00:09:54
So what exactly is going on with these three examples?
00:09:58
Now the thing that's a little bit troubling about that first
00:10:01
example with the sphere is that it is very similar in spirit to
00:10:04
a lot of other famous and supposedly true visual proofs from geometry.
00:10:08
For example there's a very famous proof about the area of a circle that starts
00:10:12
off by dividing it into a bunch of little pizza wedges,
00:10:15
and you take all those wedges and you straighten them out,
00:10:18
essentially lining up the crust of that pizza,
00:10:20
and then we take half the wedges and inter-slice them with the other half.
00:10:24
And the idea is that this might not be a perfect rectangle,
00:10:27
it's got some bumps and curves, but as you take thinner and thinner slices you get
00:10:32
something that's closer and closer to a true rectangle,
00:10:35
and the width of that rectangle comes from half the circumference of the circle,
00:10:39
which is by definition pi times r, and then the height of that rectangle comes from the
00:10:44
radius of the circle, r, meaning that the whole area is pi r squared.
00:10:48
This time the result is valid, but why is it not okay to do what we did with the spheres,
00:10:53
but somehow it is okay to do this with the pizza slices?
00:10:57
The main problem with the sphere argument is that when we flatten out all
00:11:01
of those orange wedges, if we were to do it accurately in a way that
00:11:04
preserves their area, they don't look like triangles, they should bulge outward.
00:11:09
And if you want to see this, let's think really critically about just one particular
00:11:13
one of those wedges on the sphere, and ask yourself how does the width across that wedge,
00:11:18
this little portion of a line of latitude, vary as you go up and down the wedge?
00:11:22
In particular, if you consider the angle phi from the z-axis down to a point on
00:11:27
this wedge as we walk down it, what's the length of that width as a function of phi?
00:11:32
For those of you curious about the details of these sorts of things,
00:11:36
you'd start off by drawing this line up here from the z-axis to a point on the wedge,
00:11:40
its length will be the radius of the sphere r times the sine of this angle.
00:11:44
That lets us deduce how long the total line of latitude is where we're sitting,
00:11:49
it'll basically be 2 pi times that radial line, 2 pi r sine of phi,
00:11:53
and then the width of the wedge that we care about is just some constant proportion of
00:11:58
that full line of latitude.
00:12:00
Now the details don't matter too much, the one thing I want
00:12:03
you to notice is that this is not a linear relationship.
00:12:06
As you walk from the top of that wedge down to the bottom,
00:12:09
letting phi range from 0 up to pi halves, the width of the wedge doesn't grow linearly,
00:12:14
instead it grows according to a sine curve.
00:12:18
And so when we're unwrapping all of these wedges,
00:12:20
if we want those widths to be preserved, they should end up
00:12:23
a little bit chubbier around the base, their side lengths are not linear.
00:12:28
What this means is when we tried to interlace all of the wedges from the northern
00:12:31
hemisphere with those from the southern, there's a meaningful amount of overlap
00:12:35
between those non-linear edges, and we can't wave our hands about a limiting argument,
00:12:39
this is an overlap that persists as you take finer and finer subdivisions.
00:12:43
And ultimately it's that overlap that accounts for the difference between
00:12:47
our false answer with a pi squared from the true answer that has 4 pi.
00:12:51
It reminds me of one of those rearrangement puzzles where you have a number of
00:12:55
pieces and just by moving them around you can seemingly create area out of nowhere.
00:12:59
For example, right now I've arranged all these pieces to form a triangle,
00:13:03
except it's missing two units of area in the middle.
00:13:05
Now I want you to focus on the vertices of that triangle, these white dots,
00:13:09
those don't move, I'm not pulling any trickery with that,
00:13:12
but I can rearrange all of the pieces back to how they originally were so that those
00:13:16
two units of area in the middle seem to disappear,
00:13:19
while the constituent parts remain the same, the triangle that they form remains
00:13:23
the same, and yet two units of area seem to appear out of nowhere.
00:13:27
If you've never seen this one before, by the way,
00:13:29
I highly encourage you to pause and try to think it through,
00:13:31
it's a very fun little puzzle.
00:13:33
The answer starts to reveal itself if we carefully draw the edges of this triangle and
00:13:38
zoom in close enough to see that our pieces don't actually fit inside the triangle,
00:13:43
they bulge out ever so slightly, or at least arranged like this they bulge out ever so
00:13:48
slightly.
00:13:49
When we rearrange them and we zoom back in we can see that they dent
00:13:53
inward ever so slightly, and that very subtle difference between the
00:13:57
bulge out and the dent inward accounts for all of the difference in area.
00:14:01
The slope of the edge of this blue triangle works out to be 5 divided by 2,
00:14:05
whereas the slope of the edge of this red triangle works out to be 7 divided by 3.
00:14:10
Those numbers are close enough to look similar as slope,
00:14:13
but they allow for this denting inward and the bulging outward.
00:14:16
You have to be wary of lines that are made to look straight when
00:14:19
you haven't had explicit confirmation that they actually are straight.
00:14:24
One quick added comment on the sphere, the fundamental issue here is that the geometry
00:14:29
of a curved surface is fundamentally different from the geometry of flat space.
00:14:33
The relevant search term here would be Gaussian curvature.
00:14:36
You can't flatten things out from a sphere without losing geometric information.
00:14:41
When you see limiting arguments that relate to little pieces on a sphere that
00:14:45
somehow get flattened out and are reasoned through there,
00:14:48
those only can work if the limiting pieces that you're talking about get smaller
00:14:52
in both directions.
00:14:54
It's only when you zoom in close to curved surface that it appears locally flat.
00:14:59
The issue with our orange wedge argument is that our pieces never got
00:15:02
exposed to that local flatness because they only got thin in one direction.
00:15:06
They maintain the curvature in that other direction.
00:15:09
Now on the topic of the subtlety of limiting arguments,
00:15:12
let's turn back to our limit of jagged curves that approaches the smooth circular curve.
00:15:17
As I said, the limiting curve really is a circle and the
00:15:20
limiting value for the length of your approximations really is 8.
00:15:25
Here, the basic issue is that there is no reason to expect that the limit of
00:15:29
the lengths of the curves is the same as the length of the limits of the curves,
00:15:33
and in fact this is a nice counter example to show why that's not the case.
00:15:38
The real point of this example is not the fear that anyone is ever
00:15:42
going to believe that it shows that pi is equal to 4,
00:15:45
instead it shows why care is required in other cases where people apply
00:15:49
limiting arguments.
00:15:50
For example, this happens all throughout calculus.
00:15:53
It is the heart of calculus, where say you want to know the area under a given curve.
00:15:58
The way we typically think about it is to approximate that with a set of rectangles,
00:16:02
because those are the things we know how to compute the areas of.
00:16:05
You just take the base times height in each case.
00:16:08
Now this is a very jagged approximation, but the thought, or I guess the hope,
00:16:12
is that as you take a finer and finer subdivision into thinner and thinner rectangles,
00:16:16
the sums of those areas approaches the thing we actually care about.
00:16:20
If you want to make it rigorous, you have to be explicit about the error between
00:16:24
these approximations and the true thing we care about, the area under this curve.
00:16:29
For example, you might start your argument by saying that that
00:16:32
error has to be strictly less than the area of these red rectangles.
00:16:36
Essentially, the deviation between the curve and our
00:16:39
approximating rectangles sits strictly inside that red region.
00:16:43
And then what you would want to argue is that in this limiting process,
00:16:47
the cumulative area of all of those red rectangles has to approach zero.
00:16:57
Now as to the final example, our proof that all triangles are isosceles,
00:17:00
let me show you what it looks like if I'm a little bit more careful about
00:17:04
actually constructing the angle bisector rather than just eyeballing it.
00:17:08
When I do that, the relevant intersection point actually sits outside of the triangle.
00:17:13
And then from there, if I go through everything that we did in the original argument,
00:17:17
drawing the relevant perpendicular lines, all of that,
00:17:19
every triangle that I claimed was congruent really is congruent.
00:17:22
All of those were genuinely true, and the corresponding lengths of
00:17:25
those triangles that I claimed were the same really are the same.
00:17:28
The one place where the proof breaks down is at the very end,
00:17:32
when I said that the full side length AC was equal to AE plus EC.
00:17:37
That was only true under the hidden assumption that that point E sat in between them.
00:17:43
But in reality, for many triangles, that point would sit outside of those two.
00:17:48
It's pretty subtle, isn't it?
00:17:51
The point in all of this is that while visual intuition is great,
00:17:55
and visual proofs often give you a nice way of elucidating what's going on with
00:17:59
otherwise opaque rigor, visual arguments and snazzy diagrams will never obviate the
00:18:04
need for critical thinking.
00:18:06
In math, you cannot escape the need to look out for hidden assumptions and edge cases.
00:18:32
Thank you.