Video Transcript
I’m gonna guess that you have never
had the experience of your heart rate increasing in excitement while you are
imagining an infinitely large lake with lighthouses around it. Well, if you feel anything like I
do about math, that is gonna change by the end of this video.
Take one plus a fourth plus
one-ninth plus one sixteenth and so on, where you’re adding the inverses of the next
square number. What does this sum approach as you
keep adding on more and more terms? Now, this is a challenge that
remained unsolved for 90 years after it was initially posed, until finally it was
Euler who found the answer, super surprisingly, to be 𝜋 squared divided by six. I mean, isn’t that crazy? What is 𝜋 doing here, and why is
it squared? We don’t usually see it
squared. In honor of Euler, whose hometown
was Basel, this infinite sum is often referred to as the Basel problem. But the proof that I’d like to show
you is very different from the one that Euler had.
I’ve said in a previous video that
whenever you see 𝜋 show up, there will be some connection to circles. And there are those who like to say
that 𝜋 is not fundamentally about circles. And insisting on connecting
equations like these ones with the geometric intuition stems from a stubborn
insistence on only understanding 𝜋 in the context where we first discovered it. And that’s all well and good. But, whatever your own perspective
holds as fundamental, the fact is, 𝜋 is very much tied to circles. So if you see it show up, there
will be a path somewhere in the massive interconnected web of mathematics leading
you back to circles in geometry.
The question is just how long and
convoluted that path might be. And in the case of the Basel
problem, it’s a lot shorter than you might first think. And it all starts with light. Here’s the basic idea. Imagine standing at the origin of a
positive number line and putting a little lighthouse on all of the positive
integers: one, two, three, four, and so on. That first lighthouse has some
apparent brightness from your point of view, some amount of energy that your eye is
receiving from the light per unit time. And let’s just call that a
brightness of one.
For reasons I’ll explain shortly,
the apparent brightness of the second lighthouse is one-fourth as much as the
first. And the apparent brightness of the
third is one-ninth as much as the first and then one sixteenth and so on. And you can probably see why this
is useful for the Basel problem. It gives us a physical
representation of what’s being asked, since the brightness received from the whole
infinite line of lighthouses is gonna be one plus a fourth plus a ninth plus a
sixteenth and so on. So the result that we are aiming to
show is that this total brightness is equal to 𝜋 squared divided by six times the
brightness of that first lighthouse.
And at first that might seem
useless. I mean, we’re just reasking the
same original question. But the progress comes from a new
question that this framing raises. Are there ways that we can
rearrange these lighthouses that don’t change the total brightness for the
observer? And if so, can you show this to be
equivalent to a setup that’s somehow easier to compute? To start, let’s be clear about what
we mean when we reference “apparent brightness” to an observer.
Imagine a little screen which,
maybe, represents the retina of your eye or a digital camera sensor, something like
that. You could ask, what proportion of
the rays coming out of the source hit that screen? Or, phrased differently, what is
the angle between the ray hitting the bottom of that screen and the ray hitting the
top? Or, rather, since we should be
thinking of these lights as being in three dimensions, it might be more accurate to
ask, what is the angle the light covers in both directions perpendicular to the
source?
In spherical geometry, you
sometimes talk about the solid angle of a shape, which is the proportion of a sphere
it covers as viewed from a given point. You see, the first of two places
this story where thinking of screens is gonna be useful is in understanding the
inverse square law, which is a distinctly three-dimensional phenomenon. Think of all of the rays of light
hitting a screen one unit away from the source. As you double the distance, those
rays will now cover an area with twice the width and twice the height. So it would take four copies of
that original screen to receive the same rays at that distance. And so, each individual one
receives one-fourth as much light.
This is the sense in which I mean a
light would appear one-fourth as bright two times the distance away. Likewise, when you’re three times
farther away, you would need nine copies of that original screen to receive the same
rays. So each individual screen only
receives one-ninth as much light. And this pattern continues. Because the area hit by a light
increases by the square of the distance, the brightness of that light decreases by
the inverse square of that distance. And as I’m sure many of you know,
this inverse square law is not at all special to light. It pops up whenever you have some
kind of quantity that spreads out evenly from a point source, whether that’s sound
or heat or a radio signal, things like that.
And remember, it’s because of this
inverse square law that an infinite array of evenly-spaced lighthouses physically
implements the Basel problem. But again, what we need if we’re
gonna make any progress here is to understand how we can manipulate set-ups with
light sources like this without changing the total brightness for the observer. And the key building block is an
especially nice way to transform a single lighthouse into two.
Think of an observer at the origin
of the 𝑥𝑦-plane and a single lighthouse sitting out somewhere on that plane. Now, draw a line from that
lighthouse to the observer and then another line perpendicular to that one at the
lighthouse. Now, place two lighthouses where
this new line intersects the coordinate axes, which I’ll go ahead and call
lighthouse 𝐴 over here on the left and lighthouse 𝐵 on the upper side. It turns out, and you’ll see why
this is true in just a minute, the brightness that the observer experiences from
that first lighthouse is equal to the combined brightness experienced from
lighthouses 𝐴 and 𝐵 together.
And I should say, by the way, that
the standing assumption throughout this video is that all lighthouses are
equivalent. They’re using the same light bulb,
emanating the same power, all of that. So, in other words, assigning
variables to things here, if we call the distance from the observer to lighthouse 𝑎
little 𝑎 and the distance from the observer to lighthouse 𝑏 little 𝑏 and the
distance to the first lighthouse ℎ, we have the relation one over 𝑎 squared plus
one over 𝑏 squared equals one over ℎ squared. This is the much less well-known
inverse Pythagorean theorem which some of you may recognize from Mathologer’s most
recent and I’ll say most excellent video on the many cousins of the Pythagorean
theorem. Pretty cool relation, don’t you
think?
And if you’re a mathematician at
heart, you might be asking right now how you prove it. And there are some straightforward
ways where you express the triangles area in two separate ways and apply the usual
Pythagorean theorem. But there is another quite pretty
method that I’d like to briefly outline here that falls much more nicely into our
storyline because, again, it uses intuitions of light and screens.
Imagine scaling down the whole
right triangle into a tinier version. And think of this miniature
hypotenuse as a screen receiving light from the first lighthouse. If you reshape that screen to be
the combination of the two legs of the miniature triangle, like this, well it still
receives the same amount of light, right? I mean the rays of light hitting
one of those two legs are precisely the same as the rays that hit the
hypotenuse. Then the key is that the amount of
light from the first lighthouse that hits this left side, the limited angle of rays
that end up hitting that screen, is exactly the same as the amount of light over
here coming from lighthouse 𝐴 which hits that side. It’ll be the same angle of
rays.
And symmetrically, the amount of
light from the first house hitting the bottom portion of our screen is the same as
the amount of light hitting that portion from lighthouse 𝐵. Why, you might ask. Well, it’s a matter of similar
triangles. This animation already gives you a
strong hint for how it works. And we’ve also left a link in the
description to a simple GeoGebra applet for those of you who wanna think this
through in a slightly more interactive environment. And in playing with that, one
important fact here that you’ll be able to see is that the similar triangles only
apply in the limiting case for a very tiny screen.
All right, buckle up now cause
here’s where things get good. We’ve got this inverse Pythagorean
theorem, right? And that’s gonna let us transform a
single lighthouse into two others without changing the brightness experienced by the
observer. With that in hand and no small
amount of cleverness, we can use this to build up the infinite array that we
need. Picture yourself at the edge of a
circular lake directly opposite a lighthouse. We’re gonna want it to be the case
that the distance between you and the lighthouse along the border of the lake is
one. So we’ll say the lake has a
circumference of two.
Now, the apparent brightness is one
divided by the diameter squared. And, in this case, the diameter is
that circumference, two, divided by 𝜋. So the apparent brightness works
out to be 𝜋 squared divided by four. Now, for our first transformation,
draw a new circle twice as big, so circumference four, and draw a tangent line to
the top of the small circle. Then replace the original
lighthouse with two new ones where this tangent line intersects the larger
circle. An important fact from geometry
that we’ll be using over and over here is that if you take the diameter of a circle
and form a triangle with any point on the circle, the angle at that new point will
always be 90 degrees. The significance of that in our
diagram here is that it means the inverse Pythagorean theorem applies. And the brightness from those two
new lighthouses equals the brightness from the first one; namely, 𝜋 squared divided
by four.
As the next step, draw a new circle
twice as big as the last with a circumference eight. Now, for each lighthouse, take a
line from that lighthouse through the top of the smaller circle, which is the center
of the larger circle, and consider the two points where that intersects with the
larger circle. Again, since this line is a
diameter of that large circle, then the lines from those two new points to the
observer are gonna form a right angle. Likewise, by looking at this right
triangle here, whose hypotenuse is the diameter of the smaller circle, you can see
that the line from the observer to that original lighthouse is at a right angle,
with a new long line that we drew. Good news, right? Because that means we can apply the
inverse Pythagorean theorem. And that means that the apparent
brightness from the original lighthouse is the same as the combined brightness from
the two newer ones.
And of course, you can do that same
thing over on the other side, drawing a line through the top of the smaller circle
and getting two new lighthouses on the larger circle. And even nicer, these four
lighthouses are all gonna be evenly spaced around the lake. Why? Well, the lines from those
lighthouses to the center are at 90-degree angles with each other. So since things are symmetric left
to right, that means that the distances along the circumference are one, two, two,
two, and one. All right, you might see where this
is going. But I wanna walk through this for
just one more step.
You draw a circle twice as big, so
circumference of 16 now. And for each lighthouse, you draw a
line from that lighthouse through the top of the smaller circle, which is the center
of the bigger circle. And then, create two new
lighthouses where that line intersects with the larger circle. Just as before, because the long
line is a diameter of the big circle, those two new lighthouses make a right angle
with the observer, right? And just as before, the line from
the observer to the original lighthouse is perpendicular to the long line.
And those are the two facts that
justify us in using the inverse Pythagorean theorem. But what might not be as clear is
that when you do this for all of the lighthouses to get eight new ones on the big
lake, those eight new lighthouses are gonna be evenly spaced. This is the final bit of geometry
proofiness before the final thrust. To see this, remember that if you
draw lines from two adjacent lighthouses on the small lake to the center, they make
a 90-degree angle. If, instead, you draw lines to a
point anywhere on the circumference of the circle, that’s not between them, the very
useful inscribed angle theorem from geometry tells us that this will be exactly half
of the angle that they make with the center, in this case 45 degrees.
But, when we position that new
point at the top of the lake, these are the two lines which define the position of
the new lighthouses on the larger lake. What that means then is that when
you draw lines from those eight new lighthouses into the center, they divide the
circle evenly into 45-degree angle pieces. And that means the eight
lighthouses are evenly spaced around the circumference, with a distance of two
between each one of them. And now, just imagine this thing
playing on at every step doubling the size of each circle and transforming each
lighthouse into two new ones along a line drawn through the center of the larger
circle. At every step, the apparent
brightness to the observer remains the same, 𝜋 squared over four. And at every step, the lighthouses
remain evenly spaced with a distance two between each one of them on the
circumference.
And in the limit, what we’re
getting here is a flat horizontal line with an infinite number of lighthouses evenly
spaced in both directions. And because the apparent brightness
was 𝜋 squared over four the entire way, that will, also be true in this limiting
case. And this gives us a pretty awesome
infinite series. The sum of the inverse squares one
over 𝑛 squared, where 𝑛 covers all of the odd integers — one, three, five, and so
on, but also negative one, negative three, negative five, often the leftward
direction. Adding all of those up is gonna
give us 𝜋 squared over four.
That’s amazing! And it’s the core of what I wanna
show you. And just take a step back and think
about how unreal this seems. The sum of simple fractions that at
first sight have nothing to do with geometry, nothing to do with circles at all,
apparently, gives us this result that’s related to 𝜋. Except now, you can actually see
what it has to do with geometry. The number line is kind of like a
limit of ever-growing circles. And as you sum across that number
line, making sure to sum all the way to infinity on either side, it’s sort of like
you’re adding up along the boundary of an infinitely-large circle, in a very loose
but very fun way of speaking.
“But wait!” you might say. This is not the sum that you
promised us at the start of the video. And, well, you’re right. We do have a little bit of thinking
left. First things first, let’s just
restrict this sum to only being the positive odd numbers, which gets us 𝜋 squared
divided by eight. Now, the only difference between
this and the sum that we’re looking for that goes over all the positive integers,
odd and even, is that it’s missing the sum of the reciprocals of even numbers, what
I’m coloring in red up here. Now, you can think of that missing
series as a scaled copy of the total series that we want, where each lighthouse
moves to being twice as far away from the origin. One gets shifted to two; two gets
shifted to four; three gets shifted to six, and so on.
And because that involves doubling
the distance for every lighthouse, it means that the apparent brightness would be
decreased by a factor of four. And that’s also relatively
straightforward algebra. Going from the sum over all the
integers to the sum over the even integers involves multiplying by one-fourth. And what that means is that going
from all the integers to the odd ones would be multiplying by three-fourths, since
the evens plus the odds have to give us the whole thing. So if we just flip that around,
that means going from the sum over the odd numbers to the sum over all positive
integers requires multiplying by four-thirds. So taking that 𝜋 squared over
eight, multiplying by four-thirds, bada boom bada bing! We’ve got ourselves a solution to
the Basel problem.