I’m gonna guess that you have never had the experience of your heart rate increasing
in excitement while you are imagining an infinitely large lake with lighthouses
around it. Well, if you feel anything like I do about math, that is gonna change by the end of
Take one plus a fourth plus one-ninth plus one sixteenth and so on, where you’re
adding the inverses of the next square number. What does this sum approach as you keep adding on more and more terms? Now, this is a challenge that remained unsolved for 90 years after it was initially
posed, until finally it was Euler who found the answer, super surprisingly, to be 𝜋
squared divided by six. I mean, isn’t that crazy? What is 𝜋 doing here, and why is it squared? We don’t usually see it squared. In honor of Euler, whose hometown was Basel, this infinite sum is often referred to
as the Basel problem. But the proof that I’d like to show you is very different from the one that Euler
I’ve said in a previous video that whenever you see 𝜋 show up, there will be some
connection to circles. And there are those who like to say that 𝜋 is not fundamentally about circles. And insisting on connecting equations like these ones with the geometric intuition
stems from a stubborn insistence on only understanding 𝜋 in the context where we
first discovered it. And that’s all well and good. But, whatever your own perspective holds as fundamental, the fact is, 𝜋 is very much
tied to circles. So if you see it show up, there will be a path somewhere in the massive
interconnected web of mathematics leading you back to circles in geometry.
The question is just how long and convoluted that path might be. And in the case of the Basel problem, it’s a lot shorter than you might first
think. And it all starts with light. Here’s the basic idea. Imagine standing at the origin of a positive number line and putting a little
lighthouse on all of the positive integers: one, two, three, four, and so on. That first lighthouse has some apparent brightness from your point of view, some
amount of energy that your eye is receiving from the light per unit time. And let’s just call that a brightness of one.
For reasons I’ll explain shortly, the apparent brightness of the second lighthouse is
one-fourth as much as the first. And the apparent brightness of the third is one-ninth as much as the first and then
one sixteenth and so on. And you can probably see why this is useful for the Basel problem. It gives us a physical representation of what’s being asked, since the brightness
received from the whole infinite line of lighthouses is gonna be one plus a fourth
plus a ninth plus a sixteenth and so on. So the result that we are aiming to show is that this total brightness is equal to 𝜋
squared divided by six times the brightness of that first lighthouse.
And at first that might seem useless. I mean, we’re just reasking the same original question. But the progress comes from a new question that this framing raises. Are there ways that we can rearrange these lighthouses that don’t change the total
brightness for the observer? And if so, can you show this to be equivalent to a setup that’s somehow easier to
compute? To start, let’s be clear about what we mean when we reference “apparent brightness”
to an observer.
Imagine a little screen which, maybe, represents the retina of your eye or a digital
camera sensor, something like that. You could ask, what proportion of the rays coming out of the source hit that
screen? Or, phrased differently, what is the angle between the ray hitting the bottom of that
screen and the ray hitting the top? Or, rather, since we should be thinking of these lights as being in three dimensions,
it might be more accurate to ask, what is the angle the light covers in both
directions perpendicular to the source?
In spherical geometry, you sometimes talk about the solid angle of a shape, which is
the proportion of a sphere it covers as viewed from a given point. You see, the first of two places this story where thinking of screens is gonna be
useful is in understanding the inverse square law, which is a distinctly
three-dimensional phenomenon. Think of all of the rays of light hitting a screen one unit away from the source. As you double the distance, those rays will now cover an area with twice the width
and twice the height. So it would take four copies of that original screen to receive the same rays at that
distance. And so, each individual one receives one-fourth as much light.
This is the sense in which I mean a light would appear one-fourth as bright two times
the distance away. Likewise, when you’re three times farther away, you would need nine copies of that
original screen to receive the same rays. So each individual screen only receives one-ninth as much light. And this pattern continues. Because the area hit by a light increases by the square of the distance, the
brightness of that light decreases by the inverse square of that distance. And as I’m sure many of you know, this inverse square law is not at all special to
light. It pops up whenever you have some kind of quantity that spreads out evenly from a
point source, whether that’s sound or heat or a radio signal, things like that.
And remember, it’s because of this inverse square law that an infinite array of
evenly-spaced lighthouses physically implements the Basel problem. But again, what we need if we’re gonna make any progress here is to understand how we
can manipulate set-ups with light sources like this without changing the total
brightness for the observer. And the key building block is an especially nice way to transform a single lighthouse
Think of an observer at the origin of the 𝑥𝑦-plane and a single lighthouse sitting
out somewhere on that plane. Now, draw a line from that lighthouse to the observer and then another line
perpendicular to that one at the lighthouse. Now, place two lighthouses where this new line intersects the coordinate axes, which
I’ll go ahead and call lighthouse 𝐴 over here on the left and lighthouse 𝐵 on the
upper side. It turns out, and you’ll see why this is true in just a minute, the brightness that
the observer experiences from that first lighthouse is equal to the combined
brightness experienced from lighthouses 𝐴 and 𝐵 together.
And I should say, by the way, that the standing assumption throughout this video is
that all lighthouses are equivalent. They’re using the same light bulb, emanating the same power, all of that. So, in other words, assigning variables to things here, if we call the distance from
the observer to lighthouse 𝑎 little 𝑎 and the distance from the observer to
lighthouse 𝑏 little 𝑏 and the distance to the first lighthouse ℎ, we have the
relation one over 𝑎 squared plus one over 𝑏 squared equals one over ℎ squared. This is the much less well-known inverse Pythagorean theorem which some of you may
recognize from Mathologer’s most recent and I’ll say most excellent video on the
many cousins of the Pythagorean theorem. Pretty cool relation, don’t you think?
And if you’re a mathematician at heart, you might be asking right now how you prove
it. And there are some straightforward ways where you express the triangles area in two
separate ways and apply the usual Pythagorean theorem. But there is another quite pretty method that I’d like to briefly outline here that
falls much more nicely into our storyline because, again, it uses intuitions of
light and screens.
Imagine scaling down the whole right triangle into a tinier version. And think of this miniature hypotenuse as a screen receiving light from the first
lighthouse. If you reshape that screen to be the combination of the two legs of the miniature
triangle, like this, well it still receives the same amount of light, right? I mean the rays of light hitting one of those two legs are precisely the same as the
rays that hit the hypotenuse. Then the key is that the amount of light from the first lighthouse that hits this
left side, the limited angle of rays that end up hitting that screen, is exactly the
same as the amount of light over here coming from lighthouse 𝐴 which hits that
side. It’ll be the same angle of rays.
And symmetrically, the amount of light from the first house hitting the bottom
portion of our screen is the same as the amount of light hitting that portion from
lighthouse 𝐵. Why, you might ask. Well, it’s a matter of similar triangles. This animation already gives you a strong hint for how it works. And we’ve also left a link in the description to a simple GeoGebra applet for those
of you who wanna think this through in a slightly more interactive environment. And in playing with that, one important fact here that you’ll be able to see is that
the similar triangles only apply in the limiting case for a very tiny screen.
All right, buckle up now cause here’s where things get good. We’ve got this inverse Pythagorean theorem, right? And that’s gonna let us transform a single lighthouse into two others without
changing the brightness experienced by the observer. With that in hand and no small amount of cleverness, we can use this to build up the
infinite array that we need. Picture yourself at the edge of a circular lake directly opposite a lighthouse. We’re gonna want it to be the case that the distance between you and the lighthouse
along the border of the lake is one. So we’ll say the lake has a circumference of two.
Now, the apparent brightness is one divided by the diameter squared. And, in this case, the diameter is that circumference, two, divided by 𝜋. So the apparent brightness works out to be 𝜋 squared divided by four. Now, for our first transformation, draw a new circle twice as big, so circumference
four, and draw a tangent line to the top of the small circle. Then replace the original lighthouse with two new ones where this tangent line
intersects the larger circle. An important fact from geometry that we’ll be using over and over here is that if you
take the diameter of a circle and form a triangle with any point on the circle, the
angle at that new point will always be 90 degrees. The significance of that in our diagram here is that it means the inverse Pythagorean
theorem applies. And the brightness from those two new lighthouses equals the brightness from the
first one; namely, 𝜋 squared divided by four.
As the next step, draw a new circle twice as big as the last with a circumference
eight. Now, for each lighthouse, take a line from that lighthouse through the top of the
smaller circle, which is the center of the larger circle, and consider the two
points where that intersects with the larger circle. Again, since this line is a diameter of that large circle, then the lines from those
two new points to the observer are gonna form a right angle. Likewise, by looking at this right triangle here, whose hypotenuse is the diameter of
the smaller circle, you can see that the line from the observer to that original
lighthouse is at a right angle, with a new long line that we drew. Good news, right? Because that means we can apply the inverse Pythagorean theorem. And that means that the apparent brightness from the original lighthouse is the same
as the combined brightness from the two newer ones.
And of course, you can do that same thing over on the other side, drawing a line
through the top of the smaller circle and getting two new lighthouses on the larger
circle. And even nicer, these four lighthouses are all gonna be evenly spaced around the
lake. Why? Well, the lines from those lighthouses to the center are at 90-degree angles with
each other. So since things are symmetric left to right, that means that the distances along the
circumference are one, two, two, two, and one. All right, you might see where this is going. But I wanna walk through this for just one more step.
You draw a circle twice as big, so circumference of 16 now. And for each lighthouse, you draw a line from that lighthouse through the top of the
smaller circle, which is the center of the bigger circle. And then, create two new lighthouses where that line intersects with the larger
circle. Just as before, because the long line is a diameter of the big circle, those two new
lighthouses make a right angle with the observer, right? And just as before, the line from the observer to the original lighthouse is
perpendicular to the long line.
And those are the two facts that justify us in using the inverse Pythagorean
theorem. But what might not be as clear is that when you do this for all of the lighthouses to
get eight new ones on the big lake, those eight new lighthouses are gonna be evenly
spaced. This is the final bit of geometry proofiness before the final thrust. To see this, remember that if you draw lines from two adjacent lighthouses on the
small lake to the center, they make a 90-degree angle. If, instead, you draw lines to a point anywhere on the circumference of the circle,
that’s not between them, the very useful inscribed angle theorem from geometry tells
us that this will be exactly half of the angle that they make with the center, in
this case 45 degrees.
But, when we position that new point at the top of the lake, these are the two lines
which define the position of the new lighthouses on the larger lake. What that means then is that when you draw lines from those eight new lighthouses
into the center, they divide the circle evenly into 45-degree angle pieces. And that means the eight lighthouses are evenly spaced around the circumference, with
a distance of two between each one of them. And now, just imagine this thing playing on at every step doubling the size of each
circle and transforming each lighthouse into two new ones along a line drawn through
the center of the larger circle. At every step, the apparent brightness to the observer remains the same, 𝜋 squared
over four. And at every step, the lighthouses remain evenly spaced with a distance two between
each one of them on the circumference.
And in the limit, what we’re getting here is a flat horizontal line with an infinite
number of lighthouses evenly spaced in both directions. And because the apparent brightness was 𝜋 squared over four the entire way, that
will, also be true in this limiting case. And this gives us a pretty awesome infinite series. The sum of the inverse squares one over 𝑛 squared, where 𝑛 covers all of the odd
integers — one, three, five, and so on, but also negative one, negative three,
negative five, often the leftward direction. Adding all of those up is gonna give us 𝜋 squared over four.
That’s amazing! And it’s the core of what I wanna show you. And just take a step back and think about how unreal this seems. The sum of simple fractions that at first sight have nothing to do with geometry,
nothing to do with circles at all, apparently, gives us this result that’s related
to 𝜋. Except now, you can actually see what it has to do with geometry. The number line is kind of like a limit of ever-growing circles. And as you sum across that number line, making sure to sum all the way to infinity on
either side, it’s sort of like you’re adding up along the boundary of an
infinitely-large circle, in a very loose but very fun way of speaking.
“But wait!” you might say. This is not the sum that you promised us at the start of the video. And, well, you’re right. We do have a little bit of thinking left. First things first, let’s just restrict this sum to only being the positive odd
numbers, which gets us 𝜋 squared divided by eight. Now, the only difference between this and the sum that we’re looking for that goes
over all the positive integers, odd and even, is that it’s missing the sum of the
reciprocals of even numbers, what I’m coloring in red up here. Now, you can think of that missing series as a scaled copy of the total series that
we want, where each lighthouse moves to being twice as far away from the origin. One gets shifted to two; two gets shifted to four; three gets shifted to six, and so
And because that involves doubling the distance for every lighthouse, it means that
the apparent brightness would be decreased by a factor of four. And that’s also relatively straightforward algebra. Going from the sum over all the integers to the sum over the even integers involves
multiplying by one-fourth. And what that means is that going from all the integers to the odd ones would be
multiplying by three-fourths, since the evens plus the odds have to give us the
whole thing. So if we just flip that around, that means going from the sum over the odd numbers to
the sum over all positive integers requires multiplying by four-thirds. So taking that 𝜋 squared over eight, multiplying by four-thirds, bada boom bada
bing! We’ve got ourselves a solution to the Basel problem.