Pop Video: Who (Else) Cares about Topology? Stolen Necklaces and Borsuk-Ulam

Grant Sanderson • 3Blue1Brown • Boclips

Who (Else) Cares about Topology? Stolen Necklaces and Borsuk-Ulam

20:34

Video Transcript

You know that feeling you get when two things that seem completely unrelated turn out to have a key connection. In math especially, there’s a certain tingly sensation that I get whenever one of those connections starts to fall into place. That is what I have in store for you today. It takes a little time to set up. I have to introduce this fair division puzzle in discrete math; it’s called the stolen necklace problem. And we also have to set up a certain topological fact about spheres that we’ll use to solve it, called the Borsuk–Ulam theorem. But trust me, seeing how these two seemingly disconnected pieces of math come together is well worth the setup.

And more fun, I coordinated this video with Mathologer, who just put out a video solving a very similar fair division problem, but with a completely different tactic. So after this video, if you’re eager to learn more, head on over to his channel. If somehow you don’t already know about Mathologer, his stuff is some of the best math on YouTube, definitely poke around the rest of the channel and subscribe if you like his stuff as much as I do.

So here’s the puzzle that we’re gonna solve, the stolen necklace problem. You and your friend steal a necklace, full of a whole bunch of very valuable jewels. Maybe, it’s got some sapphires, emeralds, diamonds, and rubies. And let’s say they’re all arranged on the necklace in some totally random order. Moreover, let’s say that there happened to be an even number of each type of jewel. Right here, I have eight sapphires, 10 emeralds, four diamonds, and six rubies. You and your friend want to split the booty evenly, with each of you getting half of each jewel type. Four sapphires, five emeralds, two diamonds, and three rubies each.

And, of course, you could just cut all of the jewels off the necklace and divvy them up evenly, but that’s boring. There’s not really a puzzle there. Instead, the challenge is to make as few cuts to the necklace as possible so that you can divvy up the resulting segments between you and your co-conspirator and still have each of you end up with half of each jewel type. For example, I just did it using four cuts. If I give these top three strands to you and these bottom two strands to your co-conspirator. Notice how each of you ends up with four sapphires. Each of you has five emeralds, two diamonds, and three rubies.

The claim, the thing, that I wanna prove in this video is that if you have 𝑛 different types of jewels, it’s always possible to find a fair division using only 𝑛 cuts or fewer. So with four jewel types, like this example, you should always be able to find a way to make four cuts and divvy up the five resulting pieces so that each thief has the same number of each jewel type. If there were five jewel types, you should be able to do it in five cuts, no matter what the arrangement is. It’s kinda hard to think about, right? I mean you need to keep track of all of these different types of jewels, ensuring that they’re divided fairly. But at the same time, you have to try to minimize how many cuts you’re making.

Depending on your disposition to puzzles in math, maybe this feels a little contrived. But the core characteristics that make this problem hard. Like trying to minimize sharding and trying to allocate some collection of things in a balanced way. These are the kinds of optimization issues that actually come about a fair amount in practical applications. For the computer system folk out there, I’m sure you can imagine how this could relate to some kind of efficient memory allocation problem. Also, if you’re curious to actually see it in action. I’ve left a link in the video description to a certain electrical engineering paper that uses this very problem.

Independent from its usefulness, though, it certainly makes for a good puzzle. Can you always find a fair division using only as many cuts as there are types of jewels? So that’s the puzzle; remember it. And now, we’re gonna take a seemingly unrelated sidestep to the total opposite side of the mathematical universe, topology. Imagine taking a sphere in three-dimensional space and squishing it somehow onto the 2D plane, stretching and morphing it however you want as you do so. The only constraint is that you do this continuously, which you can think of as meaning just never cut the sphere or tear it in any way during the mapping. Now, as you do this, continuously squishing that sphere onto the plane. Many different pairs of points on the sphere are gonna land on top of each other once it hits the plane. And that’s not really a big deal.

The special fact that we’re gonna use, known as the Borsuk–Ulam theorem, is that you will always be able to find a pair of points that started off on exact opposite sides of the sphere for which land on each other during the mapping. Points on the exact opposite side of a sphere are called antipodes or antipodal points.

For example, let’s say you’re thinking of the sphere as Earth and the mapping you choose is to just project every point directly onto the plane of the equator. Well, in that case, the north and the south pole, which are antipodal, each land on the same point. And in this example, that’s the only antipodal pair that lands on the same point. Any other antipodal pair will end up somehow offset from each other. But let’s say you tweak this function a bit, maybe shearing it during the projection. Well, in that case, the north and south pole probably don’t land on each other anymore. But when the topology gods close a door, they open a window. Because the Borsuk–Ulam theorem guarantees that no matter what, there must be some other antipodal pair that now land on top of each other.

The classic example to illustrate this idea, which any math educator introducing the Borsuk–Ulam theorem is required by law to present. Is that there must exist some pair of points on the opposite side of the Earth where the temperature and the barometric pressure are both precisely the same. Think about it for a moment. Associating each point on the Earth with a pair of numbers, temperature, and pressure is the same as mapping the surface of the Earth into a 2D coordinate plane. Where the first coordinate represents temperature and the second one represents pressure. Also, each of those values varies continuously as you wander around the Earth. So this association is a continuous mapping from the surface of a sphere onto the plane, some nontearing way to squish that surface into two dimensions.

So what the Borsuk–Ulam theorem guarantees is that no matter what the weather patterns on Earth, or any other planet for that matter, some pair of antipodal points somewhere must land on top of each other. Which means they map to the same temperature pressure coordinates. Now, I imagine if you’re watching this video, you’re probably a mathematician at heart and you wanna see why this is true, not just that it’s true. Vsauce actually talked about the Borsuk–Ulam theorem in a great video that he recently did about fixed points. And he gave a really beautiful line of reasoning to explain it intuitively, which I’m just gonna shamelessly coopt for my own use here.

Given some function from the sphere onto the plane, imagine walking around the equator. The corresponding outputs on the plane are gonna form some kind of closed loop. And let’s say that your sister is on the exact opposite side of the globe. And as you walk around, she continues keeping herself perfectly antipodally opposite from you. Since the two of you eventually swap places, at some point along the way, the 𝑥-coordinates of your corresponding outputs have to line up. The first time this happens, I want you to mark where you are on the sphere, as well as where your antipodal sister is. Then if you tilt the equator slightly and walk along a slightly different great circle, the corresponding loop in the output space is gonna alter a bit.

But by the same line of reasoning, there has to be some point on your walk where you and your antipodal sister land on outputs with the same 𝑥-coordinate, lining up vertically. Mark those two points on the sphere as well. If you repeat this, continuously turning that equator 180 degrees around the full circle, your marked points are gonna make up some new closed loop around the sphere. Here is what that might look like in the output space, keeping track of all of the points where you and your antipodal sister first line up vertically. Every point on this new loop is one where you and your antipodal sister, by definition, end up with the same 𝑥-coordinate. So if the two of you are walking around this new loop, you always line up vertically.

But what’s more, since the two of you are eventually gonna swap places, there must be some point along the way where you also have the same 𝑦-coordinate, aligning horizontally. That gives a point where you and your antipodal sister must land on the same output. Pretty cool, right? To help set the stage for how the heck this applies to the necklace problem, I wanna write what this means a little more symbolically. Points in 3D space are represented with three coordinates, right? I mean, in some sense, that’s what 3D space is, to a mathematician at least, all possible triplets of numbers. Now, the simplest sphere to describe with coordinates is a standard unit sphere centered at the origin, the set of all points a distance one from that origin. Meaning, all triplets of numbers with the special property that the sum of their squares equals one.

So the geometric idea of a sphere is related to the algebraic idea of some set of positive numbers that add up to one, remember that. If you have one of these triplets, the point on the opposite side of the sphere, the corresponding antipodal point, is whatever you get by flipping the sign of each coordinate, right? So let’s just write out what the Borsuk–Ulam theorem is saying symbolically. This is gonna help for where we’re going. For any function that takes in points on the sphere, triplets of numbers whose squares sum to one, and spits out some point in 2D space, some pair of coordinates like temperature and pressure. As long as that function is continuous, there will be some input so that flipping all the signs doesn’t change the output. And with that on the table, let’s turn back to the stolen necklace problem.

Part of the reason that these two things feel so unrelated is that the necklace problem is discrete. But the Borsuk–Ulam theorem applies to a continuous situation. So our first step is to translate the stolen necklace problem into a continuous version. For right now, let’s limit ourselves to the case where there are only two jewel types, sapphires and emeralds. And we’re hoping to make a fair division of the necklace after only two cuts. As an example to have upon the screen, let’s say that we have eight sapphires and 10 emeralds on the necklace. Just as a reminder, this means that the goal is to cut the necklace in two different spots and divvy up those three segments so that each thief has half of the sapphires and half of the emeralds. Notice how the top and bottom here each have four sapphires, and each have five emeralds.

Think of this necklace as a line with length one, with the jewels sitting evenly spaced on it. Now, divide up that line into 18 evenly sized segments, one for each jewel type. And rather than thinking of each jewel as a discrete indivisible entity on the segment, remove the jewel itself. And, instead, just paint that segment the color of the jewel. So in this case, painting the whole necklace appropriately, eight eighteenths of the line are gonna be painted sapphire, while ten eighteenths of the line are gonna be painted emerald. The continuous version of this puzzle is to now ask whether we can find two cuts anywhere on this line, not necessarily on these one eighteenth interval marks, that let us divide up the pieces so that each thief has an equal length of each color.

In this case, that means each thief should come up with a total length of four eighteenths of sapphire-colored segments and five eighteenths of emerald-colored segments. An important but somewhat subtle point is that if you can solve this continuous variant of the puzzle, you can also solve the original discrete version. To see why, let’s say that you do find a fair division but whose cuts don’t necessarily fall cleanly between jewels. Maybe it cuts part of the way through an emerald segment. Well, because this is a fair division, the length of emerald in both the top and the bottom group has to add up to exactly five emerald segments, a whole number multiple of the segment length. So even if the division does cut partially into an emerald segment on the left, it would have to also cut partially into an emerald segment on the right here. So that the total length can add up to a whole number multiple of the segment length.

What that means is that we can adjust each cut without affecting the division so that they ultimately do line up on these one eighteenth marks. Now, in this continuous case, where you can cut the line wherever the heck you want, think about all of the choices that go into cutting the necklace and allocating its pieces. First, you choose two different places to cut the interval. But another way to think of that is to choose three positive numbers that add up to one. For example, maybe you choose one-sixth, one-third, and one-half. That would correspond with these two cuts here. Anytime that you find three positive numbers that add to one, it gives you a way to cut the necklace, and vice versa. Then after you cut it, you have to make a binary choice for each one of those three pieces for whether it goes to the thief one or if it goes to thief two.

Now compare that to if I asked you to choose some arbitrary point on the 3D sphere. Some point with coordinates 𝑥, 𝑦, 𝑧 so that 𝑥 squared plus 𝑦 squared plus 𝑧 squared equals one. Well, you might start off by choosing three positive numbers that add up to one. Maybe, you want 𝑥 squared to be one-sixth, 𝑦 squared to be one-third, and 𝑧 squared to be one-half. Then, you have to make a binary choice for each one, choosing whether to take the positive square root or the negative square root. So in a way that’s completely parallel to choosing a necklace division, choosing a point on the sphere involves first finding three positive numbers that add up to one and then making a binary choice for what to do with each one of them. That right there is a key observation for the whole video. It gives a correspondence between points on the sphere and necklace divisions.

For any point 𝑥, 𝑦, 𝑧 that sits on the sphere, because 𝑥 squared plus 𝑦 squared plus 𝑧 squared equals one, you can cut the necklace so that the first piece has a length of 𝑥 squared. The second has a length of 𝑦 squared, and the third has a length of 𝑧 squared. Then to choose how to allocate these pieces, if 𝑥 is positive, give it to thief one. Otherwise, give it to thief two. If 𝑦 is positive, give that second piece to thief one. Otherwise, give it to thief two. And then, similarly, to allocate the third piece, if 𝑧 is positive, give it to thief one. Otherwise, give it to thief two. And you can go the other way around; this is a one-to-one correspondence. Any way to divide up the necklace and divvy up the pieces would give you a unique point on the sphere.

It’s as if the sphere is the perfect way to encapsulate the idea of all possible necklace divisions using a geometric object. And with that association, we are tantalizingly close. Take a moment and think about the meaning of antipodal points under this association. If the point 𝑥, 𝑦, 𝑧 on the sphere corresponds to some necklace allocation, what does the point negative 𝑥, negative 𝑦, negative 𝑧 correspond to? Well, the squares of all of these coordinates are the same. So it would correspond to making the same cuts on the necklace. The difference is that each piece switches which thief it belongs to. So jumping to an antipodal point on the opposite side of the sphere corresponds to exchanging all the pieces between the two thieves.

Now, remember what it is that we’re actually looking for. We want the total length of each jewel type belonging to thief one to equal that for thief two. In other words, in a fair division, performing this antipodal swap doesn’t change the amount of each jewel belonging to each thief. Your brain should be burning with the thought of Borsuk–Ulam at this point. Specifically, the way you might move forward is to construct a certain function, a function that takes in a given necklace allocation and spits out two numbers. The total length of sapphire belonging to thief one and the total length of emerald belonging to thief one.

What we wanna show is that there must exist a way to divide the necklace with only two cuts and divvy up the pieces so that those two numbers are the same as what they would have been for thief two. Or, said a little differently, where swapping all of the pieces won’t change those two numbers for thief one. Because of this back and forth between necklace allocations and points on the sphere and because pairs of numbers correspond to points on the 𝑥𝑦-plane, this is in effect a mapping from the sphere onto the plane. So what the Borsuk–Ulam theorem guarantees is that some antipodal pair of points on the sphere land on each other in the plane. And what that means is there is some necklace division and allocation so that swapping the pieces between the two thieves won’t change the amount of each jewel that each one has.

That’s a fair division. That, my friends, is what beautiful math feels like. And if you’re anything like me, you’re just basking in the glow of what a clever proof that is. And it might be easy to forget that we actually wanna solve the more general stolen necklace problem, one that has more than just two jewel types. Well, for that, we can use a more general version of the Borsuk–Ulam theorem, one that applies to higher-dimensional spheres. As an example one dimension up, Borsuk–Ulam also applies to mapping hyperspheres in 4D space into 3D space. What I mean by a hypersphere, the way to think about it, is all possible lists of four coordinates where the sum of their squares equals one. Those are all of the points in 4D space a distance one from the origin.

The Borsuk–Ulam theorem says that if you try to map that set, all of those special quadruplets, into three-dimensional space, continuously associating each one with some triplet of numbers. There must be some antipodal collision, an input 𝑥 one, 𝑥 two, 𝑥 three, 𝑥 four, where flipping all of the signs wouldn’t change the output. I’m gonna leave it to you to pause and ponder and think about how that would apply to the three-jewel case. And about what the general statement of the Borsuk–Ulam theorem might be and how it can apply to the general necklace problem. Needless to say, actually trying to visualize a 4D sphere mapping into 3D space is rather difficult. Nevertheless, for the final animation, I’m gonna try to show you what that might look like.

Alright, so here’s that final animation. It’s a little confusing. So for analogy on the upper left here, I’m rotating a sphere in three dimensions and projecting it into 2D. But I’m only showing the lines of latitude on that sphere and not shading it or anything like that. So, similarly in the center, I’m rotating a four-dimensional hypersphere in 4D space but projecting it into 3D space. But all I’m showing is the spheres of latitude, so to speak. Keep in mind, you don’t actually need to be able to visualize a 4D sphere mapping into three dimensions to be able to understand the Borsuk–Ulam theorem. Much less, how it applies to the stolen necklace problem. This is just for fun. But it is pretty to try, don’t you think?

Pop Video: Who (Else) Cares about Topology? Stolen Necklaces and Borsuk-Ulam

Video Transcript

Rejoindre Nagwa Classes