# Pop Video: Essence of Calculus

Grant Sanderson • 3Blue1Brown • Boclips

Essence of Calculus

16:11

### Video Transcript

Hey everyone, Grant here. This is the first video in a series on the essence of calculus. And I’ll be publishing the following videos once per day for the next 10 days. The goal here, as the name suggests, is to really get the heart of the subject out in one binge-watchable set. But with a topic that’s as broad as calculus, there’s a lot of things that can mean. So, here’s what I’ve in my mind specifically.

Calculus has a lot of rules and formulas which are often presented as things to be memorized, lots of derivative formulas, the product rule, the chain rule, the implicit differentiation, the fact that integrals and derivatives are opposite, Taylor series, just a lot of things like that. And my goal is for you to come away feeling like you could’ve invented calculus yourself. That is, cover all those core ideas, but in the way that makes clear where they actually come from and what they really mean, using an all-around visual approach.

Inventing math is no joke. And there is a difference between being told why something’s true and actually generating it from scratch. But at all points, I want you to think to yourself. If you were an early mathematician pondering these ideas and drawing out the right diagrams, does it feel reasonable that you could’ve stumbled across these truths yourself?

In this initial video, I wanna show how you might stumble into the core ideas of calculus by thinking very deeply about one specific bit of geometry, the area of a circle. Maybe you know that this is 𝜋 times its radius squared, but why? Is there a nice way to think about where this formula comes from? Well, contemplating this problem and leaving yourself open to exploring the interesting thoughts that come about can actually lead you to a glimpse of three big ideas in calculus: integrals, derivatives, and the fact that they’re opposites.

But the story starts more simply, just you and a circle, let’s say with radius three. You’re trying to figure out its area. And after going through a lot of paper trying different ways to chop up and rearrange the pieces of that area, many of which might lead to their own interesting observations, maybe you try out the idea of slicing up the circle into many concentric rings. This should seem promising because it respects the symmetry of the circle. And math has a tendency to reward you when you respect its symmetries.

Let’s take one of those rings which has some inner radius, 𝑟, that’s between zero and three. If we can find a nice expression for the area of each ring like this one, and if we have a nice way to add them all up, it might lead us to an understanding of the full circle’s area. Maybe you start by imagining straightening out this ring. And you could try thinking through exactly what this new shape is and what its area should be. But for simplicity, let’s just approximate it as a rectangle. The width of that rectangle is the circumference of the original ring, which is two 𝜋 times 𝑟, right? I mean, that’s essentially the definition of 𝜋. And its thickness? Well, that depends on how finely you chopped up the circle in the first place, which was kind of arbitrary.

In the spirit of using what will come to be standard calculus notation, let’s call that thickness d𝑟, for a tiny difference in the radius from one ring to the next. Maybe you think of it as something like 0.1. So, approximating this unwrapped ring as a thin rectangle, its area is two 𝜋 times 𝑟, the radius, times d𝑟, the little thickness. And even though that’s not perfect, for smaller and smaller choices of d𝑟, this is actually gonna be a better and better approximation for that area since the top and the bottom sides of this shape are gonna get closer and closer to being exactly the same length.

So let’s just move forward with this approximation, keeping in the back of our minds that it’s slightly wrong. But it’s gonna become more accurate for smaller and smaller choices of d𝑟; that is, if we slice up the circle into thinner and thinner rings. So just to sum up where we are, you’ve broken up the area of the circle into all of these rings. And you’re approximating the area of each one of those as two 𝜋 times its radius times d𝑟, where the specific value for that inner radius ranges from zero, for the smallest ring, up to just under three, for the biggest ring, spaced out by whatever the thickness is that you choose for d𝑟, something like 0.1.

And notice that the spacing between the values here corresponds to the thickness d𝑟 of each ring, the difference in radius from one ring to the next. In fact, a nice way to think about the rectangles approximating each ring’s area is to fit them all upright side by side along this axis. Each one has a thickness d𝑟, which is why they fit so snugly right there together. And the height of any one of these rectangles sitting above some specific value of 𝑟, like 0.6, is exactly two 𝜋 times that value. That’s the circumference of the corresponding ring that this rectangle approximates.

Pictured like this, two 𝜋𝑟 can actually get kinda tall for the screen. I mean, two times 𝜋 times three is around 19. So let’s just throw up a 𝑦-axis that’s scaled a little differently so that we can actually fit all of these rectangles on the screen. A nice way to think about this set-up is to draw the graph of two 𝜋𝑟, which is a straight line that has a slope two 𝜋. Each of these rectangles extends up to the point where it just barely touches that graph. Again, we’re being approximate here. Each of these rectangles only approximates the area of the corresponding ring from the circle. But remember, that approximation, two 𝜋𝑟 times d𝑟, gets less and less wrong as the size of d𝑟 gets smaller and smaller.

And this has a very beautiful meaning when we’re looking at the sum of the areas of all those rectangles. For smaller and smaller choices of d𝑟, you might at first think that that turns the problem into a monstrously large sum. I mean there’s many, many rectangles to consider. And the decimal precision of each one of their areas is gonna be an absolute nightmare! But notice, all of their areas in aggregate just looks like the area under a graph. And that portion under the graph is just a triangle. A triangle with a base of three and a height that’s two 𝜋 times three. So its area, one-half base times height, works out to be exactly 𝜋 times three squared. Or, if the radius of our original circle was some other value, capital 𝑅, that area comes out to be 𝜋 times 𝑅 squared. And that’s the formula for the area of a circle.

It doesn’t matter who you are or what you typically think of math. That right there is a beautiful argument. But if you wanna think like a mathematician here, you don’t just care about finding the answer. You care about developing general problem-solving tools and techniques. So take a moment to meditate on what exactly just happened and why it worked. Cause the way that we transition from something approximate to something precise is actually pretty subtle. And it cuts deep to what calculus is all about.

You had this problem that could be approximated with the sum of many small numbers, each of which looked like two 𝜋𝑟 times d𝑟 for values of 𝑟 ranging between zero and three. Remember, the small number d𝑟 here represents our choice for the thickness of each ring, for example 0.1. And there are two important things to note here. First of all, not only is d𝑟 a factor in the quantities we’re adding up, two 𝜋𝑟 times d𝑟, it also gives the spacing between the different values of 𝑟. And secondly, the smaller our choice for d𝑟, the better the approximation.

Adding all of those numbers could be seen in a different pretty clever way as adding the areas of many thin rectangles sitting underneath a graph, the graph of the function two 𝜋𝑟 in this case. Then, and this is key, by considering smaller and smaller choices for d𝑟 corresponding to better and better approximations of the original problem, this sum, thought of as the aggregate area of those rectangles, approaches the area under the graph. And because of that, you can conclude that the answer to the original question in full unapproximated precision is exactly the same as the area underneath this graph.

A lot of other hard problems in math and science can be broken down and approximated as the sum of many small quantities, things like figuring out how far a car has traveled based on its velocity at each point in time. In a case like that, you might range through many different points in time. And at each one, multiply the velocity at that time times a tiny change in time, d𝑡, which would give the corresponding little bit of distance traveled during that little time. I’ll talk through the details of examples like this later in the series. But at a high level, many of these types of problems turn out to be equivalent to finding the area under some graph, in much the same way that our circle problem did.

This happens whenever the quantities that you’re adding up, the one whose sum approximates the original problem, can be thought of as the areas of many thin rectangles sitting side by side like this. If finer and finer approximations of the original problem correspond to thinner and thinner rings, then the original problem is gonna be equivalent to finding the area under some graph. Again, this is an idea we’ll see in more detail later in the series. So don’t worry if it’s not 100 percent clear right now.

The point now is that you, as the mathematician, having just solved a problem by reframing it as the area under a graph, might start thinking about how to find the areas under other graphs. I mean, we were lucky in the circle problem that the relevant area turned out to be a triangle. But imagine instead something like a parabola, the graph of 𝑥 squared. What’s the area underneath that curve, say between the values of 𝑥 equals zero and 𝑥 equals three? Well, it’s hard to think about, right? And let me reframe that question in a slightly different way. We’ll fix that left endpoint in place at zero and let the right endpoint vary. Are you able to find a function, 𝐴 of 𝑥, that gives you the area under this parabola between zero and 𝑥? A function, 𝐴 of 𝑥, like this is called an integral of 𝑥 squared.

Calculus holds within it the tools to figure out what an integral like this is. But right now, it’s just a mystery function to us. We know it gives the area under the graph of 𝑥 squared between some fixed left point and some variable right point. But we don’t know what it is. And again, the reason we care about this kind of question is not just for the sake of asking hard geometry questions. It’s because many practical problems that can be approximated by adding up a large number of small things can be reframed as a question about an area under a certain graph. And I’ll tell you right now that finding this area, this integral function, is genuinely hard.

And whenever you come across a genuinely hard question in math, a good policy is to not try too hard to get at the answer directly, since usually you just end up banging your head against a wall. Instead, play around with the idea with no particular goal in mind. Spend some time building up familiarity with the interplay between the function defining the graph, in this case 𝑥 squared, and the function giving the area. In that playful spirit, if you’re lucky, here’s something that you might notice. When you slightly increase 𝑥 by some tiny nudge, d𝑥, look at the resulting change in area represented with this sliver that I’m going to call d𝐴, for a tiny difference in area. That sliver can be pretty well approximated with a rectangle, one whose height is 𝑥 squared and whose width is d𝑥. And the smaller the size of that nudge, d𝑥, the more that sliver actually looks like a rectangle.

Now this gives us an interesting way to think about how 𝐴 of 𝑥 is related to 𝑥 squared. A change to the output of 𝐴, this little d𝐴, is about equal to 𝑥 squared, where 𝑥 is whatever input you started at, times d𝑥, the little nudge to the input that caused 𝐴 to change. Or, rearranged, d𝐴 divided by d𝑥, the ratio of a tiny change in 𝐴 to the tiny change in 𝑥 that caused it, is approximately whatever 𝑥 squared is at that point. And that’s an approximation that should get better and better for smaller and smaller choices of d𝑥. In other words, we don’t know what 𝐴 of 𝑥 is. That remains a mystery. But we do know a property that this mystery function must have.

When you look at two nearby points, for example three and 3.001, consider the change to the output of 𝐴 between those two points, the difference between the mystery function evaluated at 3.001 and evaluated at three. That change divided by the difference in the input values, which in this case is 0.001, should be about equal to the value of 𝑥 squared for the starting input, in this case three squared. And this relationship between tiny changes to the mystery function and the values of 𝑥 squared itself is true at all inputs, not just three. That doesn’t immediately tell us how to find 𝐴 of 𝑥. But it provides a very strong clue that we can work with.

And there’s nothing special about the graph 𝑥 squared here. Any function defined as the area under some graph has this property that d𝐴 divided by d𝑥, a slight nudge to the output of 𝐴 divided by a slight nudge to the input that caused it, is about equal to the height of the graph at that point. Again, that’s an approximation that gets better and better for smaller choices of d𝑥. And here, we’re stumbling into another big idea from calculus, derivatives. This ratio, d𝐴 divided by d𝑥, is called the derivative of 𝐴. Or more technically, the derivative is whatever this ratio approaches as d𝑥 gets smaller and smaller. I’ll dive much more deeply into the idea of a derivative in the next video. But loosely speaking, it’s a measure of how sensitive a function is to small changes in its input.

You’ll see as the series goes on that there are many, many ways that you can visualize a derivative, depending on what function you’re looking at and how you think about tiny nudges to its output. And we care about derivatives because they help us solve problems. And in our little exploration here, we already have a slight glimpse of one way that they’re used. They are the key to solving integral questions, problems that require finding the area under a curve. Once you gain enough familiarity with computing derivatives, you’ll be able to look at a situation like this one where you don’t know what a function is. But you do know that its derivative should be 𝑥 squared. And from that, reverse engineer what the function must be.

And this back and forth between integrals and derivatives where the derivative of a function for the area under a graph gives you back the function defining the graph itself is called the fundamental theorem of calculus. It ties together the two big ideas of integrals and derivatives. And it shows how, in some sense, each one is an inverse of the other.

All of this is only a high-level view, just a peek at some of the core ideas that emerge in calculus. And what follows in the series are the details for derivatives and integrals and more. At all points, I want you to feel that you could’ve invented calculus yourself. That if you drew the right pictures and played with each idea in just the right way, these formulas and rules and constructs that are presented could have just as easily popped out naturally from your own explorations.