Lesson Explainer: Approximating a Binomial Distribution | Nagwa Lesson Explainer: Approximating a Binomial Distribution | Nagwa

Lesson Explainer: Approximating a Binomial Distribution Mathematics

Join Nagwa Classes

Attend live Mathematics sessions on Nagwa Classes to learn more about this topic from an expert teacher!

In this explainer, we will learn how to approximate a binomial distribution with a normal distribution.

Recall that if a discrete random variable 𝑋 represents the number of successful trials in an experiment, we can model 𝑋 with a binomial distribution 𝐵(𝑛,𝑝), written 𝑋𝐵(𝑛,𝑝), provided the experiment satisfies all the following conditions:

  1. The number of trials, 𝑛, is fixed.
  2. Each trial has two possible outcomes (success or failure).
  3. The probability of success, 𝑝, is fixed.
  4. The trials are independent, so the outcome of one trial does not affect the outcome of any other.

The probability mass function of the binomial distribution is given by 𝑃(𝑋=𝑟)=𝑛𝑟𝑝(1𝑝)=𝑛𝑟𝑛𝑟𝑝(1𝑝). When the number of trials 𝑛 is large, the number 𝑛 is very big, and this calculation can become unwieldy. For this reason, it is often useful to approximate a binomial distribution with a normal distribution. In order for a normal approximation to be appropriate for a binomially distributed random variable 𝑋𝐵(𝑛,𝑝), two conditions must be met:

  1. The probability 𝑝 must be close to 0.5. For us, this means 0.4𝑝0.6.
  2. The number of trials 𝑛 must be sufficiently large. For us, this means 𝑛50.

The normal distribution 𝑁𝜇,𝜎 takes two parameters, the mean 𝜇 and the standard deviation 𝜎 (or variance 𝜎). When approximating a binomial distribution with a normal distribution, these parameters are calculated as follows.

Formula: Approximating a Binomial Distribution with a Normal Distribution

Suppose 𝑋𝐵(𝑛,𝑝) is a binomially distributed discrete random variable with 𝑛50 and 0.4𝑝0.6. Then, 𝑋 may be approximated by a normal distribution 𝑁𝜇,𝜎, where 𝜇=𝑛𝑝,𝜎=𝑛𝑝(1𝑝).

An important point to remember when approximating a binomial distribution with a normal distribution is that the binomial distribution is a discrete probability distribution, while the normal distribution is continuous. For this reason, it is necessary to apply a continuity correction when calculating probabilities.

For example, consider a discrete random variable 𝑋𝐵(400,0.55). Suppose we approximate this variable with the normal distribution 𝑁𝜇,𝜎, where 𝜇=𝑛𝑝=220 and 𝜎=𝑛𝑝(1𝑝)=99. If we use this approximation to calculate the probability 𝑃(𝑋=200), then we will get answer of 0 because the normal distribution (being continuous) has 𝑃(𝑋=𝑎)=0 for any 𝑎. The continuity correction we need to make is to instead calculate the probability on a unit interval centered on 200; that is, 𝑃(199.5<𝑋<200.5)=0.0053. Similarly, if we want to calculate the probability that 𝑋 is strictly less than some value, say 230, then we should calculate 𝑃(𝑋<229.5)=0.8302 in order to exclude 230. However, if we want the probability that 𝑋 is less than or equal to 230, we should calculate 𝑃(𝑋<230.5)=0.8544.

Let us have a look at an example of some variables that can be approximated by a normal distribution and some that cannot.

Example 1: Recognizing a Valid Normal Approximation

Which of the following four binomially distributed random variables may be approximated by a normal distribution?

  1. 𝑋𝐵(98,0.42)
  2. 𝑋𝐵(30,0.55)
  3. 𝑋𝐵(700,0.75)
  4. 𝑋𝐵(200,0.59)

Answer

The criteria for whether a normal approximation is appropriate for a binomial random variable 𝑋𝐵(𝑛,𝑝) are as follows:

  1. The probability 𝑝 is close to 0.5, meaning 0.4𝑝0.6.
  2. The number of trials 𝑛 is sufficiently large, meaning 𝑛50.

By these criteria, we can see that variables a and d may be approximated by a normal distribution. Variables b and c cannot be approximated by a normal distribution since variable b fails to satisfy criterion 2 and variable c fails to satisfy criterion 1.

Let us now calculate the mean and variance of a normal approximation.

Example 2: Calculating the Mean and Variance of a Normal Approximation

𝑋 is a binomially distributed random variable, and 𝑋𝐵(250,0.47). Write down a normal approximation of 𝑋𝑁𝜇,𝜎, stating the values of 𝜇 and 𝜎.

Answer

Recall that a binomial random variable 𝑋𝐵(𝑛,𝑝) with 𝑛50 and 0.4𝑝0.6 may be approximated by a normal distribution 𝑁𝜇,𝜎, where 𝜇=𝑛𝑝 and 𝜎=𝑛𝑝(1𝑝).

Therefore, our binomial random variable 𝑋𝐵(250,0.47) may be approximated by a normal distribution with 𝜇=𝑛𝑝=250×0.47=117.5 and 𝜎=𝑛𝑝(1𝑝)=117.5×0.53=62.275.

Let us now use a normal approximation to estimate a probability.

Example 3: Using a Normal Approximation to Estimate Probabilities

A discrete random variable 𝑋 is binomially distributed, and 𝑋𝐵(200,0.47). Using a normal approximation, estimate 𝑃(𝑋96).

Answer

Recall that a binomial random variable 𝑋𝐵(𝑛,𝑝) with 𝑛50 and 0.4𝑝0.6 may be approximated by a normal distribution 𝑁𝜇,𝜎, where 𝜇=𝑛𝑝 and 𝜎=𝑛𝑝(1𝑝).

Since the normal distribution is a continuous probability distribution while the binomial distribution is discrete, when calculating a probability using a normal approximation, we need to make continuity corrections.

We approximate 𝑋𝐵(200,0.47) with a normal distribution 𝑁𝜇,𝜎 where 𝜇 is given by 𝜇=200×0.47=94 and 𝜎=94×0.53=49.82. Our approximate model is therefore 𝑁(94,49.82).

Notice that the question asks us to estimate the probability that 𝑋 is less than or equal to 96. We therefore need to make a continuity correction to include the value 96 and calculate the probability 𝑃(𝑋<96.5) in our continuous normal approximation. We get a probability of 0.6384.

Remember that if we want to use a normal approximation to estimate the probability that a discrete random variable takes a particular value 𝑃(𝑋=𝑎), then we have to make the continuity correction 𝑃(𝑎0.5<𝑋<𝑎+0.5). Here is an example to test this skill.

Example 4: Using a Normal Approximation to Estimate Probabilities

A discrete random variable 𝑋 is binomially distributed, and 𝑋𝐵(175,0.56). Using a normal approximation, estimate 𝑃(𝑋=106).

Answer

Recall that a binomial random variable 𝑋𝐵(𝑛,𝑝) with 𝑛50 and 0.4𝑝0.6 may be approximated by a normal distribution 𝑁𝜇,𝜎, where 𝜇=𝑛𝑝 and 𝜎=𝑛𝑝(1𝑝).

Since the normal distribution is a continuous probability distribution while the binomial distribution is discrete, when calculating a probability using a normal approximation, we need to make continuity corrections.

We approximate 𝑋𝐵(175,0.56) with a normal distribution 𝑁𝜇,𝜎, where 𝜇=175×0.56=98 and 𝜎=98×0.44=43.12. Our approximate model is therefore 𝑁(98,43.12).

The question asks us to estimate the probability that 𝑋 is equal to 106. We therefore need to make a continuity correction and calculate the probability 𝑃(105.5<𝑋<106.5) over the unit interval centered on 106. We get a probability of 0.0289.

Let us now make a normal approximation in a real-life context.

Example 5: Solving a Binomial Distribution Approximation Problem In a Real-Life Context

Karim flips a fair coin 84 times. By using a suitable normal approximation, calculate the estimated probability that the coin lands on heads more than 50 times.

Answer

Recall that a binomial random variable 𝑋𝐵(𝑛,𝑝) with 𝑛50 and 0.4𝑝0.6 may be approximated by a normal distribution 𝑁𝜇,𝜎, where 𝜇=𝑛𝑝 and 𝜎=𝑛𝑝(1𝑝).

Since the normal distribution is a continuous probability distribution while the binomial distribution is discrete, when calculating a probability using a normal approximation, we need to make continuity corrections.

Let 𝑋 represent the number of times the coin lands on heads. This is a discrete random variable that follows the binomial distribution 𝐵(𝑛,𝑝) with 𝑛=84 and 𝑝=0.5. Since the number of trials is 𝑛=84>50 and the probability of getting heads is 𝑝=0.5, we may approximate 𝑋 with a normal distribution 𝑁𝜇,𝜎, where 𝜇=84×0.5=42 and 𝜎=42×0.5=21. Our approximate model is therefore 𝑁(42,21). We need to estimate the probability 𝑃(𝑋>50). Since the inequality is strict, we make the continuity correction 𝑃(𝑋>50.5) to exclude the case 𝑋=50 for an estimate of 0.0318.

It can sometimes be instructive to compare the probabilities obtained from a binomial distribution and a normal approximation of it, as shown in the next example.

Example 6: Comparing a Binomial Distribution and a Normal Approximation

A farm produces eggs. The farmer claims that 60% of the eggs weigh more than 64 grams.

  1. A random sample of 20 eggs is taken. Find the exact probability that at least 15 of the eggs weigh more than 64 grams.
  2. A random sample of 1‎ ‎000 eggs is taken. Use a normal approximation to estimate the probability that more than 550 and less than 600 eggs weigh more than 64 grams.

Answer

Part 1

Recall that if a discrete random variable 𝑋 represents the number of successful trials in an experiment, we can model 𝑋 with a binomial distribution 𝐵(𝑛,𝑝), written 𝑋𝐵(𝑛,𝑝), provided the experiment satisfies all the following conditions:

  1. The number of trials, 𝑛, is fixed.
  2. Each trial has two possible outcomes (success or failure).
  3. The probability of success, 𝑝, is fixed.
  4. The trials are independent, so the outcome of one trial does not affect the outcome of any other.

Let 𝑋 be the number of eggs that weigh more than 64 grams. A sample of 20 eggs may be modeled by the binomial distribution 𝑋𝐵(𝑛,𝑝). Here, 𝑛=20 and we take our fixed probability 𝑝=0.6 according to the farmer’s claim that 60% of the eggs weigh more than 64 grams.

Recall that the probability mass function of the binomial distribution is given by 𝑃(𝑋=𝑟)=𝑛𝑟𝑝(1𝑝). Therefore, we can calculate 𝑃(𝑋15) as 20150.6×0.4+20160.6×0.4+20170.6×0.4+20180.6×0.4+20190.6×0.4+20200.6×0.4=15504×0.0004702×0.01024+4845×0.0002821×0.0256+1140×0.0001693×0.064+190×0.0001016×0.16+20×0.00006094×0.4+0.00003656=0.1256.

Part 2

As we can see, this calculation is already rather cumbersome! We certainly would not want to have to do this with a sample size of 1‎ ‎000. For this reason, we will approximate 𝑋 with a normal distribution 𝑁𝜇,𝜎.

Recall that a binomial random variable 𝑋𝐵(𝑛,𝑝) with 𝑛50 and 0.4𝑝0.6 may be approximated by a normal distribution 𝑁𝜇,𝜎, where 𝜇=𝑛𝑝 and 𝜎=𝑛𝑝(1𝑝).

So, in our approximation, we take 𝜇=1000×0.6=600 and 𝜎=600×0.4=240. Our approximate model is therefore 𝑁(600,240).

We can now use our calculators to calculate the probability that 𝑋 lies strictly between 550 and 600 in this model.

In order to exclude the cases of 550 and 600, we make a continuity correction and calculate 𝑃(550.5<𝑋<599.5)=0.4864.

We can evaluate the accuracy of a normal approximation by comparing it to the binomial distribution it is approximating. Suppose we have a binomial random variable 𝑋𝐵(𝑛,𝑝) that is approximated by a normal distribution 𝑁𝜇,𝜎. We can evaluate a particular probability in the binomial model, say 𝑃(𝑋=𝑎), and in the normal approximation, 𝑃(𝑋=𝑎). The magnitude of the difference between these two numbers, |𝑃(𝑋=𝑎)𝑃(𝑋=𝑎)|, is the absolute error of the approximation. A more useful statistic is the percentage error, which is calculated as percentageerror=|𝑃(𝑋=𝑎)𝑃(𝑋=𝑎)|𝑃(𝑋=𝑎)×100. In our final example, we apply this idea.

Example 7: Calculating the Percentage Error of a Normal Approximation

A population of frogs contains female and male frogs in the ratio 5347. A random sample of 100 frogs is taken. Find the percentage error when using a normal approximation to calculate the probability that exactly 60 of the frogs are female.

Answer

Recall that if a discrete random variable 𝑋 represents the number of successful trials in an experiment, we can model 𝑋 with a binomial distribution 𝐵(𝑛,𝑝), written 𝑋𝐵(𝑛,𝑝), provided the experiment satisfies all the following conditions:

  1. The number of trials, 𝑛, is fixed.
  2. Each trial has two possible outcomes (success or failure).
  3. The probability of success, 𝑝, is fixed.
  4. The trials are independent, so the outcome of one trial does not affect the outcome of any other.

Let 𝑋 be the number of female frogs in the sample. This is a binomial random variable 𝑋𝐵(𝑛,𝑝) with 𝑛=100 and 𝑝=0.53.

Recall that a binomial random variable 𝑋𝐵(𝑛,𝑝) with 𝑛50 and 0.4𝑝0.6 may be approximated by a normal distribution 𝑁𝜇,𝜎, where 𝜇=𝑛𝑝 and 𝜎=𝑛𝑝(1𝑝).

We will approximate 𝑋 with a normal distribution 𝑁𝜇,𝜎, where 𝜇=100×0.53=53 and 𝜎=53×0.47=24.91. Our approximate model is therefore 𝑁(53,24.91). In order to calculate the percentage error of this approximation, we will first calculate the probability of exactly 60 female frogs in the binomial model, 𝑃(𝑋=60), and then calculate the same probability with a continuity correction in the normal model, 𝑃(59.5<𝑋<60.5).

The probability mass function of the binomial distribution is given by 𝑃(𝑋=𝑟)=𝑛𝑟𝑝(1𝑝), and so we can calculate 𝑃(𝑋=60) as 𝑃(𝑋=60)=100600.53×0.47=0.0301. On the other hand, our calculators give us a value of 𝑃(59.5<𝑋<60.5)=0.0299 for the normal approximation. We can now calculate the percentage error of the approximation according to the formula percentageerror=|𝑃(𝑋=𝑎)𝑃(𝑋=𝑎)|𝑃(𝑋=𝑎)×100=0.03010.02990.0301×100=0.66%. This shows that with a percentage error of less than 1%, the normal distribution is a reasonably good approximation in this case.

Let us finish by recapping a few important concepts from this explainer.

Key Points

  • It is appropriate to approximate a binomial random variable 𝑋𝐵(𝑛,𝑝) with a normal distribution when 𝑛 is sufficiently large (𝑛50) and 𝑝 is close to 0.5 (0.4𝑝0.6).
  • The parameters of a normal approximation 𝑁𝜇,𝜎 are given by 𝜇=𝑛𝑝 and 𝜎=𝑛𝑝(1𝑝).
  • When using a normal approximation to estimate probabilities, we need to make continuity corrections as follows:
    • When estimating a probability of the form 𝑃(𝑋=𝑎), we should calculate the probability over the unit interval centered on 𝑎; that is, 𝑃(𝑎0.5<𝑋<𝑎+0.5).
    • When estimating a probability of the form 𝑃(𝑋<𝑎), we should evaluate the probability 𝑃(𝑋<𝑎0.5) to exclude 𝑎.
    • When estimating a probability of the form 𝑃(𝑋𝑎), we should evaluate the probability 𝑃(𝑋<𝑎+0.5) to include 𝑎.
  • We can evaluate the accuracy of a normal approximation by calculating its percentage error according to the formula percentageerror=|𝑃(𝑋=𝑎)𝑃(𝑋=𝑎)|𝑃(𝑋=𝑎)×100.

Join Nagwa Classes

Attend live sessions on Nagwa Classes to boost your learning with guidance and advice from an expert teacher!

  • Interactive Sessions
  • Chat & Messaging
  • Realistic Exam Questions

Nagwa uses cookies to ensure you get the best experience on our website. Learn more about our Privacy Policy