## What is Binomial Distribution?

A Binomial Distribution describes the probability of an event that only has 2 possible outcomes. Ex. Heads or tails.

It can also be used to describe the probability of a series of independent events that only have 2 possible outcomes occurring. Ex. Flipping a coin 10 times and having it land with 5 on heads exactly 5 times. Or, flipping a coin 10 times and having at least 5 land on heads.

The probability of an event with n trials and f failures follows a binomial distribution.

### Visualizing Binomial Distribution

The Binomial Distribution looks like so when graphed:

By Tayste – Own work, Public Domain, Link

## When Would You Use Binomial Distribution?

### Requirements and Conditions for a Binomial Distribution

- Must be a fixed number of trials.
- Continuous data are not binomial.
- Probability of success should be the same on every trial.
- Probability of success is constant.
- Two state. Two possible outcomes.
- (true or false, hot or cold, success or failure, defective or not defective.)

- Independent trials – trials are statistically independent.
- (ex the flip of one coin means nothing to the results of the next coin flip.)

- Use Binomial Distribution when you are sampling with replacement.

### You Cannot Use Binomial Distribution On:

- When the probability of success is not constant for an event.
- Ex. The probability of it snowing or not snowing in NYC would not fit the criteria for a Binomial Distribution because the probability of success is not constant. The chance of snow on winter days is higher than summer days.

- When you sample WITHOUT replacement.
- In that case, use Hypergeometric.

### Cumulative vs Non-Cumulative

There are (2) ways I’ve seen Binomial Distribution Problems be represented in Six Sigma Exams:

- Non-cumulative questions
- Cumulative questions (with or without a chart)

The questions can either be about the actual equations and translating a word problem into an actual solution. Or they can be about the necessary scenarios when you would apply binomial probability – in other words, what MUST be true to use it, or if an event is binomial, what can we infer about it’s properties.

#### Non Cumulative Probability

This is just a fancy statistics term of single events taken on their own.

Example: a single flip of a coin

#### Cumulative Probability

Another fancy statistics term for multiple events. I like to remember that Cumulative events literally accumulate multiple probabilities.

One thing to be aware of is that you can be asked to either manually calculate a binomial cumulative probability question or you could be used to leverage a chart.

### Notes About Sampling With Replacement

Binomial is for N trials and F failures and describes the probability of * k* successes in

**draws**

*n**with*replacement from a finite population of size

*containing exactly*

**N***successes.*

**K**Remember, binomial is for independent events where the likelihood of each event occurring is the same as the others. If you sample without replacing, you’re affecting the likelihood of independence.

This is in contrast to the hypergeometric distribution, which describes the probability of * k* successes in

**draws**

*n**when sampling without*replacement.

## Examples of Binomial Distribution Problems

### Non Cumulative Binomial Probability

#### Non-**Cumulative Calculation Example 1: “Contains Exactly”**

If a coin is tossed 10 times, what is the probability of ** obtaining exactly four heads?** (If an honest coin is tossed 10 times, what is the probability of 4 heads?)

With a coin toss there are only 2 possible outcomes; heads or tails. So you should be on alert for using binomial.

#### Non-**Cumulative Calculation Example 2: “Contains Exactly”**

A manufacturing process creates 3.4% defective parts. A sample of 10 parts from the production process is selected. What is the probability that the sample ** contains exactly **3 defective parts?

As soon as you see the word defective, you should be alert to using the Binomial equation. Since defect in this sense means that a part is in a binary state, either functioning or defective, it meets our criteria.

#### Non-**Cumulative Calculation Example 3: “Contains Exactly”**

Forty-five percent of all registered voters in a national election are female. A random sample of 8 voters is selected. The probability that the sample contains 2 males is:

Since in this question the assumption is that there are (2) biological genders measured, male or female, you should be on alert to use the binomial distribution.

**Non-Cumulative Chart Example: **

You can see that the same percentage was found on the chart. (In this case, x is the probability of the event happening instead of p.)

### Examples of Cumulative Binomial Probability Questions

**Cumulative Calculation Example:** “At least” or “Fewer Than” or “At Most”

79% percent of the students of a large class passed the final exam. A random sample of 4 students are selected to be analyzed by the school. What is the probability that the sample ** contains fewer than 2 **students that passed the exam?

Since we are examining data that only has a binary state, pass or fail, you should be on alert to using the binomial equation.

Also, since you’re asked for ‘x or fewer’, you have to calculate the probability of ALL POSSIBLE events.

In this example we must calculate the odds of EXACTLY one pass PLUS the odds of EXACTLY no pass in the sample.

#### Cumulative Calculation Example: “More than”

An example of this kind of question is “Test the probability of the number of dropped calls ** will exceed** a certain number.”

Here you’d do the opposite of the “Fewer” example above and count the probabilities above.

So, if we re-wrote the question above to be ‘What’s the probability of sampling 10 students and 8 or more passed’ it would be the probability of EXACTLY 8 passing, PLUS EXACTLY 9 passing, PLUS EXACTLY 10 passing.

**Binomial Probability Chart Questions**

This can be a LOT of calculations. Fortunately, there are some short cuts involving a table.

### Binomial Probability Charts

Here are some good references:

- https://www.statisticshowto.datasciencecentral.com/tables/binomial-distribution-table/
- http://www.math.hawaii.edu/~xander/fa12_471/Binomial_Probabilities.pdf
- https://mat.iitm.ac.in/home/vetri/public_html/statistics/binomial.pdf

#### Binomial Chart Question Example: (Without Calculations)

- If I were using a Binomial Distribution Table, which values of X would I add together to get the probability of at most N defective?
- You would add the probabilities of N, n-1, n-2… all the way to zero.

- Ex. “which values of X would I add together to get the probability of at most 4 defective?”
- Add the probabilities of 4 + 3 + 2 +1 + 0.

#### Binomial Chart Question Example: (“More Than”)

Let’s assume that 20% of drivers on the road have NO car insurance. What is the probability that **more than 7** out of a sample of 15 people have no car insurance?

- Find
in the first column on the left.**n****= 15** - Find the column containing
.**p = 0.20** - Find the
**7**in the second column on the left, since we want to find*F*(7) =*P*(*X*≤ 7).

Now, all we need to do is read the probability value where the *p* = 0.20 column and the (*n* = 15, *x* = 7) row intersect. What do you get?

The cumulative binomial probability table tells us that *P*(*X* ≤ 7) = 0.9958. Therefore: *P*(*X* > 7) = 1 − 0.9958 = 0.0042

That is, the probability that more than 7 in a random sample of 15 would have no car insurance is 0.0042

## Example of Using Binomial Probability in a Six Sigma Project

Typically in a Six Sigma project you are using DMAIC methodology. Let’s say during the Define Phase you chartered a project to see if you could improve a process that had a binomial outcome. You even might use the Binomial Distribution to articulate the business case for why you should do the project in the first place – for instance, event A should be binomial, but it’s clearly not.

You might use the Binomial distribution to see if samples of the process outcome were in fact following a binomial distribution during the Analysis phase.

If you are running a pilot implementation during the Improve Phase, you might make the assumption that some events that are inputs to your process in fact will follow a binomial distribution.

In the Control Phase you might monitor that a specific process is generating Binomially distributed results if it should.

## What You Need to Know for Certification

### Six Sigma Green Belts

The ASQ Six Sigma Green Belt BOK requires:

Probability distributions

Describe and interpret normal, binomial, and Poisson, chi square, Student’s t, and F distributions. (Apply)

### Six Sigma Black Belts

The ASQ Six Sigma Black Belt BOK Requires:

Describe, apply and interpret the following distributions: normal, Poisson, binomial, chi square, Student’s t and F distributions. (Evaluate)

The Villanova Six Sigma Black Belt BOK requires the following

Advanced statistical process controlDistributions commonly used by black belts. Describe and apply binomial, Poisson, normal, chi-square, Student’s t, and F distributions.

## Comments (10)

I think there is a typo:

Cumulative

Order of when they happen does not matter.

Non Cumulative

Order of when they happen does not matter.

did you mean:

Cumulative

Order of when they happen does matter.

Non Cumulative

Order of when they happen does not matter.

Thanks for building this resource!

Yes, thank you Amy. Very much appreciate the proof reading!

The Cumulative Calculation Example is missing

Sorry, Nicolle. I’ll add that to the list to update.

TED can you please update Cumulative calculation example & Cumulative chart example are still missing.

Hi Akbar,

I’ve added that to my to-do list.

Best, Ted.

This is now complete. Take a look!

Hi All,

This article has been completely re-written and is much more comprehensive and hopefully more helpful.

Let me know how it goes!

Best, Ted.

Hi Ted,

Is Binomial Distribution needed for IASSC?

I don’t see it specifically in the IASSC Green Belt BOKIASSC Green Belt BOKIASSC Green Belt BOK, but they do list Statistics as being a core need. I find it worthy to master because a) it may be on the exam and b) understanding the concept is integral to industry expectations for green belt holders.