A generic candlestick chart.

4.1

Probability Fundamentals

Learning Objectives

Upon completion of this section, you should be able to


Introduction

Many events in the world around us exhibit random patterns, but by repeated observations of such events it can often be seen to predict long-term patterns (despite random, short-term fluctuations). Probability is simply the branch of mathematics devoted to the study of these events. The probability of a specified event is the chance or likelihood that it will occur.

A close up of a hand tossing a coin

What is the probability of a Head in a coin toss? How do you know that?

There are several ways of viewing probability. One would be experimental (also called empirical) in nature, where we repeatedly conduct an experiment and assign the probability to be the ratio of times we observed the event to the number of trials in the experiment. Suppose we flipped a coin 1000 times and heads occurred 529 times. We could assign the probability of being a head on a flip as the ratio 529 1000 .

Another approach is more theoretical in that we use logic or knowledge about the experiment to assign a probability. For our coin example we could use logical reasoning to say that each side of the coin has the same chance of landing upright when flipped, so we would assign the probability as 1 2 for being a heads. This theoretical value comes with some assumptions typically (the coin is balanced equally throughout, the player is not attempting to manipulate the toss, etc...).

Another view of how to assign probabilities would be subjective in nature, in other words an educated guess. If someone asked you the probability that the Diamondbacks would win their next baseball game, it would be impossible to conduct an experiment where the same two teams played each other repeatedly, each time with the same starting lineup and starting pitchers, each starting at the same time of day on the same field under the precisely the same conditions. Since there are so many variables to take into account, someone familiar with baseball and with the two teams involved might make an educated guess that there is a 75% chance they will win the game; that is, if the same two teams were to play each other repeatedly under identical conditions, the Mariners would win about three out of every four games. But this is just a guess, with no way to verify its accuracy, and depending upon how educated the educated guesser is, a subjective probability may not be worth very much.

We will return to the experimental and subjective probabilities from time to time, but in this course we will mostly be concerned with theoretical probability.

Sample Spaces and Events

If you roll a die, pick a card from deck of playing cards, or randomly select a person and observe their age, we say we are conducting a random experiment, an action for which all possible outcomes can be listed, but for which the actual outcome on any given trial of the experiment cannot be predicted with certainty. In probability, we look at the likelihood of an outcome or a collection of outcomes that we call an event in the random experiment. We begin with some terminology.

Terminology

A random experiment is a mechanism that produces a definite outcome that cannot be predicted with certainty. The sample space associated with a random experiment is the set or collection of all possible outcomes. An event is any particular outcome or group of outcomes. When an event is made up of just one single outcome we call it a simple event.

Example 1

Construct a sample space for the experiment that consists of tossing a single coin.

Solution

The outcomes could be labeled h for heads and t for tails. Then the sample space is the set S = { h , t } .

Example 2

Construct a sample space for the experiment that consists of rolling a standard single die. Find the events that correspond to the phrases “an even number is rolled” and “a number greater than two is rolled.”

Solution

The sample space is the set of all possible simple events:

S = { 1 , 2 , 3 , 4 , 5 , 6 }

The event “an even number is rolled” would be the collection of all even numbers. We will call this event E and would be: E = { 2 , 4 , 6 }

The event “a number greater than two is rolled.” would be the collection of all numbers larger (greater) than 2. We will call this event F and would be: F = { 3 , 4 , 5 , 6 }

The choice of the letter for an event is similar to the choice for a letter in a variable in an algebra application. We pick one that will hopefully help with the meaning of the event in terms of the application. The letter E was a natural choice for the event of an even number in the dice rolls, so it was used. The other event didn't have a natural choice so we picked a letter close to E and didn't have any inferred meaning in the application.

A graphical representation of a sample space and events is a Venn diagram, as shown below "Venn Diagrams for Two Sample Spaces". In general the sample space S is represented by a rectangle, outcomes by points within the rectangle, and events by ovals that enclose the outcomes that compose them.

first rectangle labeled a illustrating the coin toss sample space with two dots inside with one labeled h and other labeled t. Second rectangle labeled b illustrating the sample space for a single dice roll with dots labeled 1, 2, 3, 4, 5, 6 and blue oval around dots 3, 4, 5, 6 and red oval around dots 2, 4, and 6.
Venn Diagrams for Two Sample Spaces

Please be aware that certain images and Google documents may not be properly viewed on mobile devices.

A device that can be helpful in identifying all possible outcomes of a random experiment, particularly one that can be viewed as proceeding in stages, is what is called a tree diagram. It is described in the following example.

Example 3

Construct a sample space that describes all three-child families according to the binary genders of the children with respect to birth order. For simplicity we are only using the binary genders, but recognize there are those who identify as nonbinary.

Two of the outcomes are “two boys then a girl,” which we might denote bbg, and “a girl then two boys,” which we would denote gbb. Clearly there are many outcomes, and when we try to list all of them it could be difficult to be sure that we have found them all unless we proceed systematically. The tree diagram on the right "Tree Diagram For Three-Child Families", gives a systematic approach.

Tree diagram starting with b and g and each branching out to two choices of either b or g until we have branches that list out bbb, bbg, bgb, bgg, gbb, gbg, ggb, ggb, ggg.
Tree Diagram For Three-Child Families

The diagram was constructed as follows. There are two possibilities for the first child, boy or girl, so we draw two line segments coming out of a starting point, one ending in a b for “boy” and the other ending in a g for “girl.” For each of these two possibilities for the first child there are two possibilities for the second child, “boy” or “girl,” so from each of the b and g we draw two line segments, one segment ending in a b and one in a g. For each of the four ending points now in the diagram there are two possibilities for the third child, so we repeat the process once more.

The line segments are called branches of the tree. The right ending point of each branch is called a node. The nodes on the extreme right are the final nodes; to each one there corresponds an outcome, as shown in the figure.

From the tree it is easy to read off the eight outcomes of the experiment, so the sample space is, reading from the top to the bottom of the final nodes in the tree,

S = { b b b , b b g , b g b , b g g , g b b , g b g , g g b , g g g }

Try it Now 1

A random experiment consists of tossing two coins. Write out the sample space assuming we can tell the difference between the two coins. What outcomes make up the event there is at least one head showing?

Answer

Since we can tell the coins apart, there are two ways for the coins to differ we can label each outcome as a pair of letters, the first of which indicates how the first coin landed and the second coin landed. A sample space is then S = { h h , h t , t h , t t } .

The event "there is at least one head showing" can be made observed when there is one heads or two heads. We will call the event E and the outcomes are: E = { h h , h t , t h }

Probability

We have mentioned earlier for empirical probability that it represents a ratio of the number of times we observed an outcome over number of trials. With that in mind we will give a more formal definition along with properties that makes sense when we think of probabilities as a ratio.

Probability

The probability of an outcome e in a sample space S is a number p between 0 and 1 that measures the likelihood that e will occur on a single trial of the corresponding random experiment.

  • The value p = 0 corresponds to the outcome e being impossible

  • The value p = 1 corresponds to the outcome e being certain.

The probability of an event E is the sum of the probabilities of the individual outcomes of which it is composed. It is denoted P ( E ) .

If E = { e 1 , e 2 , ... , e k } , then P ( E ) = P ( e 1 ) + P ( e 2 ) + P ( e k )

"Sample Spaces and Probability" graphically illustrates the definitions.

Sample space S with outcomes e1, e2, e3, e4, e5 with associated probabilities p1, p2, p3, p4, p5. The events A={e1,e2} and B={e2,e3,e4}. P(A)=p1+p2 and P(B)=p2+p3+p4
Sample Spaces and Probability

Please be aware that certain images and Google documents may not be properly viewed on mobile devices.

Since the whole sample space S is an event that is certain to occur, the sum of the probabilities of all the outcomes must be the number 1.

In ordinary language probabilities are frequently expressed as percentages. For example, we would say that there is a 70% chance of rain tomorrow, meaning that the probability of rain is 0.70. We will use this practice here, but in all the computational formulas that follow we will use the form 0.70 and not 70%.

It is relatively straight forward to find probability is when all the outcomes have the same chance of occurring. For example experiments where each outcome has the same chance of happening would include the roll of a fair dice, flipping a fair coin, or even the results of a simple random sample from a population. Finding probabilities in these situations is just a matter or counting (as shown in the definition below).

Probability for Equal Likely Outcomes

Given that all outcomes are equally likely, we can compute the probability of an event E using this formula:

P ( E ) = Number of outcomes corresponding to the event E Total number of equally-likely outcomes

The " P ( E ) " notation we will be using reads "the probability of the event E occurring". In the notation P just means to find the probability and E is the event of interest.

Example 4

A coin is called “balanced” or “fair” if each side is equally likely to land up. Assign a probability to each outcome in the sample space for the experiment that consists of tossing a single fair coin.

Solution

With the outcomes labeled h for heads and t for tails, the sample space is the set S = { h , t } . Since the outcomes have the same probabilities, which must add up to 1, each outcome is assigned probability 1 2 . We could write these as P ( h ) = 1 2 and P ( t ) = 1 2 .

Example 5

A die is called “balanced” or “fair” if each side is equally likely to land on top. Assign a probability to each outcome in the sample space for the experiment that consists of tossing a single fair die. Find the probabilities of the events E: “an even number is rolled” and T: “a number greater than two is rolled.”

Solution

With outcomes labeled according to the number of dots on the top face of the die, the sample space is the set S = { 1 , 2 , 3 , 4 , 5 , 6 } . Since there are six equally likely outcomes, which must add up to 1, each is assigned probability 1 6 .

Since E = { 2 , 4 , 6 } and this makes up 3 of the total 6 outcomes we have

P ( E ) = P ( 2 ) + P ( 4 ) + P ( 6 ) P ( E ) = 1 6 + 1 6 + 1 6 P ( E ) = 3 6 P ( E ) = 1 2

Since T = { 3 , 4 , 5 , 6 } and this represents 4 of the 6 outcomes we have that

P ( T ) = P ( 3 ) + P ( 4 ) + P ( 5 ) + P ( 6 ) P ( T ) = 1 6 + 1 6 + 1 6 + 1 6 P ( T ) = 4 6 P ( T ) = 2 3

Probabilities are essentially fractions, and can be reduced to lower terms like fractions as you saw in the examples, however we may not always do this as the unsimplified fraction may contain more information when reading it then the simplified answer. For instance in the example above we know there was six outcomes on the dice roll and the probability of rolling a number greater than 2 was found to be 2 3 . If we leave the answer as 4 6 we keep the information about how many outcomes out of the 6 made up the event rolling a number greater than 2.

Example 6

Let's say you have a bag with 20 cherries where 14 are sweet and 6 are sour. If you pick a cherry at random, what is the probability that it will be sweet?

Solution

There are 20 possible cherries that could be picked, so the number of possible outcomes is 20. Of these 20 possible outcomes, 14 are favorable (sweet), so the probability that the cherry will be sweet is P ( Sweet ) = 14 20 = 7 10 .

There is one potential complication to the cherry picking example, however. It must be assumed that the probability of picking any of the cherries is the same as the probability of picking any other. This wouldn't be true if (let us imagine) the sweet cherries are smaller than the sour ones. (The sour cherries would come to hand more readily when you sampled from the bag.) Let us keep in mind, therefore, that when we assess probabilities in terms of the ratio of favorable to all potential cases, we rely heavily on the assumption of equal probability for all outcomes.

Example 7

A standard deck of 52 playing cards consists of four suits (hearts, spades, diamonds and clubs). Spades and clubs are black while hearts and diamonds are red. Each suit contains 13 cards, each of a different rank: an Ace (which in many games functions as both a low card and a high card), cards numbered 2 through 10, a Jack, a Queen and a King.

  1. Compute the probability of randomly drawing one card from a deck and getting an Ace.
  2. Compute the probability of randomly drawing one card from a deck and getting a Diamond.
  3. Compute the probability of randomly drawing one card from a deck and getting a black card.

Cards

Solutions

  1. There are 52 cards in the deck and 4 Aces so  P ( A c e ) = 4 52 = 1 13 0.0769
  2. There are 13 diamonds in the deck, so P ( Diamond ) = 13 52 = 1 4 = 0.25
  3. There are two suits that are black for a total of 26 cards, so P ( Black ) = 26 52 = 1 2 = 0.5

Try it Now 2

Two fair coins are tossed. Find the probability that the coins match, i.e., either both land heads or both land tails.

Hint 1

We constructed the sample space in the first Try It Now for this chapter. That sample space was S = { h h , h t , t h , t t } . What outcomes make up the event in question?

Answer

We constructed the sample space in the first Try It Now for this chapter. That sample space was S = { h h , h t , t h , t t } . We are interested in finding the probability of the event M = { h h , t t }

.

Now since the coin was fair we can safely assume each of the paired outcomes of the two coins will also be equally likely. This gives us that

P ( M ) = P ( h h ) + P ( t t ) P ( M ) = 1 4 + 1 4 P ( M ) = 2 4 P ( M ) = 1 2

Probability when things are not equally likely

The previous examples illustrate how probabilities can be computed simply by counting when the sample space consists of a finite number of equally likely outcomes. In some situations the individual outcomes of any sample space that represents the experiment are unavoidably unequally likely, in which case probabilities cannot be computed merely by counting, but the computational formula given in the definition of the probability of an event must be used.

The probability of an event E is the sum of the probabilities of the individual outcomes of which it is composed. It is denoted P ( E ) .

If E = { e 1 , e 2 , ... , e k } , then P ( E ) = P ( e 1 ) + P ( e 2 ) + P ( e k )

Example 8

The breakdown of the student body Race and Ethnicity at Pima Community College in Fall 2020 was 37.33% white, 4.93% black, 47.16% Hispanic, 3.12% Asian, and 1.97% American Indian or Alaskan Native, 5.49% for all others (including undeclared). A student is randomly selected from Fall 2020. (To select “randomly” means that every student has the same chance of being selected.) Find the probabilities of the following events:

  1. H: the student is Hispanic,
  2. N: the student is not Black.
  3. M: the student is a minority (that is, not white),

Solutions

The student body's race and ethnicity is broken down into 6 different outcomes. We can write this out as a sample space of S = { w , b , h , a , a i , o } . The experiment is the action of randomly selecting a student from the student population at Pima. Since 47.16% of the students are Hispanic and all students have the same chance of being selected, P ( h ) = 0.4716 , and similarly for the other outcomes. This information is summarized below:

Outcomes w b h a ai o
Probability 0.3733 0.0493 0.4716 0.0312 0.0197 0.0549
  1. Since H = { h } , P ( H ) = P ( h ) = 0.4716
  2. Since N = { w , h , a , a i , o } , P ( N ) = P ( w ) + P ( h ) + P ( a ) + P ( a i ) + P ( o ) P ( N ) = 0.3733 + 0.4716 + 0.0312 + 0.0197 + 0.0549 P ( N ) = 0.9507
  3. Since M = { b , h , a , a i , o } , P ( M ) = P ( b ) + P ( h ) + P ( a ) + P ( a i ) + P ( o ) P ( M ) = 0.0493 + 0.4716 + 0.0312 + 0.0197 + 0.0549 P ( M ) = 0.6267

In the last example the process for adding the probabilities together worked as the classifications of race/ethnicity didn't allow for any of the students to fall into more than one classification as well as all students would be captured into just one of them. If there were overlaps between two classifications we could not treat the probability question in the same way.

Try it Now 3

The student body in a local high school may be broken down into ten categories as follows: 25% white male, 26% white female, 12% black male, 15% black female, 6% Hispanic male, 5% Hispanic female, 3% Asian male, 3% Asian female, 1% male of other minorities combined, and 4% female of other minorities combined. A student is randomly selected from this high school. Find the probabilities of the following events:

  1. B: the student is black,
  2. MF: the student is minority female,
  3. FN: the student is female and is not black.
Hint 1

It may be helpful to organize the data into a table and convert the outcomes in the sample space into probabilities. The sample space would be S = { w m , b m , h m , a m , o m , w f , b f , h f , a f , o f }

Answer

The sample space would be S = { w m , b m , h m , a m , o m , w f , b f , h f , a f , o f } . The information given in the example can be summarized in the following table, called a two-way contingency table:

Gender Race / Ethnicity
White Black Hispanic Asian Others
Male 0.25 0.12 0.06 0.03 0.01
Female 0.26 0.15 0.05 0.03 0.04
  1. Since B = { b m , b f } :

    P ( B ) = P ( b m ) + P ( b f ) P ( B ) = 0.12 + 0.15 P ( B ) = 0.27
  2. Since M F = { b f , h f , a f , o f } :

    P ( M F ) = P ( b f ) + P ( h f ) + P ( a f ) + P ( o f ) P ( M F ) = 0.15 + 0.05 + 0.03 + 0.04 P ( M F ) = 0.27
  3. Since F N = { w f , h f , a f , o f } :

    P ( F N ) = P ( w f ) + P ( h f ) + P ( a f ) + P ( o f ) P ( F N ) = 0.26 + 0.05 + 0.03 + 0.04 P ( F N ) = 0.38

Exercises


  1. Which of the following can be a probability value?

    1 , 1.2 , .2 , 1 π , 2 , 5 7 , 7 5 , 0
    Answer

    1 , 1 π , 5 7 , 0

  2. A box contains 10 white and 10 black marbles. Construct a sample space for the experiment of randomly drawing out, with replacement, two marbles in succession and noting the color each time. (To draw “with replacement” means that the first marble is put back before the second marble is drawn.)

    For each event listed below write out the outcomes that make up the event. If we assume each outcome is equal likely find the probability of each event.
    1. At least one marble of each color is drawn.
    2. No white marble is drawn.
    Answer

    S = { w w , w b , b w , b b }

    1. E = { w b , b w } : P ( E ) = 2 4 = 1 2

    2. N W = { b b } : P ( N W ) = 1 4

  3. A box contains 16 white and 16 black marbles. Construct a sample space for the experiment of randomly drawing out, with replacement, three marbles in succession and noting the color each time. (To draw “with replacement” means that each marble is put back before the next marble is drawn.)

    For each event listed below write out the outcomes that make up the event. If we assume each outcome is equal likely find the probability of each event.
    1. At least one marble of each color is drawn.
    2. No white marble is drawn.
    3. More black than white marbles are drawn.
    Answer

    S = { w w w , w w b , w b w , b w w , w b b , b w b , b b w , b b b }

    1. E = { w w b , w b w , b w w , w b b , b w b , b b w } : P ( E ) = 6 8 = 3 4

    2. N W = { b b b } : P ( N W ) = 1 8

    3. F = { w b b , b w b , b b w , b b b } : P ( F ) = 4 8 = 1 2

  4. A box contains 8 red, 8 yellow, and 8 green marbles. Construct a sample space for the experiment of randomly drawing out, with replacement, two marbles in succession and noting the color each time.
    1. No yellow marble is drawn.
    2. The two marbles drawn have the same color.
    3. At least one marble of each color is drawn.
    Answer

    S = { r r , r y , y r , r g , g r , y y , y g , g y , g g }

    1. N Y = { r r , r g , g r , g g } : P ( N W ) = 4 9

    2. B = { r r , y y , g g } : P ( B ) = 3 9 = 1 3

    3. This event has no outcomes as it is not possible to draw all three colors when selecting only two marbles. We would say the probability is 0.

  5. Your sock drawer is a mess. Twelve black socks and six red socks are mixed together. What are the chances that, without looking, you pick out a red sock?
    Answer

    P ( R ) = 6 18 = 1 3

  6. A ball is drawn randomly from a jar that contains 6 red balls, 2 white balls, and 5 yellow balls. Find the probability of the given event.
    1. A red ball is drawn
    2. A white ball is drawn
    Answer
    1. P ( R ) = 6 13

    2. P ( W ) = 2 13

  7. Suppose you write each letter of the alphabet on a different slip of paper and put the slips into a hat. What is the probability of drawing one slip of paper from the hat at random and getting (assume y is a consonant for this exercise):
    1. A consonant
    2. A vowel
    Answer
    1. P ( C ) = 21 26

    2. P ( V ) = 5 26

  8. A group of people were asked if they had run a red light in the last year. 150 responded "yes", and 185 responded "no". Find the probability that if a person is chosen at random, they have run a red light in the last year.
    Answer

    P ( R ) = 150 335

  9. In a survey, 205 people indicated they prefer cats, 160 indicated they prefer dogs, and 40 indicated they don’t enjoy either pet. Find the probability that if a person is chosen at random, they prefer cats.
    Answer

    P ( C ) = 205 405

  10. Compute the probability of tossing a six-sided die (with sides numbered 1 through 6) and getting a 5.
    Answer

    P ( Roll a 5 ) = 1 6

  11. Compute the probability of tossing a standard six-sided die and getting a 7.
    Answer

    P ( Roll a 7 ) = 0 as it is not possible to roll a 7 with a single standard six-sided die.

  12. Compute the probability of tossing a six-sided die and getting an even number.
    Answer

    P ( E ) = 3 6

  13. Compute the probability of tossing a six-sided die and getting a number less than 3.
    Answer

    P ( Less than 3 ) = 2 6

  14. If you pick one card at random from a standard deck of cards, what is the probability it will be a King?
    Answer

    P ( K ) = 4 52

  15. If you pick one card at random from a standard deck of cards, what is the probability it will be a Diamond?
    Answer

    P ( D ) = 13 52

  16. The following two-way contingency table gives the breakdown of the population in a particular locale according to age and tobacco usage:
    Age Tobacco Use
    Smoker Non-smoker
    Under 30 0.05 0.20
    Over 30 0.20 0.55
    1. The person is a smoker.
    2. The person is under 30.
    3. The person is a smoker who is under 30.
    Answer
    1. P ( Smoker ) = 0.05 + 0.20 = 0.25

    2. P ( Under 30 ) = 0.05 + 0.20 = 0.25

    3. P ( Smoker Under 30 ) = 0.05

  17. The following two-way contingency table gives the breakdown of the population in a particular locale according to party affiliation (A, B, C, or None) and opinion on a bond issue:
    Affiliation Opinion
    Favors Opposes Undecided
    A 0.12 0.09 0.07
    B 0.16 0.12 0.14
    C 0.04 0.03 0.06
    None 0.08 0.06 0.03
    A person is selected at random. Find the probability of each of the following events.
    1. The person is affiliated with party B.
    2. The person is affiliated with some party.
    3. The person is in favor of the bond issue.
    4. The person has no party affiliation and is undecided about the bond issue.
    Answer
    1. P ( B ) = 0.16 + 0.12 + 0.14 = 0.42

    2. P ( Some Party ) = 0.12 + 0.09 + 0.07 + 0.16 + 0.12 + 0.14 + 0.04 + 0.03 + 0.06 = 0.83

    3. P ( In Favor ) = 0.12 + 0.16 + 0.04 + 0.08 = 0.40

    4. P ( No Party and Undecided ) = 0.03

      .