4  Probabilities with More than One Event

4.1 Marginal and Joint Probabilities

Up until this point we have primarily focused on assigning probabilities to particular events. If we have some event of interest, \(A\), then \(P(A)\) is the probability that \(A\) occurs in any manner. If we are using our equally likely probability model, then \(P(A) = \dfrac{N_A}{N}\). This is the probability of the event \(A\) where nothing else is known at all. If we smooth over anything which could alter the likelihood, if we have no additional information, if we want the best guess for the likelihood of occurrence in a vacuum, this is the probability of interest. We refer to such quantities as marginal probabilities.

Definition 4.1 (Marginal Probability) The marginal probability of a single event, \(A\), is the probability that the event happens without considering any additional information. This is the probability that the event happens, denoted \(P(A)\).

It is useful to specifically call these marginal probabilities to differentiate them from probabilities which depend on two or more events. Specifically, we can think about taking the intersection of two events, say \(A \cap B\). In words this is the event that \(A\) occurs and \(B\) occurs. Until this point we have thought about solving for probabilities related to intersections as a two-step procedure: first, find an event \(C = A \cap B\), and then second solve for the marginal probability of \(C\). While this is a useful technique for calculation, sometimes it is more useful to think of the probability of the intersection as the probability of \(A\) and \(B\) occurring simultaneously. We call this the joint probability of \(A\) and \(B\).

Definition 4.2 (Joint Probability) The joint probability of two events, \(A\) and \(B\), is given by the probability of their intersection. That is, \(P(A\cap B)\). This is sometimes denoted \(P(A,B)\) and corresponds to the probability that both \(A\) and \(B\) occur.

The joint probability of more than two events extends in the same way. Suppose there exists a sequence of events, \(A_1, \dots, A_n\), then the joint probability of these events is \[P(A_1,A_2,\dots,A_n) = P\left(\bigcap_{i=1}^n A_i\right).\]

4.2 What are Conditional Probabilities?

Marginal probabilities are often of interest, and frequently are the best tool for summarizing the overall state of the world, or our knowledge regarding the state of the world. Joint probabilities can be useful when working with compound events, or thinking of complex outcomes. However, there are many scenarios which are not covered by either the joint or marginal probabilities. In practice, we know that sometimes information that we have will change our understanding of the probability of an event.

Suppose the event \(A\) corresponds to the event that it snows tomorrow, in some particular city. It is possible to think about how often it snows on average, and report a value related to that as \(P(A)\). Now, what if we know that it is currently the middle of summer, and the city is in the Northern Hemisphere? In this case, while \(P(A)\) does not shrink to \(0\), it becomes far less likely than if we did not have that information. Similarly, if we know that it is winter in the same city, the likelihood that it snows tomorrow increases. In order to formally capture this we can introduce the idea of conditional probability.

Definition 4.3 (Conditional Probability) The conditional probability of an event \(A\) given an event \(B\), is the probability that \(A\) occurs assuming that we know that \(B\) occurs. The conditional probability takes into account additional information codified through the occurrence of an additional event. We write this quantity as \(P(A|B)\), and will read this as “the probability of \(A\) given \(B\).”

Unlike in the case of marginal probabilities, conditional probabilities allow us to condition on extra pieces of information. Instead of asking “what is the probability of this event”, we instead ask, “given that we know this piece of information, what is the probability of this event?” Joint probabilities ask “what is the probability that both of these events occur simultaneously” which is distinct in that we do not have any additional information about the state of the world when working with joint probabilities. The subtle distinction becomes quite powerful, both in terms of manipulating and working with probabilities, but also in terms of expressing the correct events of interest for ourselves.

To make use of conditional probabilities, we will think of the process of conditioning on one or more events. We will talk of the probability of \(A\) conditional on \(B\), where \(A\) and \(B\) are two events of interest. Intuitively, this is the probability of \(A\) happening, supposing that we know that \(B\) has already happened.

Example 4.1 (Six from the Sum) Charles and Sadie are playing a new game using two dice. In the game they each take turns rolling the two dice into a container that the other player cannot see. They add up the dice and then report this sum to the other player. The other player then has to guess whether there is a six showing.

  1. On one round, Charles does not hear the sum that Sadie reports as he was not paying attention. Sadie is strict and insists that she will not repeat herself. What should Charles guess?
  2. Determine a strategy which optimizes the likelihood that the guessing player will be correct.
  1. In this case we want the marginal probability of a six showing up on the roll of two dice. Take \(E\) to represent the event wherein at least one six is showing on the roll of two dice. Thus, \(E^C\) is the event that no sixes are showing. There are \(5\times 5=25\) (using the product rule for counting) ways of not rolling a \(6\), meaning that \[P(E^C) = \dfrac{25}{36} \implies P(E) = 1 - \dfrac{25}{36} = \dfrac{11}{36}.\] Thus, Charles should guess that there is no \(6\) as the probability is only \(11/36 \approx 0.31\).

  2. Here we wish to determine conditional probabilities. We take \(E\) to be the event that at least one \(6\) is showing, and then \(S\) to be a variable representing the sum of the two dice. For \(s= 1, \dots, 12\) we wish to find \(P(E|\{S=s\})\). Notice that for \(s=2,\dots,6\) \(P(E|S=s) = 0\). If the sum is \(2\) we know that there could not have been a \(6\). Moreover, for \(s=11,12\) we know that \(P(E|S=s) = 1\) since the only way to form \(11\) is a five and a six, and the only way to form \(12\) is to have two sixes. This leaves \(s = 7,\dots,10\) to check. The following table gives the set of values, the possible combinations to reach the value, and then the combinations that end up involving a \(6\).

    Value Combinations Involving 6
    7 (1,6) (2,5) (3, 4) (4, 3) (5, 2) (6, 1) (1, 6) (6, 1)
    8 (2,6) (3,5) (4, 4) (5, 3) (6, 2) (2, 6) (6, 2)
    9 (3,6) (4,5) (5, 4) (6, 3) (2, 6) (6, 2)
    10 (4, 6) (5, 5) (6, 4) (4, 6) (6, 4)

    Referencing from this table we can read off the following probabilities \[\begin{align*} P(E|S=7) &= \dfrac{2}{6} = \dfrac{1}{3}\\ P(E|S=8) &= \dfrac{2}{5} \\ P(E|S=9) &= \dfrac{2}{4} \\ P(E|S=10) &= \dfrac{2}{3}. \end{align*}\] As a result, the best strategy is to guess “yes” when a \(10\), \(11\), or \(12\) is rolled and to be indifferent to the guess when a \(9\) is rolled. Otherwise, guess “no.”

Recall that \(A\) and \(B\), as events, are merely subsets of the sample space, \(\mathcal{S}\). Each item in either \(A\) or \(B\) is one of the possible outcomes from the experiment or process that we are observing. Suppose that we know that \(B\) has occurred. What this means is that, one of the outcomes in the set \(B\) was the observed outcome from the experiment. Now, if we want to know \(P(A|B)\), we want to know the probability, working from the assumption that \(B\) has happened, that \(A\) also happens. That is, knowing that \(B\) has happened, what is the probability that \(A\) and \(B\) both happen.

The event that \(A\) and \(B\) both happen is denoted by the intersection, \(A \cap B\). This corresponds to the set of events inside the set \(B\) which also belong to the set \(A\). Now, instead of considering the joint probability directly, we need to acknowledge that for \(A|B\), only the events in \(B\) were possible. That is, instead of being divided by the whole space, we can only divide by the space of \(B\). In some sense, we can view conditioning on \(B\) as treating \(B\) as though it is the full sample space, and finding probabilities within that. In general, \(B\) will be smaller than \(\mathcal{S}\), and so \(P(B) < 1\). Instead of the conditional probability being “out of” \(1\), it will instead be “out of” \(P(B)\), which gives \(P(A|B) = \dfrac{P(A\cap B)}{P(B)}\).

Computing Conditional Probabilities

For an event \(A\), and an event \(B\) with probability \(P(B) > 0\), the conditional probability of \(A\) given \(B\) is \[P(A|B) = \dfrac{P(A\cap B)}{P(B)}.\]

To make this more clear, suppose that we take \(A\) to be the event that a \(2\) is rolled on a fair, six-sided die, and \(B\) to be the event that an even number was rolled. This is an equal probability model, and so each outcome gets \(\dfrac{1}{6}\) probability. The original sample space is \(\mathcal{S} = \{1,2,3,4,5,6\}\), the event \(A\) is \(\{2\}\), and the event \(B\) is \(\{2,4,6\}\). In order for both \(A\) and \(B\) to occur, we note that we need \(A \cap B = \{2\}\). If we know that \(B\) has occurred, then we know that either a \(2\), \(4\), or \(6\) has been rolled, with equal probability for each. Thus, intuitively, we can view \(B\) as the new sample space, and say that rolling a \(2\) has a \(\dfrac{1}{3}\) probability, given that there are \(3\) outcomes and \(1\) of them is the event of interest.

We can also compute this directly. Note that \(P(B) = \dfrac{1}{2}\), and so we need to scale each event by \(\dfrac{1}{1/2}\) in order to make sure that the total probability of our reduced sample space equals \(1\). Then \(P(A\cap B) = \dfrac{1}{6}\), so \[P(A|B) = \dfrac{1/6}{1/2} = \dfrac{1}{3}.\]

Suppose that, instead of a fair die, it was weighted so that \(6\) comes up more frequently than the other options. Consider the probability of observing a six to be \(0.5\), with the other five values each coming up with probability \(0.10\). If \(A\) and \(B\) are the same events as above, then \(P(B) = 0.7\).

If we know that \(B\) has occurred then the new sample space is \(\{2,4,6\}\) where \(P(2) = \dfrac{0.1}{0.7} = \dfrac{1}{7}\), \(P(4) = \dfrac{0.1}{0.7} = \dfrac{1}{7}\), and \(P(6) = \dfrac{0.5}{0.7} = \dfrac{5}{7}\). Note that these three probabilities sum to \(1\) still, which constitutes a valid probability model, and so \(P(A|B) = \dfrac{1}{7}\).

Example 4.2 (Six from the Sum - Revisited) Sadie sees the solution worked out for the best strategy in the dice game above, but is having trouble understanding it in the context of conditional probability more broadly. For the sums \(7, 8, 9\), and \(10\), describe the process of limiting the sample space to get the correct conditional probability, and use the formula to show that the probabilities worked out in Example 4.1 are correct.

The relevant table of solutions is copied from above.

Value Combinations Involving 6
7 (1,6) (2,5) (3, 4) (4, 3) (5, 2) (6, 1) (1, 6) (6, 1)
8 (2,6) (3,5) (4, 4) (5, 3) (6, 2) (2, 6) (6, 2)
9 (3,6) (4,5) (5, 4) (6, 3) (2, 6) (6, 2)
10 (4, 6) (5, 5) (6, 4) (4, 6) (6, 4)

Here, given a sum, we know that one of the events listed under “combinations” has occurred. As a result, once we have conditioned on what the sum is, we are able to treat the “combinations” column as the full sample space of possible outcomes. Each of these in this case is equally likely, and so we can divide the number which contain a \(6\), by the total number in the reduced sample space.

To do this algebraically, we first note that \[\begin{align*} P(S=7) &= \dfrac{6}{36} \\ P(S=8) &= \dfrac{5}{36} \\ P(S=9) &= \dfrac{4}{36} \\ P(S=10) &= \dfrac{3}{36}. \end{align*}\] Now, suppose we consider the event \(E\cap\{S=7\}\). This is the set \(\{(1,6),(6,1)\}\) and so \(P(E\cap\{S=7\}) = 2/36\). In fact, the same holds true for all of the joint events. As a result, we get \[\begin{align*} P(E|S=7) &= \dfrac{\dfrac{2}{36}}{\dfrac{6}{36}} = \dfrac{2}{6} \\ P(E|S=8) &= \dfrac{\dfrac{2}{36}}{\dfrac{5}{36}} = \dfrac{2}{5} \\ P(E|S=9) &= \dfrac{\dfrac{2}{36}}{\dfrac{4}{36}} = \dfrac{2}{4} \\ P(E|S=10) &= \dfrac{\dfrac{2}{36}}{\dfrac{3}{36}} = \dfrac{2}{3}. \end{align*}\] This corresponds exactly to the values we found before.

Sometimes, we wish to condition on more than one event. To do so, the same process extends naturally. For instance, suppose we want to know the probability of \(A\) given \(B\) and \(C\). This would be written \[P(A|B,C) = \dfrac{P(A\cap B\cap C)}{P(B \cap C)} = \dfrac{P(A,B,C)}{P(B,C)}.\] Moving beyond two events occurs in the expected way.

4.3 Using Conditional Probabilities

Conditional probability is a mechanism for capturing our knowledge of the world, and using that to update our sense of the uncertainties at play. For instance, suppose that we are interested in drawing a random card from a deck of \(52\), and we want to know the probability that it is a heart. Without any additional knowledge, the probability of this event is \(\dfrac{1}{4}\). Now, suppose that you know that it is a red card. In this case, we now know that it is either a heart or a diamond, and there are equal numbers of each, meaning that the new probability is \(0.5\). We can work this out directly \[P(\text{Heart}|\text{Red}) = \dfrac{P(\text{Heart},\text{Red})}{P(\text{Red})} = \dfrac{P(\text{Heart})}{1/2} = \dfrac{1/4}{1/2} = 0.5.\] Suppose instead that we had been told that the card was an ace. Here we now know that there are four possible outcomes that correspond to an ace, and only one of these is a heart, meaning the probability is \(\dfrac{1}{4}\). In this case, \(P(A|B) = P(A)\), and our beliefs did not update.

What if instead we had considered the second event to be “the card was a spade.” In this case if we want to know \(P(A|B)\) then, given a spade being drawn, we know that the probability of drawing a heart is \(0\).

Example 4.3 (Charles’s Mismatched Urn Mishap) Charles’s love of probability prompts a spontaneous decision: buy an urn and some coloured balls to fill it up with. Unfortunately, the supplier of the balls misunderstood the request and sent over an assortment of different shapes rather than just spheres. There are some spheres, some cubes, some pyramids, and some cones. There are also five different colours present, red, blue, green, yellow, and black. Charles is slightly dismayed as, when reaching into the urn, it is very easy to feel what shape you are pulling out before you see the object, and the distribution of colour-shape combinations is not even. Despite the dismay, Charles shows Sadie, who is deeply excited, explaining how this mismatched urn is perfect for understanding conditional probabilities deeply. The distribution of shapes and colours is presented in the following table.

Colour Sphere Cube Pyramid Cone
Red 2 3 2 0
Blue 1 0 0 6
Green 2 2 1 2
Yellow 0 4 2 1
Black 3 1 1 2
  1. What is the probability of drawing each colour, if items are selected at random, without knowledge of the shape?
  2. Assuming that the shape is known, what is the probability of selecting each colour?
  1. Note that there are \(2+3+2=7\) red, \(1+6=7\) blue, \(2+2+1+2=7\) green, \(4+2+1=7\) yellow, and \(3+1+1+2=7\) black objects. As a result, each colour is going to be equally likely, with a probability of \(0.2\).

  2. Take \(S\), \(C\), \(P\), \(Cn\) to be the events that a sphere, cube, pyramid, or cone are drawn, respectively. Take \(R\), \(B\), \(G\), \(Y\), \(Bk\) to be the events that a red, blue, green, yellow, or black object is drawn, respectively. We want the probability of each colour, given the corresponding shape. There are a total of \(20\) conditional probabilities to solve for here. We walk through, in full, the first conditional probability calculation. After, the same process follows to get the (provided) answer.

    First note that there are \(8\) spheres, \(10\) cubes, \(6\) pyramids, and \(11\) cones. Thus, the marginal probabilities are \(P(S) = 8/35\), \(P(C) = 10/35\), \(P(P) = 6/35\), and \(P(Cn) = 11/35\). Note further that the joint probability between any colour-shape combination is going to be given by the number of that combination that exist, divided by \(35\). Thus, suppose we want \(P(R|S)\). There are \(2\) red spheres, so \(P(R \cap S) = 2/35\). The marginal probability \(P(S) = 8/35\), so taken together this gives \[P(R|S) = \dfrac{2/35}{8/35} = \dfrac{2}{8} = \dfrac{1}{4}.\]

    Applying the same process gives the following probabilities, reported in the table for convenience.

    Colour Sphere Cube Pyramid Cone
    Red \(\dfrac{2}{8} = \dfrac{1}{4}\) \(\dfrac{3}{10}\) \(\dfrac{1}{3}\) \(0\)
    Blue \(\dfrac{1}{8}\) \(0\) \(0\) \(\dfrac{6}{11}\)
    Green \(\dfrac{2}{8} = \dfrac{1}{4}\) \(\dfrac{1}{5}\) \(\dfrac{1}{6}\) \(\dfrac{2}{11}\)
    Yellow \(0\) \(\dfrac{2}{5}\) \(\dfrac{1}{3}\) \(\dfrac{1}{11}\)
    Black \(\dfrac{3}{8}\) \(\dfrac{1}{10}\) \(\dfrac{1}{6}\) \(\dfrac{2}{11}\)

4.3.1 The Multiplication Rule

While sometimes we will want to work out the conditional probability using our knowledge of the joint and marginal probabilities, there are other times when it is easier to determine the conditional probability directly. In these settings we may wish to understand the marginal or joint probabilities. That is, we may know \(P(A|B)\), but we want to make statements regarding \(P(A)\) or \(P(A,B)\).

To do so, we can rearrange the defining relationship of conditional probability, to solve for the quantities of interest. Because of the importance of this procedure, we actually give this mostly straightforward rearrangement a special name.

Multiplication Rule

The multiplication rule states that, for two events \(A\) and \(B\) where \(P(B) > 0\), \[P(A\cap B) = P(A|B)P(B).\]

Note that, by multiplying both sides of the definition of \(P(A|B)\) by \(P(B)\) gives the result. In words, it states that we can solve for the joint probability of \(A\) and \(B\) by multiplying the conditional probability of \(A\) given \(B\), by the marginal probability of \(B\). This is symmetric in \(A\) and \(B\) so that \[P(A\cap B) = P(B|A)P(A).\] This is useful as sometimes it is easier to determine \(B\) given \(A\).

Example 4.4 (Package Delivery Times) To thank Sadie for the help in seeing the silver lining with the urn mishap, Charles decides to order a small gift online, sending it direct to Sadie. Unfortunately, the website does not list which delivery company each package is sent out with. After some sleuthing, Charles determines that there are two different companies that it may have been sent with. Looking at online reviews it appears as though company \(A\) is late \(75\%\) of the time while company \(B\) is late \(15\%\) of the time.

If the store that Charles ordered from sends out \(10\%\) of packages with company \(A\), and the rest with company \(B\), which is more likely: that the package is late and was sent with \(A\), or that the package was late and sent with \(B\)?

We will take \(L\) to correspond to the events where the package is late, \(A\) to the events where the package was sent with \(A\) and \(B\) to be the events where the package was sent with \(B\). We wish to compare \(P(L, A)\) and \(P(L, B)\). We know that \(P(A) = 0.1\) and \(P(B) = 0.9\), and we know that \(P(L|A) = 0.75\) and \(P(L|B) = 0.15\). As a result, we can use the multiplication rule to find the joint probabilities.

First, \[P(L, A) = P(L|A)P(A) = (0.75)(0.1) = 0.075.\] Additionally, \[P(L, B) = P(L|B)P(B) = (0.15)(0.9) = 0.135.\]

As a result it is more likely to have the package late and sent with \(B\) than to have the package late and sent with \(A\).1

4.3.2 Partitions and the Law of Total Probability

While the multiplication rule gives us the capacity to solve for joint probabilities, often we wish to make statements regarding marginal probabilities. Fortunately, we can extend the process outlined in the multiplication rule to solve directly for marginal probabilities as well. To do so, we first introduce the concept of a partition.

Definition 4.4 (Partition) A partition is a collection of sets which divide up the sample space such that all the sets are disjoint from one another, and the sample space is given by the union of all the sets. That is, \(A_1,\dots,A_n\) is a partition of \(\mathcal{S}\) if:

  1. \(A_i \cap A_j = \emptyset\) for all \(i \neq j\), and
  2. \[\bigcup_{i=1}^n A_i = \mathcal{S}.\]

For instance, if the sample space were all the positive integers, we could partition this space into all the even numbers as one set and all the odd numbers as a second. We could also partition this into the set of numbers which are less than \(10\), the set of numbers that are greater than \(10\), and then \(10\). In both examples we have sets whose union forms the full sample space with no overlap. Note that we could not partition the set into multiples of \(2\) and multiples of \(3\), since (i) not all values are contained between these two sets,2 and (ii) there is overlap between these two sets.3

Example 4.5 (Partitions of the Coin Game) Charles and Sadie are thinking back with fondness to their original coin flipping game, where they would toss a coin three times in a row. If two or more heads showed up, Charles would pay. Otherwise, Sadie would.

To help them reminisce, write down three different partitions of the sample space, each with a different number of partitioning sets. Describe your partitions using words.

There are many possible partitions to write down. The following are examples.

  1. We can partition the space into games where Charles pays and games where Sadie pays. This gives \[\begin{align*} B_1 &= \{(\text{H},\text{H},\text{H}), (\text{H},\text{H},\text{T}), (\text{H},\text{T},\text{H}), (\text{T},\text{H},\text{H})\}\\ B_2 &= \{(\text{T},\text{T},\text{T}), (\text{T},\text{T},\text{H}), (\text{T},\text{H},\text{T}), (\text{H},\text{T},\text{T})\}.\end{align*}\]

  2. We can partition the space into those with \(0\), \(1\), \(2\), or \(3\) heads. This gives \[\begin{align*} B_1 &= \{(\text{H},\text{H},\text{H})\}\\ B_2 &= \{(\text{T},\text{T},\text{H}), (\text{T},\text{H},\text{T}), (\text{H},\text{T},\text{T})\}\\ B_3 &= \{(\text{H},\text{H},\text{T}), (\text{H},\text{T},\text{H}), (\text{T},\text{H},\text{H})\}\\ B_4 &= \{(\text{H},\text{H},\text{H})\}. \end{align*}\]

  3. We can partition the space into the number of times the sequence switches between heads and tails. There will be either \(0\) switches, \(1\) switch, or \(2\) switches. This gives \[\begin{align*} B_1 &= \{(\text{H},\text{H},\text{H}), (\text{T},\text{T},\text{T})\}\\ B_2 &= \{(\text{T},\text{T},\text{H}), (\text{H},\text{T},\text{T}), (\text{H},\text{H},\text{T}), (\text{T},\text{H},\text{H})\}\\ B_3 &= \{(\text{T},\text{H},\text{T}), (\text{H},\text{T},\text{H})\}. \end{align*}\]

Partitions allow us to move from discussions regarding the joint probability of events to the marginal probability of an event. Suppose that we have a partition given by \(B_1, B_2, \dots\). This means that our full sample space can be cut up into these various non-overlapping sets, and every single outcome belongs to exactly one of them. Now, suppose we are interested in some other event \(A\). We can ask: how can \(A\) occur, in terms of the events \(B_1, B_2, \dots\)?

Since every single event in the sample space belongs to exactly one of our partitioning sets, then it must be the case that every single event in \(A\) belongs to exactly one of our partitioning sets. This means that if we consider \(A\cap B_j\), for all \(j\), then every single event in \(A\) must belong to exactly one of these. In other words, it must be the case that \[A = \bigcup_{j} A\cap B_j,\] for any partition \(B_1, B_2,\dots\). Moreover, every single \(A\cap B_j\) is disjoint from every other \(A \cap B_\ell\), whenever \(\ell \neq j\). This means that we can use the axiom of additivity to give \[P(A) = P\left(\bigcup_{j} A\cap B_j\right) = \sum_{j} P(A \cap B_j).\] In other words: the marginal probability of \(A\) can be found by summing over all joint probabilities between \(A\) and sets that form a partition. This argument gives the law of total probability.

Figure 4.1: This graphic shows the argument that summing over the joint probabilities between an event and a partition gives the full marginal probability. Note that \(B_1,\dots,B_7\) forms a partition of the space where every possible outcome is contained in exactly one of these sets. Then, if we take an arbitrary event \(A\), we can divide \(A\) into the components that intersect with each partitioning set, namely \(A \cap B_1\), \(A \cap B_2\), and so forth.
The Law of Total Probability

Given a partition, \(B_1, B_2, \dots\), and an event \(A\), the law of total probability states that \[P(A) = \sum_i P(A, B_i) = \sum_i P(A|B_i)P(B_i).\]

Intuitively, since the whole sample space is divided into the different \(B_i\)s, this rule breaks down the calculation of \(A\) happening into manageable chunks. Each term in the summation is “the probability that \(A\) happens, given \(B_i\) happening” weighted by how likely it is that \(B_i\) happens. Then by summing over all possible \(B_i\), we know that we must be capturing all possible ways that \(A\) can occur since all parts of the sample space are contained in exactly one of the sets of our partition. The law of total probability is an indispensable tool for computing probabilities in practice.

Example 4.6 (Sadie’s Possibly Late Package) Sadie has still not received the package that Charles had ordered. While it is not late yet, Charles decides to figure out the probability that the package will end up being late. Recall that company \(A\) is late \(75\%\) of the time while company \(B\) is late \(15\%\) of the time, and the store that Charles ordered from sends out \(10\%\) of packages with company \(A\), and the rest with company \(B\), which is more likely.

What is the probability that the package arrives late?

Note that \(A, B\) forms a partition of the sample space as every package is either sent with \(A\) or with \(B\), and no package can be sent with both. Further, if \(L\) represents the event that the package is late, then we know that \(P(L|A) = 0.75\) and \(P(L|B) = 0.15\). Since \(P(A) = 0.1\) and \(P(B) = 0.9\), an application of the law of total probability gives \[P(L) = P(L|A)P(A) + P(L|B)P(B) = (0.75)(0.1) + (0.15)(0.9) = 0.21.\] As a result, knowing nothing else, the package has a probability of \(0.21\) of being late.

Example 4.7 (Charles’ Many Urns) While shopping at some garage sales one Sunday morning, Charles and Sadie stumble across a wonderful find! They see three urns which are exactly identical to the one that Charles had already purchased to store the different balls which turned out to not be balls at all. Realizing the opportunity they splurge and purchase them all, and then divide the various objects between the four urns, placing all the spheres in one container, all the cubes in another, all the pyramids in a third, and all the cones in a fourth.

Once done, they use a different selection mechanism. First, they pick an urn at random. Next, they randomly grab one of the items from within it. The distribution of colours and shapes is included in the following table.

Colour Sphere (Urn 1) Cube (Urn 2) Pyramid (Urn 3) Cone (Urn 4)
Red 2 3 2 0
Blue 1 0 0 6
Green 2 2 1 2
Yellow 0 4 2 1
Black 3 1 1 2
  1. What is the probability that they select any of the five colours under this sampling scheme?
  2. How does this change if the probability that each urn is selected is proportional to the number of items in it? (Thus, urn 1 is selected with probability \(8/35\), and so forth).
  1. Let \(R\), \(B\), \(G\), \(Y\), and \(Bk\) be the events that a red, blue, green, yellow, or black ball are selected. Further, let \(U_1\), \(U_2\), \(U_3\), and \(U_4\) be the events that the first, second, third, or fourth urn are selected. Then note that, according to the law of total probability, \[P(R) = P(R, U_1) + P(R, U_2) + P(R, U_3) + P(R, U_4) = \sum_{j=1}^4 P(R|U_j)P(U_j).\] An equivalent argument holds for each of the other colours. Now, \[P(R|U_j) = \dfrac{N_{R\cap U_j}}{N_{U_j}},\] where \(N_{R\cap U_j}\) is the number of red objects in urn \(j\) and \(N_{U_j}\) is the total number in urn \(j\). Plugging this in and simplifying we get \[P(R) = \sum_{j=1}^4P(R|U_j)\left(\dfrac{1}{4}\right) = \dfrac{1}{4}\left\{\dfrac{2}{8} + \dfrac{3}{10} + \dfrac{2}{6}\right\} = \dfrac{53}{240}.\]

    We can apply analogous arguments to the other colours giving \[\begin{align*} P(B) &= \dfrac{1}{4}\left\{\dfrac{1}{8} + \dfrac{6}{11}\right\} = \dfrac{59}{352} \\ P(G) &= \dfrac{1}{4}\left\{\dfrac{2}{8} + \dfrac{2}{10} + \dfrac{1}{6} + \dfrac{2}{11}\right\} = \dfrac{527}{2640}\\ P(Y) &= \dfrac{1}{4}\left\{\dfrac{4}{10} + \dfrac{2}{6} + \dfrac{1}{11}\right\} = \dfrac{34}{165}\\ P(Bk) &= \dfrac{1}{4}\left\{\dfrac{3}{8} + \dfrac{1}{10} + \dfrac{1}{6} + \dfrac{2}{11}\right\} = \dfrac{1087}{5280}. \end{align*}\]

    Note that these probabilities sum to \(1\). In decimal these simplify to approximately 0.221, 0.168, 0.2, 0.206, 0.206.

  2. Using the same setup as before, we have \[P(R) = P(R, U_1) + P(R, U_2) + P(R, U_3) + P(R, U_4) = \sum_{j=1}^4 P(R|U_j)P(U_j).\] Now \(P(U_1) = 8/35\), \(P(U_2) = 10/35\), \(P(U_3) = 6/35\), and \(P(U_4) = 11/35\). Moreover, the conditional probabilities themselves do not change, and so instead we have \[P(R) = \left\{\dfrac{2}{8}\cdot\dfrac{8}{35} + \dfrac{3}{10}\cdot\dfrac{10}{35} + \dfrac{2}{6}\dfrac{6}{35} + \dfrac{0}{11}\cdot\dfrac{11}{35}\right\} = \dfrac{N_R}{35}.\] Note that when multiplying by the marginal probability of the urn, the denominator will always cancel. As a result, we end up with the total number of reds over \(35\), which leads to \(P(R) = \dfrac{1}{5}\). The same will be true for the other colours, and as a result, if we choose the urn based on a weighted selection, this will result in equal probability once more.

4.3.3 Bayes’ Theorem

We have seen the direct computation of marginal probabilities (while using an equally likely outcome model), the computation of conditional probabilities, the use of the multiplication rule for joint probabilities, and the use of the law of total probability to indirectly calculate marginal probabilities through conditioning arguments. Throughout these discussions we have been primarily concerned with keeping events \(A\) and \(B\) arbitrary. Everything that we have indicated for \(P(A)\) holds for \(P(B)\), as does \(P(A|B)\) and \(P(B|A)\). In reality, it will often be the case that conditioning on one of the events will be natural, while conditioning on the other will be more tricky. In these events, it can be useful to be able to transform statements regarding \(P(A|B)\) into statements regarding \(P(B|A)\), and vice versa.

Note that because the definitions are symmetric, \[P(A|B)P(B) = P(A,B) = P(B|A)P(A).\] This is an application of the multiplication rule in two different orientations. If we divide both sides of the equality by \(P(B)\), assuming that it is not \(0\), then we get \[P(A|B) = \dfrac{P(B|A)P(A)}{P(B)}.\] Now, if we form a partition, say \(A,A_2,A_3,\dots\), then we can rewrite \(P(B)\) using the law of total probability as \[P(B) = P(B|A)P(A) + P(B|A_2)P(A_2) + \cdots = P(B|A)P(A) + \sum_{i=2}P(B|A_i)P(A_i).\] Taken together this gives a result known as Bayes’ Theorem.4

Bayes’ Theorem

Suppose that there are two events, \(A\) and \(B\), with \(P(B) > 0\). Moreover, suppose that \(A\) taken with \(A_2, A_3, \dots\) forms a partition. Then Bayes’ Theorem states that \[P(A|B) = \dfrac{P(B|A)P(A)}{P(B)} = \dfrac{P(B|A)P(A)}{P(B|A)P(A) + \sum_{i=2} P(B|A_i)P(A_i)}.\]

Bayes’ Theorem allows us to convert statements regarding \(P(B|A)\) into statements regarding \(P(A|B)\). Note that, as we derived above, Bayes’ Theorem is an application of the multiplication rule and an application of the law of total probability.5 Sometimes we may have \(P(B)\) directly, rendering the law of total probability in the denominator unnecessary.

Often, the natural partition to select when we do need the law of total probability is to take \(A\) and \(A^C\). Note that any set with its complement forms a partition, since by definition they occupy the entire space and are non-overlapping. When this is done we get the slightly more compact relationship of \[P(A|B) = \dfrac{P(B|A)P(A)}{P(B|A)P(A) + P(B|A^C)P(A^C)}.\]

Bayes’ Theorem differs from our previous relationships as it allows us to translate one set of conditional probabilities into another. Every other relationship we have looked at has moved between types of probabilities, whereas Bayes’ Theorem deals directly with conditional relationships.

A commonly used example application for Bayes’ Theorem is medical testing. Suppose that we know the performance characteristics of a particular medical test: it is \(99\%\) accurate for positive cases, and \(95\%\) accurate for negative cases. That is, with probability \(0.99\) it correctly returns positive when an individual is infected, and with probability \(0.95\) it returns negative when an individual is not infected. These are both statements of conditional probability.

If we take \(A\) to be the event that the test returns positive, and \(B\) to be the event that the patient is infected,6 then we are saying that \(P(A|B) = 0.99\) and \(P(A^C|B^C) = 0.95\) which means that \(P(A|B^C) = 0.05\).7 Suppose that we know that, across the entire population, one in a thousand individuals is likely to be infected. This means that \(P(B) = 0.001\).

Now if a random individual goes into a doctor’s office and tests positive for the disease, how likely are they to actually be infected? In this case we want to know the probability of them being infected given that they have tested positive. In notation, this is \(P(B|A)\). We do not know this quantity directly, but given an application of Bayes’ Theorem, we can find it. Using the natural partition of \(B\) and \(B^C\), we get \[P(B|A) = \dfrac{P(A|B)P(A)}{P(A|B)P(B)+P(A|B^C)P(B^C)} = \dfrac{(0.99)(0.001)}{(0.99)(0.001)+(0.05)(0.999)} \approx 0.019.\] That is, despite the fact that this test is exceptionally effective at detecting this disease, a positive test still means that an individual has a probability of only \(0.019\) of actually having the illness.8

Example 4.8 (Sadie’s Late Package) Sadie eventually received the package that Charles had sent, but it arrived very late. Sadie was not home when the package was delivered and there no obvious markings on the box to indicate which of the two delivery companies had sent it.

Given this information, and knowing that company \(A\) is late \(75\%\) of the time while company \(B\) is late \(15\%\) of the time, and the store that Charles ordered from sends out \(10\%\) of packages with company \(A\), and the rest with company \(B\), what is the probability that the package was delivered by each of the two companies?

We want to determine \(P(A|L)\) and \(P(B|L)\). We know that \(P(L|A) = 0.75\) and \(P(L|B) = 0.15\). Moreover, we know that \(P(A) = 0.1\) and \(P(B) = 0.9\). Applying Bayes’ Theorem directly we get \[P(A|L) = \dfrac{P(L|A)P(A)}{P(L|A)P(A) + P(L|B)P(B)} = \dfrac{(0.75)(0.10)}{(0.75)(0.10) + (.10)(0.90)} = \dfrac{5}{11}.\] Similarly, we get \[P(B|L) = \dfrac{P(B|A)P(B)}{P(L|A)P(A) + P(L|B)P(B)} = \dfrac{(0.15)(0.90)}{(0.75)(0.10) + (.10)(0.90)} = \dfrac{6}{11}.\] Note, we could have also used the fact that \(P(A|L) + P(B|L) = 1\) to determine this. As a result, it is more likely that \(B\) delivered the package than \(A\).9

Bayes’ Theorem highlights a key lesson when considering conditional probabilities, and it’s a common mistake to make which should be avoided at all costs. Namely, we cannot interchange \(P(A|B)\) with \(P(B|A)\). These probabilities are not necessarily highly correlated with one another, and it is important to distinguish clearly which is the probability of interest.10 The stakes of these types of confusion can be quite high, and it is tremendously important to ensure that you are conditioning on the correct events. Fortunately, Bayes’ Theorem allows us to translate between events for conditioning, giving a mechanism from translating between the two.11

4.4 Independence

We have seen that, in most cases, conditioning on an event changes the probability of that event. For instance, if we want to know the probability it is raining, if we condition on knowing that it is a day full of gray skies, the conditional probability is likely higher than the marginal probability. By considering how the marginal and conditional probabilities differ, we are in effect indicating a dependence of the events on one another. In terms of probability, this dependence is captured by an influence on the degree of uncertainty present depending on what we know.

It is totally possible that two events do not influence one another at all. The weather outside today is likely not influenced by your favourite sports team’s performance last night.12 In this case, we would have \(P(A|B) = P(A)\).

We saw an example of this previously when we wanted to know the probability of a randomly selected card being a heart (\(A\)) given that it was an ace (\(B\)). We found that this was \(\dfrac{1}{4}\), exactly the same as the probability if we did not know that it was an ace. Thus, here we have \(P(A|B)=P(A)\). We could have also said that \(P(B|A)=P(B)=\dfrac{1}{13}\). The symmetry of these events makes it somewhat more convenient to express this relationship differently.

Instead of writing \(P(A|B) = P(A)\) and \(P(B|A) = P(B)\), we can multiply the first relationship by \(P(B)\) on both sides, or the second by \(P(A)\) on both sides. The multiplication rule gives \(P(A|B)P(B) = P(A,B)\), and so the first relationship becomes \(P(A,B) = P(A)P(B)\). The second follows exactly the same. Any two events that satisfy this relationship are said to be independent.

Definition 4.5 (Independence) Any two events, \(A\) and \(B\), which satisfy \(P(A,B) = P(A)P(B)\) are said to be independent. If \(A\) and \(B\) are independent we write \(A\perp B\), and read “\(A\) is independent of \(B\)”. Any events which are not independent are said to be dependent, and we write \(A\not\perp B\).

Note that independence is always a symmetric property: if \(A\) is independent of \(B\), then \(B\) is independent of \(A\). To check whether two events are independent, we check whether their joint probability is equal to the product of their marginal probabilities.

Note that if \(A\perp B\), then \(A^C\perp B\), \(A^C\perp B^C\), and vice versa. To see this note \[P(A^C, B) + P(A, B) = P(B),\] by the Law of Total Probability. Then, by independence of \(A\) and \(B\) this gives \[\begin{align*} P(A^C, B) + P(A)P(B) &= P(B) \\ \implies P(A^C, B) &= P(B) - P(A)P(B) \\ &= P(B)(1 - P(A)) \\ &= P(B)P(A^C),\end{align*}\] as is required. The other combinations follow in the same manner.

If \(P(A)\neq 0\), then we can divide both sides by \(P(A)\) to gives \(P(B|A) = P(B)\). Similarly, if \(P(B)\neq 0\), then we can divide both sides by \(P(B)\) to give \(P(A|B)=P(A)\). This expression in terms of conditional probabilities is the more intuitive expression of independence. It directly captures the idea that “knowing \(B\) does not change our belief about \(A\)”. However, we must be careful. This conditional argument is only valid when the event that is being conditioned on is not probability \(0\), where the defining relationship, \(P(A,B) = P(A)P(B)\), will hold for all events. Recall that, in general, \[P(A\cap B) = P(A)+P(B)-P(A\cup B).\] It is only when assuming independence that this simplifies further.

Remark (The Multiplication and Addition Rules of Probability). Often, in elementary probability and statistics courses, the Multiplication Rule and the Addition Rule for calculating probabilities are emphasized and named. It is emphasized that to calculate probabilities, we will often multiply together other probabilities, or else add a sequence of them. These rules often form the backbone of probability computation. At this point, we have now seen both the typical Multiplication Rule and the typical Addition Rule, without drawing specific attention to them as such.

The multiplication rule states that, for independent events \(A\) and \(B\), \(P(A\cap B) = P(A)P(B)\). That is, with independent events if you want to know the probability that they both happen, you multiply together the probabilities. The addition rule states that, for disjoint events \(A\) and \(B\), \(P(A \cup B) = P(A) + P(B)\). That is, with disjoint events if you want to know the probability that either or occurs, you add them together. These can be quite handy tools when correctly applied.

For instance, if we assume that we flip a coin then roll a die, we can say that the probability that you get a head and an even number must be \(\dfrac{1}{2}\times\dfrac{1}{2} = \dfrac{1}{4}\) since \(P(\text{Heads}) = \dfrac{1}{2}\) and \(P(\text{Even}) = \dfrac{1}{2}\), and the coin flip must be independent of the die roll. Similarly, suppose that we wish to know what the probability that, when drawing a card from a standard deck of \(52\) cards, the result is an Ace, Three, Six, or King. Here we can say that this must be \(\dfrac{4}{52} + \dfrac{4}{52} + \dfrac{4}{52} + \dfrac{4}{52} = \dfrac{4}{13}\). This is because each rank has a \(\dfrac{4}{52}\) chance of occurring, and each of these events are disjoint.

So, if the multiplication and addition rules are handy, why do we not discuss them directly in these notes? As pointed out, we did learn both of the rules: the multiplication rule is the defining relationship for independent events, and the addition rule is the axiom of additivity. However, highlighting these as substantially more important than the surrounding topics can lead to deeper struggles with probability calculations. Specifically, these two techniques apply in only very limited situations: if we know that events are independent, or we know that events are disjoint. Most of the time, neither assumption is going to hold, and in those settings we need to move past multiplying or adding to critically thinking about the specific quantities. This can lead to deeper confusion in translating these beyond simple examples.

The concern here moves beyond ensuring success in introductory sequences of probability and statistics. Uncritical deference to the multiplication and addition rules can have dire real-world consequences. As one particularly sobering example, Sally Clark was convicted in 1999 of the murder of her two sons, each of whom had died a few weeks following their birth. The defense claimed that the deaths were the results of Sudden Infant Death Syndrome (SIDS). The prosecution presented a statistical analysis from a paediatrician, Roy Meadow, who claimed that the probability of an infant dying of SIDS is approximately \(\dfrac{1}{8500}\). An application of the multiplication rule then tells us that the probability that both of Sally Clark’s sons would die of SIDS is \[\dfrac{1}{8500}\times\dfrac{1}{8500} = \dfrac{1}{72,250,000}.\] This is very rare, and seemed strong enough evidence to convict Sally Clark. The conviction was later overturned on an appeal, owing to both additional evidence coming to light and a critique of the statistical reasoning employed in the case published by the Royal Society of Statistics, but only after Sally Clark had served more than three years of a prison sentence.

The stakes of these types of calculations can be tremendously high, and it is critically important to not rely on simplified rules when working with probabilities and statistics. As individuals armed with the tools of probability and statistics it is imperative that we remain responsible with our application of these techniques; there is an air of authority that is carried by quantitative statements, and when misplaced, the consequences can be dire.

Example 4.9 (Independence and Board Games) Charles and Sadie are invited over by their friend, Garth, to play some board games. Both of them are quite excited by this prospect! Board games feel like the next logical step for the two of them to highlight their love of both probability and games! Garth has a large board game collection, and begins to explain the games to Charles and Sadie across a variety of axes:

  • Whether the games are competitive or cooperative.
  • How many players the games play best at (\(\{1, 2, 3, 4+\}\)).
  • How “heavy” the games tend to be (friendly for everyone, moderately involved, or very heavy).

While Garth continues on about several other topics including the themes, the mechanics, or the rating on BoardGameGeek, Charles drifts off wondering whether the traits listed are independent in Garth’s collection. Suppose that Garth has \(100\) games, of which:

  • \(25\) are cooperative.
  • \(5\) are best played at one player, \(20\) at two, and \(55\) at three.
  • \(10\) are friendly for everyone and \(45\) are moderately involved.
  1. If \(5\) games are best played at two players and are cooperative, are these events independent?
  2. If \(20\) games are heavy and best played with \(4+\) players, are these events independent?
  3. If \(0\) games are both moderately involved and cooperative, are these events independent?
  4. Charles figures that competitive games and two player games are independent. How many competitive two player games are there?
  5. Is it possible that competitive games are independent of heavy games?
  1. We are suggesting that \(P(A,B) = 0.05\), where \(A\) is “two player” and \(B\) is “cooperative”. We know that \(P(B) = 0.25\) and \(P(A) = 0.20\). Calculating \(P(A)P(B) = (0.20)(0.25) = 0.05 = P(A,B)\), and so these traits are independent.

  2. We have \(P(A,B) = 0.2\) where \(A\) is “heavy” and \(B\) is “\(4+\) players”. We know that \(P(A) = 0.45\) and \(P(B) = 0.20\), and so \(P(A)P(B) = (0.45)(0.20) = 0.09 \neq P(A,B)\). As a result, these traits are not independent.

  3. We know that \(P(A) > 0\) and \(P(B) > 0\) for \(A\) being moderately involved and \(B\) being cooperative. As a result, \(P(A)P(B) > 0\), and so if \(P(A,B) = 0\), they must not be independent.

  4. Taking \(A\) to be “competitive” we have \(P(A) = 0.75\). Taking \(B\) to be “two player” we have \(P(B) = 0.20\). As a result, if \(A\perp B\) then \(P(A,B) = P(A)P(B) = (0.75)(0.20) = 0.15\). Thus, there must be \(15\) competitive, two player games.

  5. Taking \(A\) to be “competitive” we have \(P(A) = 0.75\). Taking \(B\) to be “heavy” we have \(P(B) = 0.45\). Thus, if they were independent, we would have \(P(A,B) = P(A)P(B) = (0.75)(0.45) = 0.3375\). This would require \(34.75\) games to be heavy, competitive games, and so we must conclude that they are not independent.

4.4.0.1 Mutually Exclusive Events

Importantly, if \(A\perp B\), then \(A\cap B\neq \emptyset\) unless at least one of \(A\) or \(B\) equals \(\emptyset\) or \(\mathcal{S}\). To see this recall that \(P(\emptyset) = 0\), and so if \(A\cap B = \emptyset\) then \(P(A\cap B) = P(A)P(B) = P(\emptyset) = 0\). Moreover, \(P(\mathcal{S}) = 1\) and \(\mathcal{S} \cap B = B\), so \(P(\mathcal{S} \cap B) = 1P(B) = P(B)\). This only holds if either probability is \(0\) or \(1\). This may seem to be a rather technical point, however, it is the source of much confusion regarding independence. In particular, it is common to mistake independent events for mutually exclusive events.

Definition 4.6 (Mutually Exclusive Events) Two events, \(A\) and \(B\) are said to be mutually exclusive if they are disjoint. In particular, if \(P(A,B) = 0\) then \(A\) and \(B\) are mutually exclusive. If \(A\) and \(B\) are mutually exclusive events, with \(P(A) > 0\) and \(P(B) > 0\), then \(A\not\perp B\).

Whenever only one event from a set of events can happen, we refer to the events as being mutually exclusive. If one happens, we know that the others did not. Mutually exclusive events, with positive probability, are always dependent since knowing that \(A\) occurs dramatically shifts our belief about \(B\),\(C\), and \(D\).13

The primary concern with mutually exclusive and independent events is a linguistic one. We often use words like independent to mean unrelated, and in a sense, mutually exclusive events are unrelated in that one has nothing to do with another. However, in statistics and probability, when we discuss independence, it is not an independence of the events themselves but rather an independence relating to our beliefs regarding the uncertainty associated with the events. In this sense, mutually exclusive events are very informative regarding the uncertainty associated with them.

Example 4.10 (Charles and Sadie Cooking Dinner) Suppose that Charles and Sadie always eat dinner together. It will either be the case that Charles cooks at home, that Sadie cooks at home, that the two of them order in, or that they go out to eat. If we take these to be four events, \(A\), \(B\), \(C\), and \(D\), then are any of these events independent? Mutually exclusive? Explain.

Assuming, as it seems reasonable given the information in the question, that only one of the possible tasks occurs for dinner in a night, then we know that \[P(A,B) = P(A,C) = P(A,D) = P(B,C) = P(B,D) = P(C,D) = 0.\] As a result, all of the events described are mutually exclusive, and correspondingly, are not independent.

4.5 Contingency Tables

Through to this point we have discussed probabilities in the abstract, either through an enumeration of equally likely outcomes, or else by directly specifying the likelihood of various events. While these are useful in many regards, we are often looking for more concise manners of summarizing information of interest. One tool for accomplishing this is a contingency table.

Definition 4.7 (Contingency Table) A contingency table is a tabular summary of information which summarizes the joint probabilities of two or more variables. Typically a contingency table will take one factor for the columns and a secondary factor for the rows, where each cell then represents the frequency with which observations occur in the two corresponding categories simultaneously.

To begin, you could imagine constructing a frequency table relaying the frequency with which undergraduate students are enrolled in various faculties at a particular university. This tells you, of the whole population of students at the university, what is the faculty breakdown. By dividing the number in each faculty by the total number of students, you convert the frequencies to proportions, and these proportions can be viewed as probabilities. (See Table 4.1.14) The interpretation of proportions as probabilities implies a very specific statistical experiment. In particular, the proportion represents the probability that an individual selected at random from the entire population has the given trait. This is frequently a probability of interest, which makes these summary tables a useful tool.15

Table 4.1: Frequency and corresponding proportions of enrolment by faculty in a university.
Faculty Enrolment Proportion
Arts 1266 0.2182382
Computer Science 749 0.1291157
Education 786 0.1354939
Engineering 1315 0.2266851
Law 266 0.0458542
Nursing 543 0.0936046
Science 876 0.1510084
Total 5801 1.0000000

When a single trait is displayed we refer to these tabular summaries as frequency tables or frequency distributions. A contingency table instead plots two or more traits on the same table, with each cell representing the frequency of both traits occurring simultaneously in the population. Extending the university example, we may further include the student’s current year of study to see the breakdown of both faculty and year of study, in one table.

Table 4.2: Frequency of enrolment by faculty and year of study in a university.
Year 1 Year 2 Year 3 Year 4 Year 5+
Arts 222 276 273 225 270 1266
Comp. Sci. 164 87 164 90 244 749
Education 100 136 184 128 238 786
Engineering 189 290 354 298 184 1315
Law 54 50 58 37 67 266
Nursing 55 90 154 112 132 543
Science 242 114 206 183 131 876
1026 1043 1393 1073 1266 5801

By including two (or more) factors in the table we are able to capture not only the marginal probabilities for the population, but also the joint probabilities for the population, and in turn, the conditional probabilities for the population. Being able to concisely summarize all of these concepts regarding traits in a population of interest renders contingency tables immensely useful in the study of uncertainty broadly.

Consider the two-way contingency table, Table 4.2. Each cell consists of frequency with which a combination of the two traits occurs in the population.16 If we take events corresponding to each of the levels of the two variables of interest, then these central cells represent the frequency of joint events. That is, each interior cell gives the total number of observations with a set level for variable one17 AND a set level for variable two18. For instance, there are 50 individuals who are in Law and studying in Year 2.

Each row is then summed, with the total number following into the corresponding row recorded in the right-hand margin. Each column is summed, with the total number corresponding to the given column recorded in the bottom margin. For instance there are a total of 786 students in Education and a total of 1393 in Year 3. Then the margin totals are summed, and the total is recorded in the lower right margin space (in this case, 5801). Whether the rows or columns are summed, they should sum to the same total, which is the total of the population under consideration. This is the same as simply adding all the observed interior frequencies. To turn a frequency into a probability, you need only divide the correct frequency by the correct total.

Table 4.3: Proportions of enrolment by faculty and year of study in a university.
Year 1 Year 2 Year 3 Year 4 Year 5+
Arts 0.0382693 0.0475780 0.0470609 0.0387864 0.0465437 0.2182382
Comp. Sci. 0.0282710 0.0149974 0.0282710 0.0155146 0.0420617 0.1291157
Education 0.0172384 0.0234442 0.0317187 0.0220652 0.0410274 0.1354939
Engineering 0.0325806 0.0499914 0.0610240 0.0513705 0.0317187 0.2266851
Law 0.0093087 0.0086192 0.0099983 0.0063782 0.0115497 0.0458542
Nursing 0.0094811 0.0155146 0.0265471 0.0193070 0.0227547 0.0936046
Science 0.0417169 0.0196518 0.0355111 0.0315463 0.0225823 0.1510084
0.1768661 0.1797966 0.2401310 0.1849681 0.2182382 1.0000000

For the standard joint probabilities, you take the interior cell count and divide by the population total. Here we are saying that some fixed number, \(m\), of the \(N\) total individuals have both traits under consideration. For instance, the joint probability that a student is in Law and studying in Year 2 is 0.0086192.19 If instead you wish to find a marginal probability, you have to consider the value in the corresponding margin: this is the total number of individuals with the given trait, ignoring the level of the other variable. These marginal values are also divided by the total population size. For instance, there is a 0.1354939 probability of observing a student in Education and a probability 0.240131 of observing a student in Year 3.

Outside of joint and marginal probabilities, we can also find conditional probabilities. To do so, we restrict our focus to either only one row, or one column. Then, we can take the joint cell and divide by the value in the margin, which gives the conditional probability of interest. Note, this works with either the contingency table directly or with the propensity table. The reasoning is that the propensity table divided by the same totals in the numerator and the denominator. Suppose we take some cell, \(A\cap B\), which has \(N_{A, B}\) in the contingency table. Then, \[P(A\cap B) = \dfrac{N_{A, B}}{N} \quad\quad\text{and}\quad\quad P(B) = \dfrac{N_B}{N}.\] If we consider then forming \(P(A|B)\) we get \[P(A|B) = \dfrac{P(A \cap B)}{P(B)} = \dfrac{N_{A \cap B}/\cancel{N}}{N_B/\cancel{N}} = \dfrac{N_{A, B}}{N_B}.\] Thus, given that we know a student is in Education, then the probability that they are in Year 3 is approximately 0.2341.

Note that these procedures are exactly in line with what we had seen before. The conditional probability is defined as the joint probability divided by the marginal probability. The process of computing the marginal can be seen as an application of the law of total probability. As a result, contingency tables can be a useful, tangible tool for investigating the techniques we have been discussing: they are not a substitute for direct manipulation of the mathematical objects, but they can present insight into the underlying processes where it may be hard to derive that insight otherwise.

Recall that the law of total probability states that, if \(B_1,\dots,B_n\) forms a partition of the sample space, then \[P(A) = \sum_{i=1}^n P(A|B_i)P(B_i) = \sum_{i=1}^n P(A, B_i)\]. In a contingency table it is easier to see how either of the two factors at play (either those in the rows or the columns) forms a partition of the space. Every observation has to fall in exactly one row and exactly one column. Thus, if we want to know the marginal frequency of a single trait (say represented by some row of the table), then one way to find this total is to sum up every observation in each of the corresponding columns. This is precisely the same process as the law of total probability. Note that, denoting each column as \(B_i\), and supposing there are \(k\) total columns, then the total number in a row is taken to be \[N_A = \sum_{i=1}^k N_{A, B_i},\] simply as the summation of the corresponding row. By definition, \(P(A) = \dfrac{N_A}{N}\), and so dividing both sides by \(N\) gives \[P(A) = \dfrac{N_A}{N} = \dfrac{1}{N}\sum_{i=1}^k N_{A, B_i} = \sum_{i=1}^k \dfrac{N_{A, B_i}}{N} = \sum_{i=1}^n P(A\cap B_i).\]

It is important to note that there is redundant information within a contingency table. For instance, the margins need not be listed explicitly, as they can be directly calculated from the interior points. The same goes for interior points, given the margins of the table (assuming some interior points are also presented). This can be useful for a compact representation of the information, and manipulating these tables – being able to find the required information in many places – should become familiar to you as you continue to work with them more and more.

Example 4.11 (The Evolving Contents of Charles’ Urns) After learning of contingency tables, Sadie points out to Charles that with the whole urn debacle they went through, the two of them actually ended up using a contingency table to summarize the information. How neat! In the interim, there has been some development in the contents of Charles’s urns, and armed with the new knowledge of contingency tables, the following summary is produced.

Cone Cube Pyramid Sphere
Black 2 A 1 3 10
Blue 1 2 1 1 5
Green 3 3 B 5 15
Red 2 C 3 2 D
Yellow 5 3 4 1 13
13 E 13 12 50

Sadly, Charles does not have the best writing and Sadie cannot make out what values were written in for the cells marked \(A\), \(B\), \(C\), \(D\), and \(E\).

  1. What are the values for the missing values?
  2. What is the probability that a black cube is drawn?
  3. What is the probability that a red object is drawn?
  4. What is the probability that a cube is drawn?
  5. Given that the drawn object was a pyramid, what is the probability that it is green?
  6. Given that the object is green, what is the probability that it is a pyramid.
  1. We start by filling in the missing cells, in order.

    1. For \(A\) we can note that \(2 + A + 1 + 3 = 10\), which gives that \(A = 4\). We could have also tried a column sum, however, this would involve \(3\) unknowns and so would not have given a numeric result.

    2. For \(B\) either a row sum or column sum would produce the correct answer. We either take \(3 + 3 + B + 5 = 15\) giving \(B = 4\) or \(1 + 1 + B + 3 + 4 = 13\) giving \(B=4\).

    3. \(C\) is more challenging as we either have that \(2 + C + 3 + 2 = D\) or \(4 + 2 + 3 + C + 3 = E\)20 As a result, for this technique we will need to find either \(D\) or \(E\) first. There is an alternative technique to use, which would be to note that we have all the internal points specified at this point, and we know that these sum to \(50\). As a result, we could note that \(C = 50 - 10 - 5 - 15 - 13 - 2 - 3 - 2 = 0\) or that \(C = 50 - 13 - 13 - 12 - 4 - 2 - 3 - 3 = 0\).

    4. We can find \(D\) either by subtracting from the total, \(D = 50 - 10 - 5 - 15 - 13 = 7\), or by adding along the row, \(2 + 0 + 3 + 2 = 7\).21 Note that with \(D = 7\), had we not found \(C = 0\) above, we could now take \(2 + C + 3 + 2 = 7\) and find \(C = 0\).

    5. Like \(D\), we have two options for working this out. Either \(E = 50 - 13 - 13 - 12 = 12\) or \(E = 4 + 2 + 3 + 0 + 3 = 12\).22 Like with \(D\), had we solved for \(E\) first, then we could find \(C\) via \(4 + 2 + 3 + C + 3 = 12\).

  2. Here we want \(P(\text{Black}, \text{Cube})\). This is given by the frequency of black cubes divided by \(50\). Thus, \[P(\text{Black}, \text{Cube}) = \dfrac{A}{50} = \dfrac{4}{50} = 0.08.\]

  3. Here we want \(P(\text{Red})\). This is given by the marginal frequency of red objects divided by \(50\). Thus, \[P(\text{Red}) = \dfrac{D}{50} = \dfrac{7}{50} = 0.14.\]

  4. Here we want \(P(\text{Cube})\). This is given by the marginal frequency of cubes divided by \(50\). Thus, \[P(\text{Cube}) = \dfrac{E}{50} = \dfrac{12}{50} = 0.24.\]

  5. Here we want \(P(\text{Green}|\text{Pyramid})\). This is given by the frequency of green pyramids divided by the marginal frequency of pyramids. Thus \[P(\text{Green}|\text{Pyramid}) = \dfrac{B}{13} = \dfrac{4}{13} \approx 0.308.\]

  6. Here we want \(P(\text{Pyramid}|\text{Green})\). This is given by the frequency of green pyramids divided by the marginal frequency of green objects.23 Thus \[P(\text{Pyramid}|\text{Green}) = \dfrac{B}{15} = \dfrac{4}{15} \approx 0.267.\]

It is also important to recognize that independence and mutually exclusive events can be codified via the table as well. Zeros on the interior points indicate events which are mutually exclusive: if we know that one of them occurred, we also know that the other one did not. For independence, it requires a degree of solving proportions. We can either check that the joint probability (\(N_{A,B}/N\)) is equal to the product of the two marginal probabilities, (\(N_AN_B/N^2\)), or else (assuming that the events are all non-zero), that the conditional probability (\(N_{A,B}/N_A\)) equals the marginal probability \(N_B/N\). Either way this is represented by \(N\times N_{A,B} = N_AN_B\), and when this holds, we can conclude that the events are independent.

Example 4.12 (Independent or Mutually Exclusive Urn Shapes) After helping Charles neatly fill in the contingency table, Sadie begins to wonder about whether there are any shape-colour combinations which are independent, or if any are mutually exclusive.

Cone Cube Pyramid Sphere
Black 2 4 1 3 10
Blue 1 2 1 1 5
Green 3 3 4 5 15
Red 2 0 3 2 7
Yellow 5 3 4 1 13
13 12 13 12 50
  1. Are there any events represented in the contingency table which are mutually exclusive?
  2. Are there any events represented in the contingency table which are independent?
  1. Mutually exclusive events are codified via a \(0\) frequency. We can see then that \(\text{Cube}\) and \(\text{Red}\) are mutually exclusive. That is, if we know that an object is red, we also know that it is not a cube. And if we know that an object is a cube, then we know that it is not red.

  2. Independence is more cumbersome to check. We require the product of marginal counts to be equal to the total times by the joint counts. As a result, the easiest way to check is to consider the product of each marginal value in the columns by each marginal value in the row, divided by \(50\). If this value corresponds to the value in that row/column pairing, then we know that those features are independent. Note that immediately we can rule out any products which are not divisible by \(50\), since we know that \(N_{A,B}\) is an integer for all \(A,B\) and \(N_{A,B} = N_AN_B/50\) if there is independence. To this end, we only need to check the product of each of the row totals with each of \(12\) and \(13\) (as those are the only two unique column totals). This gives, in order \(120\), \(130\), \(60\), \(65\), \(180\), \(195\), \(84\), \(91\), \(156\), and \(169\). None of these values are divisible by \(50\) and as a result we know that there is no independence codified in this table.

4.5.0.1 Contingency Tables and Data Frames in R

In R, we can use the table and prop.table to calculate the (interior) of a contingency table. This will return a table type object in R, which can be thought of as a matrix of sorts. On it, we can use the functions rowSums and colSums to get the summation of the rows and columns, respectively. Typically, the object we pass to table will be a data frame. A data frame is another R object type that we have not yet seen. The idea with a data frame is that we have multiple columns with different variables (of possibly different types) represented. Each row corresponds to a single observation, and then each column is read as a feature of those observations. Data frames are essentially large spreadsheets (or data tables) which indicate the various observations that we have. In practice, data frames are the most commonly used object in an R analysis, and we will see them plenty going forward.

With a data frame specified, we can then use the table function called on it to form a contingency table.

Then, to get the totals, we can either use rowSums or colSums to return a vector with the corresponding row or column sums.

Finally, we can take our formed contingency table, and use prop.table on it, in order to return a table with the proportions (rather than the frequencies).

Self-Assessment

Note: the following questions are still experimental. Please contact me if you have any issues with these components. This can be if there are incorrect answers, or if there are any technical concerns. Each question currently has an ID with it, randomized for each version. If you have issues, reporting the specific ID will allow for easier checking!

For each question, you can check your answer using the checkmark button. You can cycle through variants of the question by pressing the arrow icon.

Self Assessment 4.1

If \(A\) and \(B\) are independent, then knowing \(A\) occurs means that \(B\) did not occur.

(Question ID: 0463616548)

If \(A \perp B\) then \(B \perp A\).

(Question ID: 0886470639)

If \(A\) and \(B\) are independent, then \(A\) causes \(B\) to not occur.

(Question ID: 0251015970)

If \(P(A)\) is \(0.69\) and \(P(B)\) is \(0.95\) and \(P(A,B)\) is \(0.6555\), then \(A \perp B\).

(Question ID: 0080960978)

Two events are independent if the occurrence of one event does not influence the probability of the other.

(Question ID: 0010667899)

If two events are mutually exclusive, they must be independent.

(Question ID: 0677848547)

If two events are mutually exclusive, they must be independent.

(Question ID: 0837129019)

If \(A \perp B\) then \(B \perp A\).

(Question ID: 0232227299)

If \(A\) and \(B\) are independent, then \(A\) causes \(B\) to not occur.

(Question ID: 0414182213)

If \(A\) and \(B\) are independent, then \(A\) causes \(B\) to not occur.

(Question ID: 0220364941)

If \(P(A)\) is \(0.16\) and \(P(B)\) is \(0.52\) and \(P(A,B)\) is \(0.0832\), then \(A \perp B\).

(Question ID: 0421247229)

If \(P(A) = P(B)\), then \(A\) and \(B\) are independent.

(Question ID: 0038799228)

If \(A \perp B\) then \(B \perp A\).

(Question ID: 0589550286)

If a coin is tossed twice, the outcomes of each toss will be independent.

(Question ID: 0718561306)

If \(A\) and \(B\) are independent, then knowing \(A\) occurs means that \(B\) did not occur.

(Question ID: 0525199498)

If events \(A\) and \(B\) are independent, and \(P(B) > 0\), then \(P(A|B) = P(A)\).

(Question ID: 0740943891)

Two events are independent if the occurrence of one event does not influence the probability of the other.

(Question ID: 0979790030)

If two events are mutually exclusive, they must be independent.

(Question ID: 0323323644)

If \(A \perp B\) then \(B \perp A\).

(Question ID: 0222888025)

If \(A\) and \(B\) are independent, then knowing \(A\) occurs means that \(B\) did not occur.

(Question ID: 0533882029)

Self Assessment 4.2

A sample space may have multiple different partitions.

(Question ID: 0206372290)

Any collection of sets whose union equals the sample space is a partition.

(Question ID: 0334323302)

The intersection of any sets in a partition is the empty set.

(Question ID: 0616554423)

The number of sets in a partition must equal the number of outcomes in a sample space.

(Question ID: 0888263175)

A sample space may have multiple different partitions.

(Question ID: 0309644160)

Any collection of sets whose union equals the sample space is a partition.

(Question ID: 0825751779)

Every event from a statistical experiment belongs to exactly one set in a partition.

(Question ID: 0270748397)

The number of sets in a partition must equal the number of outcomes in a sample space.

(Question ID: 0054315215)

A sample space may have multiple different partitions.

(Question ID: 0182305286)

Any collection of sets whose union equals the sample space is a partition.

(Question ID: 0462598950)

Any collection of sets whose union equals the sample space is a partition.

(Question ID: 0072294476)

A sample space may have multiple different partitions.

(Question ID: 0014340824)

The union of all sets in a partition forms the sample space.

(Question ID: 0627915722)

A partition must consist of events that have equal probability.

(Question ID: 0364696965)

A sample space may have multiple different partitions.

(Question ID: 0809684567)

Any two mutually exclusive events form a partition.

(Question ID: 0768452451)

An event and its complement always form a partition.

(Question ID: 0394146805)

The intersection of any sets in a partition is the empty set.

(Question ID: 0109453452)

The intersection of any sets in a partition is the empty set.

(Question ID: 0697175778)

Every outcome from a statistical experiment belongs to exactly one set in a partition.

(Question ID: 0435420953)

Self Assessment 4.3

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is red?

(Question ID: 0845641218)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a spade?

(Question ID: 0846213330)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a 2?

(Question ID: 0414665215)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is black?

(Question ID: 0697266853)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a Queen?

(Question ID: 0145275906)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is red?

(Question ID: 0239792144)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a diamond?

(Question ID: 0667373442)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is red?

(Question ID: 0429241699)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a Queen?

(Question ID: 0650498398)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is an 8?

(Question ID: 0957570354)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is black?

(Question ID: 0170669903)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a heart?

(Question ID: 0897607165)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is black?

(Question ID: 0034561761)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a club?

(Question ID: 0211864602)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a diamond?

(Question ID: 0274299685)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a 2?

(Question ID: 0653764521)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a Queen?

(Question ID: 0108732007)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a club?

(Question ID: 0779711928)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a 4?

(Question ID: 0135393923)

Suppose that two cards are drawn, without replacement, from a deck of cards. What is the probability that the second card is a spade?

(Question ID: 0817996346)

Self Assessment 4.4

Of voters in a particular riding from the last election, 33% voted for the incumbent and 19% were 65+ and voted for the incumbent.

Given that a person voted for the incumbent, what is the probability that they are 65+?

(Question ID: 0202999725)

Of visitors to a local clinic, 36% received a flu shot and 22% received both a flu shot and a COVID booster.

Given that a person received a flu shot, what is the probability that they also received a COVID booster?

(Question ID: 0302978249)

Of active users on a social media platform in the past week, 46% shared a news article and 15% shared a news article about local politics.

Given that a person shared a news article, what is the probability that it was about local politics?

(Question ID: 0655440521)

Of adults in a particular town who joined community programs this year, 26% joined a visual arts program and 10% joined a water colour program.

Given that a person joined a visual arts program, what is the probability that it was a water colour program?

(Question ID: 0902623666)

Of voters in a particular riding from the last election, 17% voted for the incumbent and 15% were 65+ and voted for the incumbent.

Given that a person voted for the incumbent, what is the probability that they are 65+?

(Question ID: 0880575605)

Of adults in a particular town who joined community programs this year, 5% joined a visual arts program and 2% joined a water colour program.

Given that a person joined a visual arts program, what is the probability that it was a water colour program?

(Question ID: 0845831611)

Of people in a certain city who bought a new vehicle in the past year, 7% bought an electric vehicle and 1% bought an electric truck.

Given that a person bought an electric vehicle, what is the probability that it was a truck?

(Question ID: 0852382924)

Of people in a certain city who bought a new vehicle in the past year, 40% bought an electric vehicle and 20% bought an electric truck.

Given that a person bought an electric vehicle, what is the probability that it was a truck?

(Question ID: 0280858143)

Of visitors to a local clinic, 24% received a flu shot and 18% received both a flu shot and a COVID booster.

Given that a person received a flu shot, what is the probability that they also received a COVID booster?

(Question ID: 0358993296)

Of visitors to a local clinic, 45% received a flu shot and 15% received both a flu shot and a COVID booster.

Given that a person received a flu shot, what is the probability that they also received a COVID booster?

(Question ID: 0630530527)

Of active users on a social media platform in the past week, 32% shared a news article and 22% shared a news article about local politics.

Given that a person shared a news article, what is the probability that it was about local politics?

(Question ID: 0271785822)

Of active users on a social media platform in the past week, 49% shared a news article and 15% shared a news article about local politics.

Given that a person shared a news article, what is the probability that it was about local politics?

(Question ID: 0394859735)

Of voters in a particular riding from the last election, 25% voted for the incumbent and 23% were 65+ and voted for the incumbent.

Given that a person voted for the incumbent, what is the probability that they are 65+?

(Question ID: 0311599269)

Of adults in a particular town who joined community programs this year, 9% joined a visual arts program and 7% joined a water colour program.

Given that a person joined a visual arts program, what is the probability that it was a water colour program?

(Question ID: 0111698630)

Of voters in a particular riding from the last election, 30% voted for the incumbent and 28% were 65+ and voted for the incumbent.

Given that a person voted for the incumbent, what is the probability that they are 65+?

(Question ID: 0536544732)

Of students at a particular University who joined a club in the past year, 33% joined a sports club and 8% joined the scocer club.

Given that a person joined a sports club, what is the probability that it was the soccer club?

(Question ID: 0181438981)

Of students at a particular University who joined a club in the past year, 15% joined a sports club and 6% joined the scocer club.

Given that a person joined a sports club, what is the probability that it was the soccer club?

(Question ID: 0397755901)

Of adults in a particular town who joined community programs this year, 45% joined a visual arts program and 26% joined a water colour program.

Given that a person joined a visual arts program, what is the probability that it was a water colour program?

(Question ID: 0267030643)

Of voters in a particular riding from the last election, 40% voted for the incumbent and 26% were 65+ and voted for the incumbent.

Given that a person voted for the incumbent, what is the probability that they are 65+?

(Question ID: 0036308489)

Of visitors to a local clinic, 8% received a flu shot and 3% received both a flu shot and a COVID booster.

Given that a person received a flu shot, what is the probability that they also received a COVID booster?

(Question ID: 0268463464)

Of active users on a social media platform in the past week, 8% shared a news article and 4% shared a news article about local politics.

Given that a person shared a news article, what is the probability that it was about local politics?

(Question ID: 0073020280)

Of voters in a particular riding from the last election, 29% voted for the incumbent and 24% were 65+ and voted for the incumbent.

Given that a person voted for the incumbent, what is the probability that they are 65+?

(Question ID: 0571907673)

Of people in a certain city who bought a new vehicle in the past year, 27% bought an electric vehicle and 3% bought an electric truck.

Given that a person bought an electric vehicle, what is the probability that it was a truck?

(Question ID: 0760800273)

Of adults in a particular town who joined community programs this year, 49% joined a visual arts program and 8% joined a water colour program.

Given that a person joined a visual arts program, what is the probability that it was a water colour program?

(Question ID: 0000373466)

Of voters in a particular riding from the last election, 42% voted for the incumbent and 15% were 65+ and voted for the incumbent.

Given that a person voted for the incumbent, what is the probability that they are 65+?

(Question ID: 0944141721)

Of people in a certain city who bought a new vehicle in the past year, 10% bought an electric vehicle and 1% bought an electric truck.

Given that a person bought an electric vehicle, what is the probability that it was a truck?

(Question ID: 0035897931)

Of active users on a social media platform in the past week, 37% shared a news article and 10% shared a news article about local politics.

Given that a person shared a news article, what is the probability that it was about local politics?

(Question ID: 0718195761)

Of students at a particular University who joined a club in the past year, 28% joined a sports club and 24% joined the scocer club.

Given that a person joined a sports club, what is the probability that it was the soccer club?

(Question ID: 0358480917)

Of people in a certain city who bought a new vehicle in the past year, 32% bought an electric vehicle and 8% bought an electric truck.

Given that a person bought an electric vehicle, what is the probability that it was a truck?

(Question ID: 0396986507)

Of active users on a social media platform in the past week, 50% shared a news article and 33% shared a news article about local politics.

Given that a person shared a news article, what is the probability that it was about local politics?

(Question ID: 0518065905)

Of visitors to a local clinic, 24% received a flu shot and 9% received both a flu shot and a COVID booster.

Given that a person received a flu shot, what is the probability that they also received a COVID booster?

(Question ID: 0268124433)

Of active users on a social media platform in the past week, 6% shared a news article and 3% shared a news article about local politics.

Given that a person shared a news article, what is the probability that it was about local politics?

(Question ID: 0123070417)

Of adults in a particular town who joined community programs this year, 45% joined a visual arts program and 17% joined a water colour program.

Given that a person joined a visual arts program, what is the probability that it was a water colour program?

(Question ID: 0406400332)

Of voters in a particular riding from the last election, 33% voted for the incumbent and 1% were 65+ and voted for the incumbent.

Given that a person voted for the incumbent, what is the probability that they are 65+?

(Question ID: 0726386801)

Of visitors to a local clinic, 22% received a flu shot and 1% received both a flu shot and a COVID booster.

Given that a person received a flu shot, what is the probability that they also received a COVID booster?

(Question ID: 0723701137)

Of adults in a particular town who joined community programs this year, 34% joined a visual arts program and 17% joined a water colour program.

Given that a person joined a visual arts program, what is the probability that it was a water colour program?

(Question ID: 0056713185)

Of students at a particular University who joined a club in the past year, 46% joined a sports club and 17% joined the scocer club.

Given that a person joined a sports club, what is the probability that it was the soccer club?

(Question ID: 0727494987)

Of voters in a particular riding from the last election, 32% voted for the incumbent and 17% were 65+ and voted for the incumbent.

Given that a person voted for the incumbent, what is the probability that they are 65+?

(Question ID: 0730113889)

Of visitors to a local clinic, 8% received a flu shot and 5% received both a flu shot and a COVID booster.

Given that a person received a flu shot, what is the probability that they also received a COVID booster?

(Question ID: 0760890704)

Of voters in a particular riding from the last election, 29% voted for the incumbent and 12% were 65+ and voted for the incumbent.

Given that a person voted for the incumbent, what is the probability that they are 65+?

(Question ID: 0505174891)

Of active users on a social media platform in the past week, 46% shared a news article and 28% shared a news article about local politics.

Given that a person shared a news article, what is the probability that it was about local politics?

(Question ID: 0536141918)

Of adults in a particular town who joined community programs this year, 8% joined a visual arts program and 4% joined a water colour program.

Given that a person joined a visual arts program, what is the probability that it was a water colour program?

(Question ID: 0449352415)

Of visitors to a local clinic, 32% received a flu shot and 9% received both a flu shot and a COVID booster.

Given that a person received a flu shot, what is the probability that they also received a COVID booster?

(Question ID: 0928907043)

Of visitors to a local clinic, 7% received a flu shot and 3% received both a flu shot and a COVID booster.

Given that a person received a flu shot, what is the probability that they also received a COVID booster?

(Question ID: 0606186801)

Of voters in a particular riding from the last election, 25% voted for the incumbent and 15% were 65+ and voted for the incumbent.

Given that a person voted for the incumbent, what is the probability that they are 65+?

(Question ID: 0007651406)

Of active users on a social media platform in the past week, 17% shared a news article and 10% shared a news article about local politics.

Given that a person shared a news article, what is the probability that it was about local politics?

(Question ID: 0640213893)

Of people in a certain city who bought a new vehicle in the past year, 47% bought an electric vehicle and 11% bought an electric truck.

Given that a person bought an electric vehicle, what is the probability that it was a truck?

(Question ID: 0084584505)

Of adults in a particular town who joined community programs this year, 10% joined a visual arts program and 7% joined a water colour program.

Given that a person joined a visual arts program, what is the probability that it was a water colour program?

(Question ID: 0686843708)

Of people in a certain city who bought a new vehicle in the past year, 29% bought an electric vehicle and 6% bought an electric truck.

Given that a person bought an electric vehicle, what is the probability that it was a truck?

(Question ID: 0116789180)

Of people in a certain city who bought a new vehicle in the past year, 19% bought an electric vehicle and 5% bought an electric truck.

Given that a person bought an electric vehicle, what is the probability that it was a truck?

(Question ID: 0009552472)

Self Assessment 4.5

A teacher is interested in investigating the efficacy of a peer tutoring program. They measure whether each individual has tutoring, and then whether they pass or fail the relevant exam.

A sample of size 336 is taken, and summarized in the following contingency table.

Received Tutoring Did not Receive Tutoring
Passed 80 32
Failed 160 64
  1. What is the probability that a randomly sampled individual receives tutoring?
  2. What is the probability that a randomly sampled individual passes?
  3. Given that a randomly sampled individual receives tutoring, what is the probability that this individual passes?
  4. Are tutoring involvement and test outcome independent, when considering a randomly sampled individual?

(Question ID: 0091934752)

A teacher is interested in investigating the efficacy of a peer tutoring program. They measure whether each individual has tutoring, and then whether they pass or fail the relevant exam.

A sample of size 159 is taken, and summarized in the following contingency table.

Received Tutoring Did not Receive Tutoring
Passed 49 41
Failed 58 11
  1. What is the probability that a randomly sampled individual receives tutoring?
  2. What is the probability that a randomly sampled individual passes?
  3. Given that a randomly sampled individual receives tutoring, what is the probability that this individual passes?
  4. Are tutoring involvement and test outcome independent, when considering a randomly sampled individual?

(Question ID: 0832570880)

A teacher is interested in investigating the efficacy of a peer tutoring program. They measure whether each individual has tutoring, and then whether they pass or fail the relevant exam.

A sample of size 288 is taken, and summarized in the following contingency table.

Received Tutoring Did not Receive Tutoring
Passed 36 79
Failed 54 119
  1. What is the probability that a randomly sampled individual receives tutoring?
  2. What is the probability that a randomly sampled individual passes?
  3. Given that a randomly sampled individual receives tutoring, what is the probability that this individual passes?
  4. Are tutoring involvement and test outcome independent, when considering a randomly sampled individual?

(Question ID: 0528297641)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 62 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 18 13
Did Not Make a Purchase 18 13
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0531660683)

A teacher is interested in investigating the efficacy of a peer tutoring program. They measure whether each individual has tutoring, and then whether they pass or fail the relevant exam.

A sample of size 297 is taken, and summarized in the following contingency table.

Received Tutoring Did not Receive Tutoring
Passed 42 57
Failed 84 114
  1. What is the probability that a randomly sampled individual receives tutoring?
  2. What is the probability that a randomly sampled individual passes?
  3. Given that a randomly sampled individual receives tutoring, what is the probability that this individual passes?
  4. Are tutoring involvement and test outcome independent, when considering a randomly sampled individual?

(Question ID: 0891927874)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 93 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 20 42
Did Not Make a Purchase 10 21
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0332955951)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 70 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 8 20
Disease Absent 12 30
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0780870654)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 165 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 20 35
Did Not Make a Purchase 40 70
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0303351515)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 351 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 28 89
Did Not Make a Purchase 56 178
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0507927141)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 105 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 6 64
Did Not Make a Purchase 3 32
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0605217354)

A geneticist is studying two genes. Each gene can be either dominant or recessive.

A sample of size 248 is taken, and summarized in the following contingency table.

Gene 1 Dominant Gene 1 Recessive
Gene 2 Dominant 94 30
Gene 2 Recessive 94 30
  1. What is the probability that a randomly sampled individual has dominant gene 1?
  2. What is the probability that a randomly sampled individual has dominant gene 2?
  3. Given that a randomly sampled individual has dominant gene 1, what is the probability that this individual has dominant gene 2?
  4. Are the two genes independent, when considering a randomly sampled individual?

(Question ID: 0336952340)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 165 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 18 92
Disease Absent 9 46
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0642870191)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 187 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 44 28
Did Not Make a Purchase 84 31
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0036609094)

A geneticist is studying two genes. Each gene can be either dominant or recessive.

A sample of size 196 is taken, and summarized in the following contingency table.

Gene 1 Dominant Gene 1 Recessive
Gene 2 Dominant 60 43
Gene 2 Recessive 76 17
  1. What is the probability that a randomly sampled individual has dominant gene 1?
  2. What is the probability that a randomly sampled individual has dominant gene 2?
  3. Given that a randomly sampled individual has dominant gene 1, what is the probability that this individual has dominant gene 2?
  4. Are the two genes independent, when considering a randomly sampled individual?

(Question ID: 0685851373)

A geneticist is studying two genes. Each gene can be either dominant or recessive.

A sample of size 119 is taken, and summarized in the following contingency table.

Gene 1 Dominant Gene 1 Recessive
Gene 2 Dominant 72 6
Gene 2 Recessive 35 6
  1. What is the probability that a randomly sampled individual has dominant gene 1?
  2. What is the probability that a randomly sampled individual has dominant gene 2?
  3. Given that a randomly sampled individual has dominant gene 1, what is the probability that this individual has dominant gene 2?
  4. Are the two genes independent, when considering a randomly sampled individual?

(Question ID: 0073675914)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 149 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 67 12
Disease Absent 47 23
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0822236064)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 147 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 69 53
Disease Absent 17 8
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0515560155)

A geneticist is studying two genes. Each gene can be either dominant or recessive.

A sample of size 188 is taken, and summarized in the following contingency table.

Gene 1 Dominant Gene 1 Recessive
Gene 2 Dominant 89 28
Gene 2 Recessive 60 11
  1. What is the probability that a randomly sampled individual has dominant gene 1?
  2. What is the probability that a randomly sampled individual has dominant gene 2?
  3. Given that a randomly sampled individual has dominant gene 1, what is the probability that this individual has dominant gene 2?
  4. Are the two genes independent, when considering a randomly sampled individual?

(Question ID: 0612936733)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 146 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 96 10
Disease Absent 25 15
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0731048293)

A geneticist is studying two genes. Each gene can be either dominant or recessive.

A sample of size 268 is taken, and summarized in the following contingency table.

Gene 1 Dominant Gene 1 Recessive
Gene 2 Dominant 38 96
Gene 2 Recessive 38 96
  1. What is the probability that a randomly sampled individual has dominant gene 1?
  2. What is the probability that a randomly sampled individual has dominant gene 2?
  3. Given that a randomly sampled individual has dominant gene 1, what is the probability that this individual has dominant gene 2?
  4. Are the two genes independent, when considering a randomly sampled individual?

(Question ID: 0113595290)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 308 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 12 76
Did Not Make a Purchase 30 190
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0067186262)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 221 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 62 19
Did Not Make a Purchase 92 48
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0265420981)

A teacher is interested in investigating the efficacy of a peer tutoring program. They measure whether each individual has tutoring, and then whether they pass or fail the relevant exam.

A sample of size 158 is taken, and summarized in the following contingency table.

Received Tutoring Did not Receive Tutoring
Passed 56 23
Failed 56 23
  1. What is the probability that a randomly sampled individual receives tutoring?
  2. What is the probability that a randomly sampled individual passes?
  3. Given that a randomly sampled individual receives tutoring, what is the probability that this individual passes?
  4. Are tutoring involvement and test outcome independent, when considering a randomly sampled individual?

(Question ID: 0129240226)

A geneticist is studying two genes. Each gene can be either dominant or recessive.

A sample of size 207 is taken, and summarized in the following contingency table.

Gene 1 Dominant Gene 1 Recessive
Gene 2 Dominant 56 9
Gene 2 Recessive 82 60
  1. What is the probability that a randomly sampled individual has dominant gene 1?
  2. What is the probability that a randomly sampled individual has dominant gene 2?
  3. Given that a randomly sampled individual has dominant gene 1, what is the probability that this individual has dominant gene 2?
  4. Are the two genes independent, when considering a randomly sampled individual?

(Question ID: 0564514460)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 336 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 14 82
Disease Absent 35 205
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0952304733)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 336 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 74 22
Disease Absent 185 55
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0706310131)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 212 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 86 55
Disease Absent 43 28
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0865386634)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 132 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 12 21
Disease Absent 36 63
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0533239895)

A teacher is interested in investigating the efficacy of a peer tutoring program. They measure whether each individual has tutoring, and then whether they pass or fail the relevant exam.

A sample of size 182 is taken, and summarized in the following contingency table.

Received Tutoring Did not Receive Tutoring
Passed 68 53
Failed 34 27
  1. What is the probability that a randomly sampled individual receives tutoring?
  2. What is the probability that a randomly sampled individual passes?
  3. Given that a randomly sampled individual receives tutoring, what is the probability that this individual passes?
  4. Are tutoring involvement and test outcome independent, when considering a randomly sampled individual?

(Question ID: 0716724041)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 189 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 47 32
Disease Absent 63 47
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0854206092)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 308 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 82 41
Did Not Make a Purchase 123 62
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0161063937)

A teacher is interested in investigating the efficacy of a peer tutoring program. They measure whether each individual has tutoring, and then whether they pass or fail the relevant exam.

A sample of size 177 is taken, and summarized in the following contingency table.

Received Tutoring Did not Receive Tutoring
Passed 63 44
Failed 47 23
  1. What is the probability that a randomly sampled individual receives tutoring?
  2. What is the probability that a randomly sampled individual passes?
  3. Given that a randomly sampled individual receives tutoring, what is the probability that this individual passes?
  4. Are tutoring involvement and test outcome independent, when considering a randomly sampled individual?

(Question ID: 0956972191)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 238 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 99 28
Disease Absent 62 49
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0223096894)

A geneticist is studying two genes. Each gene can be either dominant or recessive.

A sample of size 225 is taken, and summarized in the following contingency table.

Gene 1 Dominant Gene 1 Recessive
Gene 2 Dominant 43 42
Gene 2 Recessive 86 54
  1. What is the probability that a randomly sampled individual has dominant gene 1?
  2. What is the probability that a randomly sampled individual has dominant gene 2?
  3. Given that a randomly sampled individual has dominant gene 1, what is the probability that this individual has dominant gene 2?
  4. Are the two genes independent, when considering a randomly sampled individual?

(Question ID: 0371873371)

A teacher is interested in investigating the efficacy of a peer tutoring program. They measure whether each individual has tutoring, and then whether they pass or fail the relevant exam.

A sample of size 497 is taken, and summarized in the following contingency table.

Received Tutoring Did not Receive Tutoring
Passed 50 92
Failed 125 230
  1. What is the probability that a randomly sampled individual receives tutoring?
  2. What is the probability that a randomly sampled individual passes?
  3. Given that a randomly sampled individual receives tutoring, what is the probability that this individual passes?
  4. Are tutoring involvement and test outcome independent, when considering a randomly sampled individual?

(Question ID: 0185212892)

A teacher is interested in investigating the efficacy of a peer tutoring program. They measure whether each individual has tutoring, and then whether they pass or fail the relevant exam.

A sample of size 224 is taken, and summarized in the following contingency table.

Received Tutoring Did not Receive Tutoring
Passed 14 50
Failed 35 125
  1. What is the probability that a randomly sampled individual receives tutoring?
  2. What is the probability that a randomly sampled individual passes?
  3. Given that a randomly sampled individual receives tutoring, what is the probability that this individual passes?
  4. Are tutoring involvement and test outcome independent, when considering a randomly sampled individual?

(Question ID: 0871347881)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 300 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 14 86
Disease Absent 28 172
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0260738443)

A geneticist is studying two genes. Each gene can be either dominant or recessive.

A sample of size 82 is taken, and summarized in the following contingency table.

Gene 1 Dominant Gene 1 Recessive
Gene 2 Dominant 20 10
Gene 2 Recessive 34 18
  1. What is the probability that a randomly sampled individual has dominant gene 1?
  2. What is the probability that a randomly sampled individual has dominant gene 2?
  3. Given that a randomly sampled individual has dominant gene 1, what is the probability that this individual has dominant gene 2?
  4. Are the two genes independent, when considering a randomly sampled individual?

(Question ID: 0099045902)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 281 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 96 86
Did Not Make a Purchase 61 38
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0245407051)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 250 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 74 68
Did Not Make a Purchase 68 40
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0468785218)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 102 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 18 50
Did Not Make a Purchase 9 25
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0228650863)

A teacher is interested in investigating the efficacy of a peer tutoring program. They measure whether each individual has tutoring, and then whether they pass or fail the relevant exam.

A sample of size 345 is taken, and summarized in the following contingency table.

Received Tutoring Did not Receive Tutoring
Passed 96 67
Failed 97 85
  1. What is the probability that a randomly sampled individual receives tutoring?
  2. What is the probability that a randomly sampled individual passes?
  3. Given that a randomly sampled individual receives tutoring, what is the probability that this individual passes?
  4. Are tutoring involvement and test outcome independent, when considering a randomly sampled individual?

(Question ID: 0457655792)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 392 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 94 18
Did Not Make a Purchase 235 45
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0698831090)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 242 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 26 71
Disease Absent 39 106
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0173760205)

A new medical test has been developed to test for a particular disease. They measure the response status of the test, along with the true disease status.

A sample of size 310 is taken, and summarized in the following contingency table.

Test Positive Test Negative
Disease Present 94 71
Disease Absent 77 68
  1. What is the probability that a randomly sampled individual tests positive?
  2. What is the probability that a randomly sampled individual has the disease present?
  3. Given that a randomly sampled individual tests positive, what is the probability that this individual has the disease present?
  4. Are the test result and disease status independent, when considering a randomly sampled individual?

(Question ID: 0489679097)

A teacher is interested in investigating the efficacy of a peer tutoring program. They measure whether each individual has tutoring, and then whether they pass or fail the relevant exam.

A sample of size 157 is taken, and summarized in the following contingency table.

Received Tutoring Did not Receive Tutoring
Passed 73 18
Failed 57 9
  1. What is the probability that a randomly sampled individual receives tutoring?
  2. What is the probability that a randomly sampled individual passes?
  3. Given that a randomly sampled individual receives tutoring, what is the probability that this individual passes?
  4. Are tutoring involvement and test outcome independent, when considering a randomly sampled individual?

(Question ID: 0746116788)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 130 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 26 39
Did Not Make a Purchase 26 39
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0026187982)

A geneticist is studying two genes. Each gene can be either dominant or recessive.

A sample of size 138 is taken, and summarized in the following contingency table.

Gene 1 Dominant Gene 1 Recessive
Gene 2 Dominant 32 23
Gene 2 Recessive 48 35
  1. What is the probability that a randomly sampled individual has dominant gene 1?
  2. What is the probability that a randomly sampled individual has dominant gene 2?
  3. Given that a randomly sampled individual has dominant gene 1, what is the probability that this individual has dominant gene 2?
  4. Are the two genes independent, when considering a randomly sampled individual?

(Question ID: 0736777941)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 250 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 99 58
Did Not Make a Purchase 80 13
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0341121683)

A marketing team wishes to understand the efficacy of their marketing program. They measure the purchase behaviour for individuals, based on whether or not they saw an ad.

A sample of size 183 is taken, and summarized in the following contingency table.

Saw the Ad Did Not See the Ad
Made a Purchase 68 6
Did Not Make a Purchase 96 13
  1. What is the probability that a randomly sampled individual sees the ad?
  2. What is the probability that a randomly sampled individual makes a purchase?
  3. Given that a randomly sampled individual sees the ad, what is the probability that this individual makes a purchase?
  4. Are the marketing efficacy and purchase behaviour independent, when considering a randomly sampled individual?

(Question ID: 0891681966)

Self Assessment 4.6

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.61, that the customer is in demographic group two is 0.33, and that the customer is in demographic group three is 0.06.

The probability that the customer makes a purchase given that they are in demographic group one is 0.36, given that they are in demographic group two is 0.1, and given that they are in demographic group three is 0.28.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0078066577)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.17, that the customer is in demographic group two is 0.78, and that the customer is in demographic group three is 0.05.

The probability that the customer makes a purchase given that they are in demographic group one is 0.03, given that they are in demographic group two is 0.31, and given that they are in demographic group three is 0.34.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0005613862)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.47, that the customer is in demographic group two is 0.29, and that the customer is in demographic group three is 0.24.

The probability that the customer makes a purchase given that they are in demographic group one is 0.06, given that they are in demographic group two is 0.47, and given that they are in demographic group three is 0.5.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0281924173)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.48, that the customer is in demographic group two is 0.1, and that the customer is in demographic group three is 0.42.

The probability that the customer makes a purchase given that they are in demographic group one is 0.04, given that they are in demographic group two is 0.42, and given that they are in demographic group three is 0.24.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0686936044)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.16, that this product is manufactured at the second factory is 0.38, and that this product is manufactured at the third factory is 0.46.

The probability that this product is defective given that it is manufactured at the first factory is 0.3, given that it is manufactured at the second factory is 0.08, and given that it is manufactured at the third factory is 0.15.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0907234856)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.11, that this product is manufactured at the second factory is 0.76, and that this product is manufactured at the third factory is 0.13.

The probability that this product is defective given that it is manufactured at the first factory is 0.4, given that it is manufactured at the second factory is 0.1, and given that it is manufactured at the third factory is 0.11.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0031893021)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.39, that the customer is in demographic group two is 0.02, and that the customer is in demographic group three is 0.59.

The probability that the customer makes a purchase given that they are in demographic group one is 0.36, given that they are in demographic group two is 0.33, and given that they are in demographic group three is 0.47.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0506345661)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.23, that the customer is in demographic group two is 0.43, and that the customer is in demographic group three is 0.34.

The probability that the customer makes a purchase given that they are in demographic group one is 0.07, given that they are in demographic group two is 0.19, and given that they are in demographic group three is 0.12.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0675911674)

A particular piece of software is being programmed by three different programmers.

The probability that a line of code was written by Alice is 0.45, that a line of code was written by Barb is 0.5, and that a line of code was written by Celeste is 0.05.

The probability that a line of code has an error given that it was written by Alice is 0.38, given that it was written by Barb is 0.4, and given that it was written by Celeste is 0.03.

  1. What is the probability that a line of code was written by Alice and that a line of code has an error?
  2. What is the probability that was not written by Alice and that a line of code has an error?
  3. What is the probability that a line of code has an error?

(Question ID: 0676748067)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.28, that this product is manufactured at the second factory is 0.56, and that this product is manufactured at the third factory is 0.16.

The probability that this product is defective given that it is manufactured at the first factory is 0.39, given that it is manufactured at the second factory is 0.07, and given that it is manufactured at the third factory is 0.49.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0551451957)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.1, that the customer is in demographic group two is 0.75, and that the customer is in demographic group three is 0.15.

The probability that the customer makes a purchase given that they are in demographic group one is 0.19, given that they are in demographic group two is 0.29, and given that they are in demographic group three is 0.23.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0648288341)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.1, that this product is manufactured at the second factory is 0.09, and that this product is manufactured at the third factory is 0.81.

The probability that this product is defective given that it is manufactured at the first factory is 0.17, given that it is manufactured at the second factory is 0.34, and given that it is manufactured at the third factory is 0.28.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0407509055)

A particular piece of software is being programmed by three different programmers.

The probability that a line of code was written by Alice is 0.13, that a line of code was written by Barb is 0.69, and that a line of code was written by Celeste is 0.18.

The probability that a line of code has an error given that it was written by Alice is 0.23, given that it was written by Barb is 0.39, and given that it was written by Celeste is 0.43.

  1. What is the probability that a line of code was written by Alice and that a line of code has an error?
  2. What is the probability that was not written by Alice and that a line of code has an error?
  3. What is the probability that a line of code has an error?

(Question ID: 0916888651)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.12, that the customer is in demographic group two is 0.32, and that the customer is in demographic group three is 0.56.

The probability that the customer makes a purchase given that they are in demographic group one is 0.16, given that they are in demographic group two is 0.33, and given that they are in demographic group three is 0.26.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0805171559)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.39, that this product is manufactured at the second factory is 0.08, and that this product is manufactured at the third factory is 0.53.

The probability that this product is defective given that it is manufactured at the first factory is 0.47, given that it is manufactured at the second factory is 0.02, and given that it is manufactured at the third factory is 0.02.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0115948564)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.12, that the customer is in demographic group two is 0.61, and that the customer is in demographic group three is 0.27.

The probability that the customer makes a purchase given that they are in demographic group one is 0.05, given that they are in demographic group two is 0.3, and given that they are in demographic group three is 0.41.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0921784945)

A particular piece of software is being programmed by three different programmers.

The probability that a line of code was written by Alice is 0.74, that a line of code was written by Barb is 0.16, and that a line of code was written by Celeste is 0.1.

The probability that a line of code has an error given that it was written by Alice is 0.29, given that it was written by Barb is 0.39, and given that it was written by Celeste is 0.25.

  1. What is the probability that a line of code was written by Alice and that a line of code has an error?
  2. What is the probability that was not written by Alice and that a line of code has an error?
  3. What is the probability that a line of code has an error?

(Question ID: 0510746049)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.29, that this product is manufactured at the second factory is 0.26, and that this product is manufactured at the third factory is 0.45.

The probability that this product is defective given that it is manufactured at the first factory is 0.41, given that it is manufactured at the second factory is 0.45, and given that it is manufactured at the third factory is 0.47.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0157320951)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.7, that the customer is in demographic group two is 0.07, and that the customer is in demographic group three is 0.23.

The probability that the customer makes a purchase given that they are in demographic group one is 0.22, given that they are in demographic group two is 0.35, and given that they are in demographic group three is 0.46.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0109864455)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.1, that the customer is in demographic group two is 0.69, and that the customer is in demographic group three is 0.21.

The probability that the customer makes a purchase given that they are in demographic group one is 0.41, given that they are in demographic group two is 0.24, and given that they are in demographic group three is 0.3.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0577508944)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.09, that the customer is in demographic group two is 0.77, and that the customer is in demographic group three is 0.14.

The probability that the customer makes a purchase given that they are in demographic group one is 0.23, given that they are in demographic group two is 0.08, and given that they are in demographic group three is 0.05.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0829889272)

A particular piece of software is being programmed by three different programmers.

The probability that a line of code was written by Alice is 0.07, that a line of code was written by Barb is 0.4, and that a line of code was written by Celeste is 0.53.

The probability that a line of code has an error given that it was written by Alice is 0.3, given that it was written by Barb is 0.23, and given that it was written by Celeste is 0.2.

  1. What is the probability that a line of code was written by Alice and that a line of code has an error?
  2. What is the probability that was not written by Alice and that a line of code has an error?
  3. What is the probability that a line of code has an error?

(Question ID: 0252461843)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.41, that the customer is in demographic group two is 0.15, and that the customer is in demographic group three is 0.44.

The probability that the customer makes a purchase given that they are in demographic group one is 0.01, given that they are in demographic group two is 0.27, and given that they are in demographic group three is 0.25.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0764666887)

A particular piece of software is being programmed by three different programmers.

The probability that a line of code was written by Alice is 0.76, that a line of code was written by Barb is 0.05, and that a line of code was written by Celeste is 0.19.

The probability that a line of code has an error given that it was written by Alice is 0.16, given that it was written by Barb is 0.27, and given that it was written by Celeste is 0.06.

  1. What is the probability that a line of code was written by Alice and that a line of code has an error?
  2. What is the probability that was not written by Alice and that a line of code has an error?
  3. What is the probability that a line of code has an error?

(Question ID: 0780348937)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.15, that this product is manufactured at the second factory is 0.45, and that this product is manufactured at the third factory is 0.4.

The probability that this product is defective given that it is manufactured at the first factory is 0.28, given that it is manufactured at the second factory is 0.09, and given that it is manufactured at the third factory is 0.39.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0334180628)

A particular piece of software is being programmed by three different programmers.

The probability that a line of code was written by Alice is 0.53, that a line of code was written by Barb is 0.36, and that a line of code was written by Celeste is 0.11.

The probability that a line of code has an error given that it was written by Alice is 0.12, given that it was written by Barb is 0.36, and given that it was written by Celeste is 0.28.

  1. What is the probability that a line of code was written by Alice and that a line of code has an error?
  2. What is the probability that was not written by Alice and that a line of code has an error?
  3. What is the probability that a line of code has an error?

(Question ID: 0442948315)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.09, that this product is manufactured at the second factory is 0.39, and that this product is manufactured at the third factory is 0.52.

The probability that this product is defective given that it is manufactured at the first factory is 0.13, given that it is manufactured at the second factory is 0.49, and given that it is manufactured at the third factory is 0.33.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0241317644)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.48, that this product is manufactured at the second factory is 0.1, and that this product is manufactured at the third factory is 0.42.

The probability that this product is defective given that it is manufactured at the first factory is 0.37, given that it is manufactured at the second factory is 0.03, and given that it is manufactured at the third factory is 0.44.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0991445220)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.25, that the customer is in demographic group two is 0.68, and that the customer is in demographic group three is 0.07.

The probability that the customer makes a purchase given that they are in demographic group one is 0.19, given that they are in demographic group two is 0.46, and given that they are in demographic group three is 0.36.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0515055915)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.17, that this product is manufactured at the second factory is 0.08, and that this product is manufactured at the third factory is 0.75.

The probability that this product is defective given that it is manufactured at the first factory is 0.44, given that it is manufactured at the second factory is 0.47, and given that it is manufactured at the third factory is 0.35.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0019973340)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.74, that this product is manufactured at the second factory is 0.1, and that this product is manufactured at the third factory is 0.16.

The probability that this product is defective given that it is manufactured at the first factory is 0.14, given that it is manufactured at the second factory is 0.11, and given that it is manufactured at the third factory is 0.37.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0318433126)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.07, that the customer is in demographic group two is 0.18, and that the customer is in demographic group three is 0.75.

The probability that the customer makes a purchase given that they are in demographic group one is 0.03, given that they are in demographic group two is 0.14, and given that they are in demographic group three is 0.45.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0161983249)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.17, that the customer is in demographic group two is 0.56, and that the customer is in demographic group three is 0.27.

The probability that the customer makes a purchase given that they are in demographic group one is 0.32, given that they are in demographic group two is 0.12, and given that they are in demographic group three is 0.18.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0074158306)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.25, that the customer is in demographic group two is 0.47, and that the customer is in demographic group three is 0.28.

The probability that the customer makes a purchase given that they are in demographic group one is 0.5, given that they are in demographic group two is 0.12, and given that they are in demographic group three is 0.41.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0675870064)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.07, that this product is manufactured at the second factory is 0.2, and that this product is manufactured at the third factory is 0.73.

The probability that this product is defective given that it is manufactured at the first factory is 0.17, given that it is manufactured at the second factory is 0.31, and given that it is manufactured at the third factory is 0.39.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0235779184)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.42, that this product is manufactured at the second factory is 0.17, and that this product is manufactured at the third factory is 0.41.

The probability that this product is defective given that it is manufactured at the first factory is 0.24, given that it is manufactured at the second factory is 0.03, and given that it is manufactured at the third factory is 0.07.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0024229736)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.06, that this product is manufactured at the second factory is 0.82, and that this product is manufactured at the third factory is 0.12.

The probability that this product is defective given that it is manufactured at the first factory is 0.33, given that it is manufactured at the second factory is 0.27, and given that it is manufactured at the third factory is 0.24.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0044772212)

A particular piece of software is being programmed by three different programmers.

The probability that a line of code was written by Alice is 0.22, that a line of code was written by Barb is 0.72, and that a line of code was written by Celeste is 0.06.

The probability that a line of code has an error given that it was written by Alice is 0.07, given that it was written by Barb is 0.07, and given that it was written by Celeste is 0.34.

  1. What is the probability that a line of code was written by Alice and that a line of code has an error?
  2. What is the probability that was not written by Alice and that a line of code has an error?
  3. What is the probability that a line of code has an error?

(Question ID: 0471670275)

A particular piece of software is being programmed by three different programmers.

The probability that a line of code was written by Alice is 0.85, that a line of code was written by Barb is 0.05, and that a line of code was written by Celeste is 0.1.

The probability that a line of code has an error given that it was written by Alice is 0.04, given that it was written by Barb is 0.39, and given that it was written by Celeste is 0.43.

  1. What is the probability that a line of code was written by Alice and that a line of code has an error?
  2. What is the probability that was not written by Alice and that a line of code has an error?
  3. What is the probability that a line of code has an error?

(Question ID: 0640607004)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.57, that the customer is in demographic group two is 0.07, and that the customer is in demographic group three is 0.36.

The probability that the customer makes a purchase given that they are in demographic group one is 0.37, given that they are in demographic group two is 0.47, and given that they are in demographic group three is 0.36.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0208508328)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.24, that the customer is in demographic group two is 0.6, and that the customer is in demographic group three is 0.16.

The probability that the customer makes a purchase given that they are in demographic group one is 0.03, given that they are in demographic group two is 0.05, and given that they are in demographic group three is 0.25.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0521596254)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.24, that this product is manufactured at the second factory is 0.2, and that this product is manufactured at the third factory is 0.56.

The probability that this product is defective given that it is manufactured at the first factory is 0.11, given that it is manufactured at the second factory is 0.07, and given that it is manufactured at the third factory is 0.04.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0009321237)

A particular piece of software is being programmed by three different programmers.

The probability that a line of code was written by Alice is 0.24, that a line of code was written by Barb is 0.67, and that a line of code was written by Celeste is 0.09.

The probability that a line of code has an error given that it was written by Alice is 0.25, given that it was written by Barb is 0.1, and given that it was written by Celeste is 0.42.

  1. What is the probability that a line of code was written by Alice and that a line of code has an error?
  2. What is the probability that was not written by Alice and that a line of code has an error?
  3. What is the probability that a line of code has an error?

(Question ID: 0767525676)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.85, that the customer is in demographic group two is 0.01, and that the customer is in demographic group three is 0.14.

The probability that the customer makes a purchase given that they are in demographic group one is 0.42, given that they are in demographic group two is 0.17, and given that they are in demographic group three is 0.15.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0775164160)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.22, that this product is manufactured at the second factory is 0.41, and that this product is manufactured at the third factory is 0.37.

The probability that this product is defective given that it is manufactured at the first factory is 0.2, given that it is manufactured at the second factory is 0.28, and given that it is manufactured at the third factory is 0.44.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0627420985)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.56, that this product is manufactured at the second factory is 0.35, and that this product is manufactured at the third factory is 0.09.

The probability that this product is defective given that it is manufactured at the first factory is 0.07, given that it is manufactured at the second factory is 0.1, and given that it is manufactured at the third factory is 0.5.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0554537473)

A particular piece of software is being programmed by three different programmers.

The probability that a line of code was written by Alice is 0.32, that a line of code was written by Barb is 0.27, and that a line of code was written by Celeste is 0.41.

The probability that a line of code has an error given that it was written by Alice is 0.36, given that it was written by Barb is 0.15, and given that it was written by Celeste is 0.5.

  1. What is the probability that a line of code was written by Alice and that a line of code has an error?
  2. What is the probability that was not written by Alice and that a line of code has an error?
  3. What is the probability that a line of code has an error?

(Question ID: 0164109778)

A marketing team segments their customers into three possible demographic categories.

The probability that the customer is in demographic group one is 0.36, that the customer is in demographic group two is 0.58, and that the customer is in demographic group three is 0.06.

The probability that the customer makes a purchase given that they are in demographic group one is 0.19, given that they are in demographic group two is 0.15, and given that they are in demographic group three is 0.1.

  1. What is the probability that the customer is in demographic group one and that the customer makes a purchase?
  2. What is the probability that the customer is not in demographic group one and that the customer makes a purchase?
  3. What is the probability that the customer makes a purchase?

(Question ID: 0365099503)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.23, that this product is manufactured at the second factory is 0.44, and that this product is manufactured at the third factory is 0.33.

The probability that this product is defective given that it is manufactured at the first factory is 0.08, given that it is manufactured at the second factory is 0.38, and given that it is manufactured at the third factory is 0.5.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0345246688)

A company produces a certain product at three different factories.

The probability that this product is manufactured at the first factory is 0.27, that this product is manufactured at the second factory is 0.03, and that this product is manufactured at the third factory is 0.7.

The probability that this product is defective given that it is manufactured at the first factory is 0.38, given that it is manufactured at the second factory is 0.15, and given that it is manufactured at the third factory is 0.04.

  1. What is the probability that this product is manufactured at the first factory and that this product is defective?
  2. What is the probability that this product is not manufactured at the first factory and that this product is defective?
  3. What is the probability that this product is defective?

(Question ID: 0721269274)

Self Assessment 4.7

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.009\). If an item is unsafe, the probability is \(0.981\) that it will fail the inspection. If an item is safe, the probability is \(0.9835\) that it will pass the inspection.

If an item is passes inspection, what is the probability it is safe?

(Question ID: 0713903283)

A patient goes to see a doctor. The doctor performs a test with \(98\%\) reliability, that is, \(98\%\) of people who are sick test positive and \(98\%\) of the healthy people test negative. The doctor knows that only \(6\) in \(1000\) people in the country are sick.

If the patient tests negative, what is the probability that they are not sick?

(Question ID: 0802273891)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(69\%\) are sent via the standard service. A package sent via the standard service has a 0.1 probability of arriving late, and a package sent via the express service has a 0.05 probability of arriving late.

If an package arrives late, what is the probability it was sent via the standard service?

(Question ID: 0979271237)

A patient goes to see a doctor. The doctor performs a test with \(94\%\) reliability, that is, \(94\%\) of people who are sick test positive and \(94\%\) of the healthy people test negative. The doctor knows that only \(4\) in \(10000\) people in the country are sick.

If the patient tests positive, what is the probability that they are sick?

(Question ID: 0133730716)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.07\). If an item is unsafe, the probability is \(0.996\) that it will fail the inspection. If an item is safe, the probability is \(0.9922\) that it will pass the inspection.

If an item is fails inspection, what is the probability it is unsafe?

(Question ID: 0314924509)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(69\%\) are sent via the standard service. A package sent via the standard service has a 0.15 probability of arriving late, and a package sent via the express service has a 0.03 probability of arriving late.

If an package arrives late, what is the probability it was sent via the express service?

(Question ID: 0235401821)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(71\%\) are sent via the standard service. A package sent via the standard service has a 0.1 probability of arriving late, and a package sent via the express service has a 0.02 probability of arriving late.

If an package arrives on time, what is the probability it was sent via the standard service?

(Question ID: 0068395410)

A patient goes to see a doctor. The doctor performs a test with \(95\%\) reliability, that is, \(95\%\) of people who are sick test positive and \(95\%\) of the healthy people test negative. The doctor knows that only \(2\) in \(100\) people in the country are sick.

If the patient tests negative, what is the probability that they are sick?

(Question ID: 0890281529)

A patient goes to see a doctor. The doctor performs a test with \(97\%\) reliability, that is, \(97\%\) of people who are sick test positive and \(97\%\) of the healthy people test negative. The doctor knows that only \(6\) in \(50\) people in the country are sick.

If the patient tests negative, what is the probability that they are sick?

(Question ID: 0484164539)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.003\). If an item is unsafe, the probability is \(0.983\) that it will fail the inspection. If an item is safe, the probability is \(0.9848\) that it will pass the inspection.

If an item is passes inspection, what is the probability it is unsafe?

(Question ID: 0661949803)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.2\). If an item is unsafe, the probability is \(0.992\) that it will fail the inspection. If an item is safe, the probability is \(0.9807\) that it will pass the inspection.

If an item is fails inspection, what is the probability it is safe?

(Question ID: 0538634834)

A patient goes to see a doctor. The doctor performs a test with \(94\%\) reliability, that is, \(94\%\) of people who are sick test positive and \(94\%\) of the healthy people test negative. The doctor knows that only \(1\) in \(1000\) people in the country are sick.

If the patient tests positive, what is the probability that they are not sick?

(Question ID: 0044954556)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(67\%\) are sent via the standard service. A package sent via the standard service has a 0.18 probability of arriving late, and a package sent via the express service has a 0.17 probability of arriving late.

If an package arrives on time, what is the probability it was sent via the standard service?

(Question ID: 0968165588)

A patient goes to see a doctor. The doctor performs a test with \(93\%\) reliability, that is, \(93\%\) of people who are sick test positive and \(93\%\) of the healthy people test negative. The doctor knows that only \(7\) in \(100\) people in the country are sick.

If the patient tests negative, what is the probability that they are not sick?

(Question ID: 0863123095)

A patient goes to see a doctor. The doctor performs a test with \(96\%\) reliability, that is, \(96\%\) of people who are sick test positive and \(96\%\) of the healthy people test negative. The doctor knows that only \(2\) in \(100\) people in the country are sick.

If the patient tests negative, what is the probability that they are not sick?

(Question ID: 0408424575)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.002\). If an item is unsafe, the probability is \(0.992\) that it will fail the inspection. If an item is safe, the probability is \(0.9844\) that it will pass the inspection.

If an item is passes inspection, what is the probability it is unsafe?

(Question ID: 0847337400)

A patient goes to see a doctor. The doctor performs a test with \(94\%\) reliability, that is, \(94\%\) of people who are sick test positive and \(94\%\) of the healthy people test negative. The doctor knows that only \(1\) in \(50\) people in the country are sick.

If the patient tests positive, what is the probability that they are not sick?

(Question ID: 0984385028)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(68\%\) are sent via the standard service. A package sent via the standard service has a 0.07 probability of arriving late, and a package sent via the express service has a 0.02 probability of arriving late.

If an package arrives late, what is the probability it was sent via the express service?

(Question ID: 0296106519)

A patient goes to see a doctor. The doctor performs a test with \(90\%\) reliability, that is, \(90\%\) of people who are sick test positive and \(90\%\) of the healthy people test negative. The doctor knows that only \(5\) in \(50\) people in the country are sick.

If the patient tests negative, what is the probability that they are not sick?

(Question ID: 0952012865)

A patient goes to see a doctor. The doctor performs a test with \(94\%\) reliability, that is, \(94\%\) of people who are sick test positive and \(94\%\) of the healthy people test negative. The doctor knows that only \(1\) in \(1000\) people in the country are sick.

If the patient tests positive, what is the probability that they are sick?

(Question ID: 0943757515)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(82\%\) are sent via the standard service. A package sent via the standard service has a 0.06 probability of arriving late, and a package sent via the express service has a 0.05 probability of arriving late.

If an package arrives late, what is the probability it was sent via the express service?

(Question ID: 0325160548)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.0002\). If an item is unsafe, the probability is \(0.997\) that it will fail the inspection. If an item is safe, the probability is \(0.9842\) that it will pass the inspection.

If an item is fails inspection, what is the probability it is safe?

(Question ID: 0061116465)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(50\%\) are sent via the standard service. A package sent via the standard service has a 0.2 probability of arriving late, and a package sent via the express service has a 0.18 probability of arriving late.

If an package arrives late, what is the probability it was sent via the express service?

(Question ID: 0202062303)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(55\%\) are sent via the standard service. A package sent via the standard service has a 0.25 probability of arriving late, and a package sent via the express service has a 0.24 probability of arriving late.

If an package arrives late, what is the probability it was sent via the express service?

(Question ID: 0568953684)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.04\). If an item is unsafe, the probability is \(0.978\) that it will fail the inspection. If an item is safe, the probability is \(0.9777\) that it will pass the inspection.

If an item is passes inspection, what is the probability it is safe?

(Question ID: 0108333402)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(85\%\) are sent via the standard service. A package sent via the standard service has a 0.07 probability of arriving late, and a package sent via the express service has a 0.06 probability of arriving late.

If an package arrives late, what is the probability it was sent via the standard service?

(Question ID: 0290601258)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(71\%\) are sent via the standard service. A package sent via the standard service has a 0.08 probability of arriving late, and a package sent via the express service has a 0.05 probability of arriving late.

If an package arrives on time, what is the probability it was sent via the standard service?

(Question ID: 0729633493)

A patient goes to see a doctor. The doctor performs a test with \(97\%\) reliability, that is, \(97\%\) of people who are sick test positive and \(97\%\) of the healthy people test negative. The doctor knows that only \(8\) in \(50\) people in the country are sick.

If the patient tests positive, what is the probability that they are not sick?

(Question ID: 0256889506)

A patient goes to see a doctor. The doctor performs a test with \(92\%\) reliability, that is, \(92\%\) of people who are sick test positive and \(92\%\) of the healthy people test negative. The doctor knows that only \(1\) in \(50\) people in the country are sick.

If the patient tests positive, what is the probability that they are not sick?

(Question ID: 0203482805)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.002\). If an item is unsafe, the probability is \(0.992\) that it will fail the inspection. If an item is safe, the probability is \(0.9989\) that it will pass the inspection.

If an item is fails inspection, what is the probability it is unsafe?

(Question ID: 0332929845)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(83\%\) are sent via the standard service. A package sent via the standard service has a 0.1 probability of arriving late, and a package sent via the express service has a 0.04 probability of arriving late.

If an package arrives late, what is the probability it was sent via the standard service?

(Question ID: 0698307113)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(62\%\) are sent via the standard service. A package sent via the standard service has a 0.24 probability of arriving late, and a package sent via the express service has a 0.06 probability of arriving late.

If an package arrives on time, what is the probability it was sent via the express service?

(Question ID: 0645754304)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(53\%\) are sent via the standard service. A package sent via the standard service has a 0.13 probability of arriving late, and a package sent via the express service has a 0.01 probability of arriving late.

If an package arrives on time, what is the probability it was sent via the standard service?

(Question ID: 0288307729)

A patient goes to see a doctor. The doctor performs a test with \(98\%\) reliability, that is, \(98\%\) of people who are sick test positive and \(98\%\) of the healthy people test negative. The doctor knows that only \(1\) in \(10000\) people in the country are sick.

If the patient tests positive, what is the probability that they are sick?

(Question ID: 0361256251)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(51\%\) are sent via the standard service. A package sent via the standard service has a 0.14 probability of arriving late, and a package sent via the express service has a 0.03 probability of arriving late.

If an package arrives late, what is the probability it was sent via the standard service?

(Question ID: 0420442811)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(74\%\) are sent via the standard service. A package sent via the standard service has a 0.21 probability of arriving late, and a package sent via the express service has a 0.03 probability of arriving late.

If an package arrives on time, what is the probability it was sent via the express service?

(Question ID: 0013796552)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(52\%\) are sent via the standard service. A package sent via the standard service has a 0.11 probability of arriving late, and a package sent via the express service has a 0.1 probability of arriving late.

If an package arrives on time, what is the probability it was sent via the express service?

(Question ID: 0249895356)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.01\). If an item is unsafe, the probability is \(0.978\) that it will fail the inspection. If an item is safe, the probability is \(0.9821\) that it will pass the inspection.

If an item is fails inspection, what is the probability it is safe?

(Question ID: 0825086368)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(72\%\) are sent via the standard service. A package sent via the standard service has a 0.21 probability of arriving late, and a package sent via the express service has a 0.17 probability of arriving late.

If an package arrives on time, what is the probability it was sent via the express service?

(Question ID: 0384408562)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.05\). If an item is unsafe, the probability is \(0.992\) that it will fail the inspection. If an item is safe, the probability is \(0.9923\) that it will pass the inspection.

If an item is passes inspection, what is the probability it is safe?

(Question ID: 0927399732)

A patient goes to see a doctor. The doctor performs a test with \(97\%\) reliability, that is, \(97\%\) of people who are sick test positive and \(97\%\) of the healthy people test negative. The doctor knows that only \(1\) in \(50\) people in the country are sick.

If the patient tests positive, what is the probability that they are sick?

(Question ID: 0349394837)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(53\%\) are sent via the standard service. A package sent via the standard service has a 0.05 probability of arriving late, and a package sent via the express service has a 0.03 probability of arriving late.

If an package arrives late, what is the probability it was sent via the express service?

(Question ID: 0693164684)

A delivery company offers customers the ability to send their packages either using a standard service or an express service. Of all package sent, \(67\%\) are sent via the standard service. A package sent via the standard service has a 0.21 probability of arriving late, and a package sent via the express service has a 0.05 probability of arriving late.

If an package arrives late, what is the probability it was sent via the standard service?

(Question ID: 0343828447)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.08\). If an item is unsafe, the probability is \(0.981\) that it will fail the inspection. If an item is safe, the probability is \(0.9914\) that it will pass the inspection.

If an item is passes inspection, what is the probability it is unsafe?

(Question ID: 0215750120)

A patient goes to see a doctor. The doctor performs a test with \(94\%\) reliability, that is, \(94\%\) of people who are sick test positive and \(94\%\) of the healthy people test negative. The doctor knows that only \(6\) in \(1000\) people in the country are sick.

If the patient tests positive, what is the probability that they are not sick?

(Question ID: 0895372549)

A patient goes to see a doctor. The doctor performs a test with \(98\%\) reliability, that is, \(98\%\) of people who are sick test positive and \(98\%\) of the healthy people test negative. The doctor knows that only \(5\) in \(100\) people in the country are sick.

If the patient tests negative, what is the probability that they are not sick?

(Question ID: 0287462430)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.002\). If an item is unsafe, the probability is \(0.989\) that it will fail the inspection. If an item is safe, the probability is \(0.9872\) that it will pass the inspection.

If an item is passes inspection, what is the probability it is safe?

(Question ID: 0822907323)

A patient goes to see a doctor. The doctor performs a test with \(98\%\) reliability, that is, \(98\%\) of people who are sick test positive and \(98\%\) of the healthy people test negative. The doctor knows that only \(6\) in \(1000\) people in the country are sick.

If the patient tests negative, what is the probability that they are not sick?

(Question ID: 0527678432)

A patient goes to see a doctor. The doctor performs a test with \(92\%\) reliability, that is, \(92\%\) of people who are sick test positive and \(92\%\) of the healthy people test negative. The doctor knows that only \(2\) in \(1000\) people in the country are sick.

If the patient tests negative, what is the probability that they are not sick?

(Question ID: 0134753480)

A quality control program at a food production facility involves inspecting the finished product for safety. The proportion of items that are actually unsafe is \(0.0006\). If an item is unsafe, the probability is \(0.984\) that it will fail the inspection. If an item is safe, the probability is \(0.9821\) that it will pass the inspection.

If an item is fails inspection, what is the probability it is safe?

(Question ID: 0735960393)

Self Assessment 4.8

Let \(A\) and \(B\) be events with \(P(A) = 0.422\) and \(P(A \cap B) = 0.151\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0557618415)

Let \(A\) and \(B\) be events with \(P(A) = 0.786\) and \(P(A \cap B) = 0.075\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0159545494)

Let \(A\) and \(B\) be events with \(P(A) = 0.608\) and \(P(A \cap B) = 0.407\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0735421686)

Let \(A\) and \(B\) be events with \(P(A) = 0.699\) and \(P(A \cap B^C) = 0.531\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0327402895)

Let \(A\) and \(B\) be events with \(P(A) = 0.277\) and \(P(A \cap B^C) = 0.125\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0023844024)

Let \(A\) and \(B\) be events with \(P(A) = 0.758\) and \(P(A \cap B^C) = 0.187\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0427847288)

Let \(A\) and \(B\) be events with \(P(A) = 0.586\) and \(P(A \cap B) = 0.044\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0352582019)

Let \(A\) and \(B\) be events with \(P(A) = 0.811\) and \(P(A \cap B) = 0.486\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0554013027)

Let \(A\) and \(B\) be events with \(P(A) = 0.733\) and \(P(A \cap B) = 0.629\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0336110463)

Let \(A\) and \(B\) be events with \(P(A) = 0.873\) and \(P(A \cap B) = 0.243\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0526366277)

Let \(A\) and \(B\) be events with \(P(A) = 0.97\) and \(P(A \cap B^C) = 0.594\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0748555644)

Let \(A\) and \(B\) be events with \(P(A) = 0.37\) and \(P(A \cap B) = 0.132\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0795338181)

Let \(A\) and \(B\) be events with \(P(A) = 0.257\) and \(P(A \cap B^C) = 0.2\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0517999320)

Let \(A\) and \(B\) be events with \(P(A) = 0.691\) and \(P(A \cap B^C) = 0.627\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0994570140)

Let \(A\) and \(B\) be events with \(P(A) = 0.473\) and \(P(A \cap B) = 0.046\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0297036505)

Let \(A\) and \(B\) be events with \(P(A) = 0.441\) and \(P(A \cap B) = 0.224\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0654881379)

Let \(A\) and \(B\) be events with \(P(A) = 0.5\) and \(P(A \cap B) = 0.223\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0376520463)

Let \(A\) and \(B\) be events with \(P(A) = 0.654\) and \(P(A \cap B) = 0.114\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0709375884)

Let \(A\) and \(B\) be events with \(P(A) = 0.662\) and \(P(A \cap B) = 0.279\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0292426419)

Let \(A\) and \(B\) be events with \(P(A) = 0.97\) and \(P(A \cap B^C) = 0.381\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0892116881)

Let \(A\) and \(B\) be events with \(P(A) = 0.782\) and \(P(A \cap B^C) = 0.337\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0874103616)

Let \(A\) and \(B\) be events with \(P(A) = 0.744\) and \(P(A \cap B) = 0.191\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0851466330)

Let \(A\) and \(B\) be events with \(P(A) = 0.547\) and \(P(A \cap B) = 0.452\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0587288467)

Let \(A\) and \(B\) be events with \(P(A) = 0.995\) and \(P(A \cap B^C) = 0.386\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0052111030)

Let \(A\) and \(B\) be events with \(P(A) = 0.442\) and \(P(A \cap B) = 0.371\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0828368775)

Let \(A\) and \(B\) be events with \(P(A) = 0.282\) and \(P(A \cap B) = 0.175\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0799830713)

Let \(A\) and \(B\) be events with \(P(A) = 0.324\) and \(P(A \cap B) = 0.053\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0654847334)

Let \(A\) and \(B\) be events with \(P(A) = 0.433\) and \(P(A \cap B^C) = 0.058\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0742221575)

Let \(A\) and \(B\) be events with \(P(A) = 0.763\) and \(P(A \cap B) = 0.29\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0868176211)

Let \(A\) and \(B\) be events with \(P(A) = 0.555\) and \(P(A \cap B^C) = 0.268\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0876295575)

Let \(A\) and \(B\) be events with \(P(A) = 0.868\) and \(P(A \cap B^C) = 0.8\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0680868458)

Let \(A\) and \(B\) be events with \(P(A) = 0.929\) and \(P(A \cap B) = 0.505\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0881162347)

Let \(A\) and \(B\) be events with \(P(A) = 0.742\) and \(P(A \cap B^C) = 0.069\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0886135032)

Let \(A\) and \(B\) be events with \(P(A) = 0.609\) and \(P(A \cap B^C) = 0.532\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0846968560)

Let \(A\) and \(B\) be events with \(P(A) = 0.388\) and \(P(A \cap B) = 0.268\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0648070514)

Let \(A\) and \(B\) be events with \(P(A) = 0.394\) and \(P(A \cap B) = 0.33\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0671610345)

Let \(A\) and \(B\) be events with \(P(A) = 0.758\) and \(P(A \cap B^C) = 0.546\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0285842660)

Let \(A\) and \(B\) be events with \(P(A) = 0.456\) and \(P(A \cap B^C) = 0.282\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0455414534)

Let \(A\) and \(B\) be events with \(P(A) = 0.125\) and \(P(A \cap B^C) = 0.014\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0479441564)

Let \(A\) and \(B\) be events with \(P(A) = 0.991\) and \(P(A \cap B) = 0.116\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0445484998)

Let \(A\) and \(B\) be events with \(P(A) = 0.87\) and \(P(A \cap B) = 0.06\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0170025117)

Let \(A\) and \(B\) be events with \(P(A) = 0.897\) and \(P(A \cap B) = 0.094\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0325052601)

Let \(A\) and \(B\) be events with \(P(A) = 0.985\) and \(P(A \cap B^C) = 0.875\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0984839418)

Let \(A\) and \(B\) be events with \(P(A) = 0.179\) and \(P(A \cap B) = 0.076\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0724051314)

Let \(A\) and \(B\) be events with \(P(A) = 0.27\) and \(P(A \cap B^C) = 0.109\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0730328935)

Let \(A\) and \(B\) be events with \(P(A) = 0.653\) and \(P(A \cap B^C) = 0.146\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0448748328)

Let \(A\) and \(B\) be events with \(P(A) = 0.994\) and \(P(A \cap B) = 0.582\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0345216382)

Let \(A\) and \(B\) be events with \(P(A) = 0.675\) and \(P(A \cap B^C) = 0.418\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0365130704)

Let \(A\) and \(B\) be events with \(P(A) = 0.34\) and \(P(A \cap B) = 0.205\).

For what value of \(P(B)\) will \(A\) and \(B\) be independent?

(Question ID: 0517498869)

Let \(A\) and \(B\) be events with \(P(A) = 0.527\) and \(P(A \cap B^C) = 0.238\).

For what value of \(P(B^C)\) will \(A\) and \(B\) be independent?

(Question ID: 0209572362)


  1. Note, this is another result which seems to (on the surface) defy expectations. We will see this again later on in this chapter in a slightly different context. It seems strange that company \(A\) is more likely to be late, and yet, we are more likely to see a late package that is sent by company \(B\) than a late package that is sent by company \(A\). The reason for this is that company \(B\) is used much more frequently than company \(A\), which overcomes the added likelihood of company \(A\) being late.↩︎

  2. For instance, \(5\) is neither a multiple of \(2\) nor of \(3\).↩︎

  3. For instance, \(6\) is a multiple of both \(2\) and \(3\).↩︎

  4. Bayes’ Theorem is named in the same way that the Bayesian interpretation of probability is, and that Bayesian statistics is more broadly. The connections are more than merely surface: Bayes’ Theorem can be viewed as the primary technique with which we can update subjective beliefs about the world. Importantly, however, even those who use a Frequentist view of statistics accept the math of Bayes’ Theorem, and use it frequently.↩︎

  5. In fact, I would go as far as to suggest that learning Bayes’ Theorem by itself is less important than fully grasping the definition of conditional probability alongside the multiplication rule and the law of total probability. I myself do not remember Bayes’ Theorem directly, but can write it down directly from these definitions without any thought.↩︎

  6. It is worth drawing attention to the language that we have started to use at this point in the notes regarding “events”. In this case our sample space would actually be formed using pairs of information. In particular, we might have \[\mathcal{S} = \{(\text{Pos. Test}, \text{Illness}), (\text{Neg. Test}, \text{Illness}),(\text{Pos. Test}, \text{No Illness}),(\text{Neg. Test}, \text{No Illness})\}.\] Then the event \(A\) here is actually \(A = \{(\text{Pos. Test}, \text{Illness}),(\text{Pos. Test}, \text{No Illness})\}\) and \(B\) is \(\{(\text{Pos. Test}, \text{Illness}),(\text{Neg. Test}, \text{Illness})\}\). It is far more clunky to make explicit these events, and so we move towards using more natural language. Until you feel confident that you can identify the specific outcomes associated with the events of interest, it is worth writing these out in full.↩︎

  7. Since \(P(A|B^C) = 1 - P(A^C|B^C) = 1 - 0.95 = 0.05\).↩︎

  8. This counterintuitive fact was an intensely frustrating reality for statisticians everywhere during the height of the COVID-19 pandemic, when politicians and the population at large turned away from testing owing to its perceived “ineffectiveness”. The quantity of interest for knowing how good a test is, is \(P(A|B)\) and \(P(A^C|B^C)\). However, if a disease is sufficiently rare, with \(P(B)\) sufficiently small, then no matter how effective the tests are you will likely have \(P(B|A)\) to be low. Note that \(P(B|A) >> P(B)\) in the example, and this will also be true in general. A single test cannot say with certainty, however, they are an incredibly effective tool at reducing our uncertainty.↩︎

  9. Note that this is another example of a counterintuitive result. Here, \(A\) is far more likely to be late, but it is more likely that \(B\) delivered a late package to us than \(A\) because of the base rates. That is, \(P(B)\) is much higher to begin than \(P(A)\), and so that is hard to overcome. It is worth noting, however, that originally \(P(B)\) is \(9\) times more likely than \(P(A)\), and after knowing that package is late it is \(\dfrac{6}{5} = 1.2\) times more likely. This major reduction in the relative likelihood is owed to how much more likely \(A\) is to deliver late packages than \(B\).↩︎

  10. Mixing up \(P(B|A)\) and \(P(A|B)\) is often called confusion of the inverse, and it can lead to very faulty conclusions when ignored. In the medical testing example above, it is important to not confuse “the probability that the test returns positive, assuming you have the illness” with “the probability that you have the illness, given that the test returns positive.” This type of faulty logic has long been used to justify discriminatory behaviour in medicine, the law, and so on. A thorough understanding of statistics and probability helps to ensure that these types of errors are not made, and gives you the tools to push-back on the spots where people are making these arguments incorrectly, particularly when the stakes are high or harm is being done.↩︎

  11. When discussing Bayes’ Theorem we introduced two frequently occurring sources of error when reasoning about probabilities, and showed how to remedy them. Confusion of the inverse occurs when you mix up \(P(A|B)\) with \(P(B|A)\), and can lead to disastrous consequences. The base rate fallacy occurs when you fail to take into account the rareness of the marginal events and instead only consider the conditionals (such as seeing that delivery company \(A\) was more likely to be late than \(B\), without considering that \(B\) was more likely to be used than \(A\)). These are both examples of the challenges at the heart of probability and statistics: namely, the subjects are fairly unintuitive once moving beyond the basics. As a result, we need to rely on building up our intuitions over time by making use of the formal rules that we are able to derive.↩︎

  12. Though, perhaps the world will freeze over if (insert-your-least-favourite-team-here) wins the (insert-the-name-of-the-championship-for-the-league-you-care-about-here)?↩︎

  13. Namely, we know that all of the others are then impossible.↩︎

  14. Note, the values in this table are roughly inspired by a subset of the Fall 2022 University of New Brunswick Enrolment Numbers.↩︎

  15. This provides further emphasis for the utility of urn models. If you imagine an urn that has balls with two or more traits (say like those in Example 4.3), then randomly selecting a single ball from this population has an analogous probability distribution.↩︎

  16. That is, the number of students who are enrolled in that faculty, in that year of study.↩︎

  17. Faculty.↩︎

  18. Year of study.↩︎

  19. It is worth reemphasizing what this probability actually means. If we were to randomly sample, with equal probability, individuals from this population then the probability that an individual selected has both the faculty and year specified is given by the joint probability. That is to say, if we did this over and over and over again (with replacement) in the long-term, these probabilities represent proportion of time those combinations would be observed.↩︎

  20. Note here, \(A=4\) has been filled in.↩︎

  21. We have filled in \(C = 0\) here.↩︎

  22. This uses \(A=4\) and \(C=0\).↩︎

  23. Note, we could also apply Bayes’ Theorem to find the result here. We know that \(P(\text{Green}|\text{Pyramid}) = 4/13\), that \(P(\text{Green}) = 15/50\) and that \(P(\text{Pyramid}) = 13/50\). Thus, Bayes’ Theorem gives \[P(\text{Pyramid}|\text{Green}) = \dfrac{(4/13)(13/50)}{15/50} = \dfrac{4}{15},\] exactly as in the direct derivation.↩︎