|
Department of
Mathematics 203 Avery Hall University of Nebraska Lincoln Lincoln, NE 68588-0323 402-472-3731 (voice) 402-472-8466 (fax) |
. where pY (y) is
the marginal probability mass
function of Y .
This section is adapted from: S. K. Ross, Introduction to Probability Models, Third Edition, Academic Press, 1985, Chapter 3, pages 83-103, Probability by L. Breiman, Addison-Wesley, 1968.
Rating: PG
Recall the definition of conditional probability for events: What is the probability that event E will happen, given that event F has already occurred? For finite sample spaces and discrete probability, then our new, restricted sample space is F. The probability of E is proportional to that part lying in F
Taking conditional probabilities of various events with respect to a given event F amounts to choosing F as a new sample space; and we have to multiply all probabilities by 1/ Pr[F] in order to reduce the total probability to 1. This shows that all general theorems on probabilities are valid also for conditional probabilities. For example:
A frequently useful alternative formulation of conditional probability is sometimes called probability by conditioning (more on this later):
To extend and generalize to three events:
Further generalization to more events is obvious.
Another very useful generalization is sometimes called the
“Law of Total Probability”: Let F1,F2,...,Fn
be a set of mutually exclusive events which partition the
sample space, that is, specifically Fi
Fj =
Ø and F1


Fn =
. Then necessarily
and applying the probability by conditioning and adding all the probabilities:
This law of total probability is very useful because the conditional probability of each sub-event is sometimes much easier than a direct calculation of Pr[E].
Example 1 The following example is a classic in elementary probability theory called the Ballot Problem. In an election, candidate A receives n votes and candidate B receives m votes, where m > n, so A is the winner. Assuming that all orderings of vote countings are equally likely, the probability that A is always ahead in the count of votes is (n - m)/(n + m)
It is advisable always to check boundary or extreme cases to determine the reasonableness of a formula such as this. For example if m = 0, then the probability that A is always ahead in the vote count is 1 from the formula. On the other hand, if A wins b only one vote, so n-1 = m, then the probability that A is always ahead is 1/(n + m) = 1/(2n - 1), which is fairly small as we expect. (We will use this case in the next example.)
We can use the Law of Total Probability to compute the probabilities.
Proof: Let Pn,m denote the desired probability. By conditioning on which candidate receives the last vote counted we have
|
Now given that A received the last vote, one can see that the probability that A is always ahead is the same as if A had received n - 1 votes, B received m votes, and A was always ahead, namely Pn-1,m. Likewise the probability that A was always ahead, while B received the last vote is just Pn,m-1. Hence the probability by conditioning is
Note that we have turned the probability problem into
solving a difference equation in two variables. We can then
verify directly that Pn,m = (n
- m)/(n + m)
is a solution. (If that is an unsatisfying approach for
you, then you can solve the difference equation either by
induction starting from the trivial case P1,0 =
1, or by appealing to a direct difference equation solution
method.)
Example 2 Consider a coin-flipping game, where the probability of a head is p and the probability of a tail is 1-p. What is the probability that for the first time on flip 2n after beginning, the total number of heads is the same as the total number of tails? Since the number of heads is the same as the number of tails, the total number of coin-flips is double the number of heads, hence an even number. Note that we do ask that heads be ahead of tails until flip 2n or that tails be ahead of heads, only that they first equalize at flip 2n
Equivalently, consider a gambler starting from an initial stake X0. The gambler wins a dollar when the coin flip is heads with probability p. The gambler loses a dollar when the coin flip is tails with probability 1 - p. Let the outcome of the ith flip be
Then the gambler’s fortune at stage n is X0+
i-1nXi. We
are asking what is the probability that the
gambler’s fortune is again X0
for the first time at step 2n? We do not care if the gambler has
been ahead or behind, only that he gambler’s
fortune again is his starting value at flip 2n.
We compute this by conditioning on the total number of heads in the first 2n flips.
Now clearly all but one of these conditional probabilities are 0. So this reduces nicely to
Now given a total of n heads and n tails in 2n flips, all possible orderings of the heads and tails are equivalent. Thus the single conditional probability we are considering above is the same as ballot-counting in which each candidate receives n votes, but one of the candidates is always ahead until the last vote which ties them. But condition on whomever receives the last vote,
|
Rating: PG-13
The definition of conditional probability immediately motivates the following definition when X and Y are discrete random variables:
Definition 1 The conditional probability mass function of X given Y = y is
Note that there is no particular difficulty in applying this definition even if the discrete probability is a for a random variable assuming infinitely many values (for instance, a Poisson random variable.) However, we take as a convention that conditional probabilities are undefined if the denominator event F, or for discrete distributions, Y = y has zero probability.
If the random variables X and Y are continuous, we could still appeal to the quotient fX,Y (x,y)/fY (y) as the definition of FX|Y (x|y) and argue by analogy. However, this is not a legitimate mathematical approach. Nevertheless, we can justify this conclusion by taking a limit of cumulative distributions
0. Under all these hypotheses,
fX,Y (x,y)/fY (x)
behaves as a conditional density function, justifying our
extension of the definition. However, the number and
variety of hypotheses is unsatisfying.
Rating: R
Looking more closely at
satisfying
Thus, a measurable-theoretic analogue of the law of total probability becomes the definition of conditional probability. A complete discussion of this topic requires a complete understanding of measure theory and is beyond the scope of this course.
.
Suppose two customers arrived in the first hour. What
is the probability that
I check all the information on each page for correctness and typographical errors. Nevertheless, some errors may occur and I would be grateful if you would alert me to such errors. I make every reasonable effort to present current and accurate information for public use, however I do not guarantee the accuracy or timeliness of information on this website. Your use of the information from this website is strictly voluntary and at your risk.
I have checked the links to external sites for usefulness. Links to external websites are provided as a convenience. I do not endorse, control, monitor, or guarantee the information contained in any external website. I don't guarantee that the links are active at all times. Use the links here with the same caution as you would all information on the Internet. This website reflects the thoughts, interests and opinions of its author. They do not explicitly represent official positions or policies of my employer.
Information on this website is subject to change without notice.
Steve Dunbar's Home Page, http://www.math.unl.edu/~sdunbar1Last modified: [an error occurred while processing this directive]