Theory 1

Conditional probability

The conditional probability of “B given A” is defined by:

P[B|A]=P[BA]P[A]

This conditional probability P[B|A] represents the probability of event B taking place given the assumption that A took place. (All within the given probability model.)

By letting the actuality of event A be taken as a fixed hypothesis, we can define a conditional probability measure by plugging events into the slot of B:

P[|A]=P[A]P[A]

It is possible to verify each of the Kolmogorov axioms for this function, and therefore P[|A] itself defines a bona fide probability measure.

Conditioning

What does it really mean?

Conceptually, P[B|A] corresponds to creating a new experiment in which we run the old experiment and record data only those times that A happened. Or, it corresponds to finding ourselves with knowledge or data that A happened, and we seek our best estimates of the likelihoods of other events, based on our existing model and the actuality of A.

Mathematically, P[B|A] corresponds to restricting the probability function to outcomes in A, and renormalizing the values (dividing by P[A]) so that the total probability of all the outcomes (in A) is now 1.

The definition of conditional probability can also be turned around and reinterpreted:

Multiplication rule

P[AB]=P[A]P[B|A]

“The probability of A AND B equals the probability of A times the probability of B-given-A.”

This principle generalizes to any events in sequence:

Generalized multiplication rule

P[A1A2A3]=P[A1]P[A2|A1]P[A3|A1A2] P[A1An]=P[A1]P[A2|A1]P[A3|A1A2]P[An|A1An1]

The generalized rule can be verified like this. First substitute A2 for B and A1 for A in the original rule. Now repeat, substituting A3 for B and A1A2 for A in the original rule, and combine with the first one, and you find the rule for triples. Repeat again with A4 and A1A2A3, combine with the triples, and you get quadruples.

Theory 2

Law of Total Probability: 2 cases

For any events A and B:

P[B]=P[A]P[B|A]+P[Ac]P[B|Ac]

This rule can also be called “Division into Cases.”

Interpretation: event B may be divided along the lines of A, with some of P[B] coming from the part in A and the rest from the part in Ac.

This law can be generalized to any partition of the sample space S. A partition is a collection of events Ai which are mutually exclusive and jointly exhaustive:

AiAj=,iAi=S

The generalized formulation of Total Probability for a partition is:

Law of Total Probability: n cases

For a partition Ai of the sample space S:

P[B]=i=1nP[Ai]P[B|Ai]

By setting n=2 with A1=A and A2=Ac, we recover the 2-case Law from the n-case version of the Law.