Theory 1

Joint distributions describe the probabilities of events associated with multiple random variables simultaneously.

In this course we consider only two variables at a time, typically called X and Y. It is easy to extend this theory to vectors of n random variables.

Joint PMF and joint PDF

Discrete joint PMF:

PX,Y(x,y)=P[X=x,Y=y]

Continuous joint PDF:

fX,Y(x,y)=density at(x,y)

center

center

Probabilities of events: Discrete case If B2 is a set of points in the plane, then an event is formed by the set of all outcomes s mapped by X and Y to points in B:

={sS|(X(s),Y(s))B}

The probabilities of such events can be measured using the joint PMF:

P[(X,Y)B]=P[]=(x,y)BPX,Y(x,y)

Probabilities of events: Continuous case Let 𝒱=[a,b]×[c,d]2 be the rectangular region defined by (x,y)2 such that axb and cyd. Then:

P[(x,y)𝒱]=P[aXb,cYd]=cdabfX,Y(x,y)dxdy

For more general regions 𝒱2:

P[(X,Y)𝒱]=𝒱fX,Y(x,y)dA

The existence of a variable Y does not change the theory for a variable X considered by itself.

However, it is possible to relate the theory for X to the theory for (X,Y), in various ways.

The simplest relationship is the marginal distribution for X, which is merely the distribution of X itself, considered as a single random variable, but in a context where it is derived from the joint distribution for (X,Y).

Marginal PMF, marginal PDF

Marginal distributions are obtained from joint distributions by summing the probabilities over all possibilities of the other variable.

Discrete marginal PMF:

PX(x)=yPX,Y(x,y)PY(y)=xPX,Y(x,y)

Continuous marginal PMF:

fX(x)=+fX,Y(x,y)dyfY(y)=+fX,Y(x,y)dx

Infinitesimal method

Suppose X has density fX(x) that is continuous at x0. Then for infinitesimal dx:

P[x0<Xx0+dx]=fX(x)dx

Suppose X and Y have joint density fX,Y(x,y) that is continuous at (x0,y0). Then for infinitesimal dx,dy:

P[x0<Xx0+dx,y0<Yy0+dy]=fX,Y(x0,y0)dxdy

Joint densities depend on coordinates

The density fX,Y(x,y) in these integration formulas depends on the way X and Y act as Cartesian coordinates and determine differential areas dxdy as little rectangles.

To find a density fR,Θ(r,θ) in polar coordinates, for example, it is not enough to solve for x(r,θ) and y(r,θ) and plug into fX,Y!

Instead, we must consider the differential area dxdy vs. drdθ. We find that dxdy=rdrdθ.

As an example, the density of the uniform distribution on the unit disk is fR,Θ=rπ, which is not constant as a function of r and θ.

Extra - Joint densities may not exist

It is not always possible to form a joint PDF fX,Y from any two continuous RVs X and Y.

For example, if X=Y, then (X,Y) cannot have a joint PDF, since P[X=Y]=1 but the integral over the region X=Y will always be 0. (The area of a line is zero.)

Theory 2

Joint CDF

The joint CDF of X and Y is defined by:

FX,Y(x,y)=P[Xx,Yy]

We can relate the joint CDF to the joint PDF using integration:

FX,Y(x,y)=yxfX,Y(s,t)dsdt

center

Conversely, if X and Y have a continuous joint PDF fX,Y(x,y) that is also differentiable, we can obtain the PDF from the CDF using partial derivatives:

fX,Y(x,y)=2xyFX,Y(x,y)

There is also a marginal CDF that is computed using a limit:

FX(x)=limy+FX,Y(x,y)

This could also be written, somewhat abusing notation, as FX(x)=FX,Y(x,+).