Summations

02

Jumble of coins

In my pocket I have a jumble of coins: 5 dimes, 4 quarters, 3 nickels, 3 pennies, and one big 50¢-piece. I draw three at random. What is the expected value of the three?

Link to original

Central limit theorem

09

Burning through light bulbs

A 100 Watt light bulb’s expected lifetime is 600 hours, with variance 360,000. An advertising board uses one of these light bulbs at a time, and when one burns out, it is immediately replaced with another. (The lifetime of each bulb is independent from the others.) Let the continuous random variable be the total number of hours of advertising from 10 bulbs.

(a) Find the expected value of .

(b) Find the variance of .

(c) Use the CLT to approximate the probability that is less than 5,500 hours. (You should decide whether it is appropriate to use the continuity correction.)

Link to original

02

De Moivre-Laplace Continuity Correction

A fair die is rolled 300 times.

Use a normal approximation to estimate the probability that exactly 100 outcomes are either 3 or 6.

Do this with and without the continuity correction.

Link to original

01

Normal approximation - Eating hot dogs

Frank is a competitive hot dog eater. He eats in with .

What is the probability that Frank manages to consume in or less, in an upcoming competition? Use a normal approximation from the CLT to estimate this probability.

State the reason that the normal approximation is applicable.

Link to original

11

Community college ages

At a community college, the mean age of the students is 22.3 years, and the standard deviation is 4 years. A random sample of 64 students is drawn.

What is the probability that the average age of the students in the random sample is less than 23 years?

Link to original

12

Winning the lottery

Suppose a lottery game requires that you purchase a $10 game card and advertises a 10% probability of winning a prize.

Use the Central Limit Theorem and the continuity correction to approximate the probability of winning at least 20 times when you purchase 100 of these game cards.

Link to original

Sample Mean, Tails, Law of Large Numbers

01

Deviation estimation - Exponential

Let with .

(a) Compute the Markov bound on .

(b) Compute the Chebyshev bound on .

(c) Find the exact value of and compare with yours answers in (a) and (b).

Link to original

03

Deviation estimation - Factory production

Suppose a factory produces an average of items per week.

(a) How likely is it that more than 75 items are produced this week? (Find an upper bound.)

(b) Suppose the variance is known to be 25. Now what can you say about (a)? (Hint: Monotonicity.)

(c) What do you know about the probability that the number of items produced differs from the average by at most 10?

Link to original

07

Math contest scores

At a high school math competition, students take a test with 10 questions. Each question is worth one point and the probability of a student getting any one question correct is 0.55, independent of the other questions.

(a) Find the variance of , the average score for 15 students.

(b) Use the Law of Large Numbers to find an upper bound for the probability that is greater than 6.

Link to original

Significance testing

01

Testing paperclips - Likelihood of error

A factory assembly line machine is cutting paperclips to length before folding. Each paperclip is supposed to be long. The length of paperclips is approximately normally distributed with standard deviation .

(a) Design a significance test with that is based on the average of 5 measurements (sample mean). What is the rejection region? What is the probability of Type I error?

(b) What is the probability of Type II error, given that the average paperclip length on the machine is actually ?

Link to original

04

Blue eyes

A redditor claims that 10% of people have blue eyes, but you think it is not that many. You work at the DMV for the summer, so you write down the eye color recorded on drivers’ licenses of various people in the database.

(a) Suppose you record the eye color of 1000 people and let be the number that are blue. If the rejection region is , what is the significance level of the test?

(b) Take again the experiment in (a). If you want a significance level of , what should the rejection region be in your test?

(c) Suppose the fact is that 7% of people have blue eyes. How likely is it that your test in (b) rejects ?

Link to original

05

Shipping time test

The number of days it takes for a package to arrive after being shipped with a particular company is a random variable, . When the shipping process is operating at full capacity and delays are not common, the PMF of is given in the following table:

123456789
0.0410.2290.3790.2370.0450.0210.0190.0170.012

Design a significance test at the level that uses the value of X for one package to test the null hypothesis: the shipping process is operating at full capacity. You should clearly state which values of X are in the rejection region.

Link to original

06

Bits received in error

In a digital communication channel, it is assumed that a bit is received in error with probability . Someone challenges this hypothesis: they believe the error rate is higher than . Assume 100,000 bits are transmitted. Design a one-tailed significance test using and , the number of bits received in error, to decide whether to reject the hypothesis that the error rate is . Your rejection region should be of the form . You do not have to use the continuity correction.

Link to original

Binary hypothesis testing

01

Identifying Uranium

You are testing gram samples of pure Uranium to see if they are enriched. You have a Geiger counter that counts a number of gamma rays that come from nearby fission events in 1 second intervals after you press the count button.

If the sample is enriched, you expect a Poisson distribution of gamma rays in the counter with an average of 20. If the sample is not enriched (the null hypothesis), the average count will be 10.

(a) Design an ML test to decide whether it is ordinary or enriched (). What is ? What are the probabilities of Type I, Type II, and Total error?

(b) After running the test many times, you have noticed that 70% of the samples are ordinary, while 30% are enriched. Now design an MAP test. What is ? What are the probabilities of Type I, Type II, and Total error?

(c) Missing a bit of enriched Uranium is obviously a major problem. The damage to your reputation and pocketbook of missing enriched Uranium is the damage caused by incorrectly labeling ordinary Uranium as enriched. Now design an MC test. What is ? What are the probabilities of Type I, Type II, and Total error?

(d) What is the expected cost of each application of the MC test, assuming the cost of a false alarm is $10,000? What is this number for the MAP test?

Link to original

03

Security screening

A metal detector for an event produces a reading, , that varies between 0 and 10 according to the PDFs given below. (Note is a continuous random variable.)

Based on the reading, a security guard will stop and search a person or let them pass. Suppose it is known that 10% of people passing through security are carrying metal objects.

a person is not carrying metal objects a person is carrying metal objects

Suppose it is 20 times worse to neglect searching someone who is carrying metal than to search someone who is not carrying metal. Design a minimum cost test that uses the value of the reading, X to decide whether the security guard will stop that person. Clearly state the decision rule.

Link to original

04

Medical testing

A doctor is planning to use a new, inexpensive medical test to detect a particular disease. The test score, , tends to be higher for patients with the disease. The PMFs for the test score for patients with and without the disease are shown below. From a previously used, more expensive test, it is known that 20% of the population has this disease.

Patients without the disease:

12345
0.50.30.150.050

Patients with the disease:

12345
0.050.10.30.350.2

Design a binary hypothesis test that will minimize the doctor’s probability of error. Let : the patient does not have the disease and : the patient does have the disease. Determine for which test scores the doctor should diagnose the patient as having the disease. Clearly denote which scores result in which decisions.

Link to original

Mean square error

02

Estimates from joint PDF

Suppose and have the following joint PDF:

(a) Find and the blind estimate .

(b) Compute , the MMSE estimate of assuming the event .

(c) Find and the blind estimate .

(d) Compute , the MMSE estimate of assuming the event .

Link to original

03

MMSE exact estimator from joint PDF

Suppose and have the following joint PDF:

(a) What is , the MMSE estimate of given ?

(b) What is , the MMSE estimate of given ?

Link to original

04

MMSE linear estimator from joint PDF

Suppose and have the following joint PDF:

(a) What is , the MMSE linear estimator of in terms of ?

(b) What is , the MMSE linear estimator of in terms of ?

Link to original

05

MMSE linear estimator from joint PMF

Suppose and have the following joint PMF:

0
0

(a) Find the minimal MSE linear estimator for in terms of .

(b) What is the MMSE error for this linear estimator?

(c) Use (a) to estimate given and .

Link to original