CSE 312 – Section 7


Spring 2026

Review of Main Concepts

Announcements & Plan for Section

Announcements

Plan for Section

Midterm Prep Resources

1  Content Review - understanding the Central Limit Theorem

The Central Limit Theorem (CLT) is one of the most powerful results in probability because it allows us to approximate the distribution of sums and averages of independent random variables, regardless of their original distribution.

a)
(4 points) You roll a fair, 6-sided die 100 times. The outcome of a single roll has a discrete uniform distribution with a mean of \(\mu = 3.5\) and a variance of \(\sigma ^2 \approx 2.92\). Let \(\bar {X}\) be the average value of your 100 rolls. According to the Central Limit Theorem, which of the following best describes the approximate distribution of \(\bar {X}\)?
(a)
A discrete uniform distribution from 1 to 6.
(b)
A normal distribution with mean \(3.5\) and variance \(2.92\).
(c)
A normal distribution with mean \(3.5\) and variance \(0.0292\).
(d)
A normal distribution with mean \(350\) and variance \(292\).
b)
(4 points) A shipping company loads 100 identical packages onto a truck. The weight of a single package is a random variable with a mean of \(20\) lbs and a standard deviation of \(5\) lbs. Let \(S\) be the total combined weight of all 100 packages. Using the CLT, what is the approximate distribution of \(S\)?
(a)
\(S \approx \mathcal {N}(2000, 500)\)
(b)
\(S \approx \mathcal {N}(2000, 2500)\)
(c)
\(S \approx \mathcal {N}(20, 0.25)\)
(d)
\(S \approx \mathcal {N}(2000, 5)\)
c)
(4 points) The waiting time for a bus is modeled by an Exponential distribution, which is heavily right-skewed. If you record the waiting times for \(64\) independent bus rides and calculate your average waiting time \(\bar {X}\), what will the shape of the distribution of \(\bar {X}\) look like?
(a)
It will be exactly Exponential, because the sum of exponentials is exponential.
(b)
It will be heavily right-skewed, matching the underlying population.
(c)
It will be approximately a bell-shaped Normal curve.
(d)
It will be perfectly uniform, since all averages balance out.
d)
(4 points) A researcher takes a sample of size \(n = 36\) from a population with mean \(\mu = 50\) and variance \(\sigma ^2 = 144\). They want to find the probability that the sample mean \(\bar {X}\) is greater than \(54\). To use the Standard Normal table, they must convert \(\bar {X}\) to a standard \(Z\)-score. Which of the following is the correct calculation for \(Z\)?
(a)
\(Z = \frac {54 - 50}{144 / \sqrt {36}}\)
(b)
\(Z = \frac {54 - 50}{12 / 36}\)
(c)
\(Z = \frac {54 - 50}{12 / \sqrt {36}}\)
(d)
\(Z = \frac {54 - 50}{144 / 36}\)

2  More review

(a)
(4 points) What does the Central Limit Theorem fundamentally guarantee about a sequence of i.i.d. random variables as the sample size \(n\) becomes very large?
i.
The distribution of the individual random variables gradually becomes normal.
ii.
The sample mean gets closer and closer to the true expected value of the distribution.
iii.
The distribution of the sample mean (or sample sum) approaches a normal distribution, regardless of the original distribution’s shape.
iv.
The sum of any two normally distributed random variables will also be normally distributed.
(b)
(4 points) Suppose the weight of an apple in an orchard has an unknown, highly skewed distribution with an expected value of 150 grams and a standard deviation of 20 grams. You pick a random sample of 100 apples. Let \(S\) be the total weight of these 100 apples. Using the CLT, what is the approximate probability that the total weight exceeds 15,200 grams?

Hint: First, find the expected value \(\expect {S}\) and the variance \(\Var {S}\) for the sum of the 100 apples. Then standardize to find a Z-score (. Assume \(\Phi (1) \approx 0.84\).

i.
\(0.84\)
ii.
\(0.16\)
iii.
\(0.50\)
iv.
\(0.05\)
(c)
(4 points) A computer server processes requests with an expected response time of 40 milliseconds and a standard deviation of 12 milliseconds. The distribution of response times is heavily right-skewed (an exponential-like tail). If you take a random sample of 36 requests, what is the approximate distribution of the sample mean, \(\bar {X}\)?

Hint: Remember that the variance of the sample mean is \(\Var {(\bar {X})} = \frac {\sigma ^2}{n}\). What is its standard deviation (the standard error)?

i.
Heavily right-skewed with a mean of 40 and a standard deviation of 12.
ii.
Approximately Normal with a mean of 40 and a standard deviation of 12.
iii.
Approximately Normal with a mean of 40 and a standard deviation of 2.
iv.
Approximately Normal with a mean of 1440 and a standard deviation of 72.

3  Round off error

Let \(X\) be the sum of 100 real numbers, and let \(Y\) be the same sum, but with each number rounded to the nearest integer before summing. If the roundoff errors are independent and uniformly distributed between \(-0.5\) and \(0.5\), what is the approximate probability that \(|X - Y| > 3\)?

4  Bad Computer

Each day, the probability your computer crashes is 10%, independent of every other day. Suppose we want to evaluate the computer’s performance over the next \(100\) days.

a)
Let \(X\) be the number of crash-free days in the next \(100\) days. What distribution does \(X\) have? Identify \(\expect {X}\) and \(Var(X)\) as well. Write an exact (possibly unsimplified) expression for \(\Pr (X\ge 87)\).
b)
Approximate the probability of at least 87 crash-free days out of the next 100 days using the Central Limit Theorem. Use continuity correction.

Important: continuity correction says that if we are using the normal distribution to approximate \[\Pr (a \le \sum _{i=1}^n X_i \le b)\] where \(a \le b\) are integers and the \(X_i\)’s are i.i.d. discrete random variables taking integer values, then, as our approximation, we should use \[ \Pr ( a-0.5 \le Y \le b+ 0.5)\] where \(Y\) is the appropriate normal distribution that \( \sum _{i=1}^n X_i \) converges to by the Central Limit Theorem. (The intuition here is that, to avoid a mismatch between discrete distributions (whose range is a set of integers) and continuous distributions, we get a better approximation by imagining that a discrete random variable, say \(W\), is a continuous distribution with density function \[f_W(x) := p_W (i) \quad \text { when }i-0.5 \le x < i + 0.5 \text { and $i$ integer} \])

For more details see pages 209-210 in the Tsun book.

5  Tweets

A prolific twitter user tweets approximately 350 tweets per week. Let’s assume for simplicity that the tweets are independent, and each consists of a uniformly random number of characters between 10 and 140. (Note that this is a discrete uniform distribution.) Thus, the central limit theorem (CLT) implies that the number of characters tweeted by this user is approximately normal with an appropriate mean and variance. Assuming this normal approximation is correct, estimate the probability that this user tweets between 26,000 and 27,000 characters in a particular week. (This is a case where continuity correction will make virtually no difference in the answer, but you should still use it to get into the practice!).

6  Ping Pong

You’re playing ping pong with your friend, and want to keep playing until you’ve scored 15 points. Unfortunately, your friend is a much more skilled ping pong player than you, so you only win points 25% of the time (with each point being independent of the other points). Approximate the probability that you’ll need to play at least 50 points before stopping.

7  More normal stuff (10 points)

Let \(X\) be a normal random variable with mean 12 and variance 4. Find the value of \(c\) such that \(\Prob {X < c} = 0.1\)

8  CLT for stocks (10 points)

Suppose that the daily price change of a certain stock on the stock market is a random variable with mean 0 and variance \(\sigma ^2\). Thus, if \(Y_n\) is the price of the stock on the \(n\)-th day, then \[Y_n = Y_{n-1} + X_n,\quad n \ge 1\] where \(X_1, X_2, \ldots \) are independent, identically distributed random variables with mean 0 and variance \(\sigma ^2\). Suppose also that today’s stock price is 100 and \(\sigma ^2 = 16\). Use the Central Limit Theorem to estimate the probability that the stock price will exceed 110 after 10 days.