CSE 525: Randomized Algorithms Spring 2025 Lecture 3: Strong Concentration Bounds Lecturer: Shayan Oveis Gharan 04/02/2026

Disclaimer: These notes have not been subjected to the usual scrutiny reserved for formal publications.

We have seen how knowledge of the variance of a random variable $X$ can be used to control the deviation of $X$ from its mean. This is the heart of the second moment method. But often we can control even higher moments, and this allows us to obtain much stronger concentration properties. A prototypical example is when $X_{1},X_{2},\dots,X_{n}$ is a family of independent (but not necessarily identically distributed) $\{0,1\}$ random variables and $X=X_{1}+X_{2}+\dots+X_{n}$. Let $p_{i}=\mathbb{E}\left[X_{i}\right]$ and define $\mu=\mathbb{E}\left[X\right]=p_{1}+p_{2}+\dots+p_{n}$. In that case, we have the following multiplicative form of the "Chernoff bound".

Theorem 3.1 (Multiplicative Chernoff bound).

For every $\delta\geq 0$, it holds that

$$\mathbb{P}\left[X\geq(1+\delta)\mu\right]\leq\left(\frac{e^{\delta}}{(1+\delta)^{1+\delta}}\right)^{\mu},$$

and for every $0\leq\delta<1$,

$$\mathbb{P}\left[X\leq(1-\delta)\mu\right]\leq\left(\frac{e^{-\delta}}{(1-\delta)^{1-\delta}}\right)^{\mu}.$$

Consequently,

$$\mathbb{P}\left[X\geq(1+\delta)\mu\right]\leq e^{-\delta^{2}\mu/(2+\delta)},\qquad\mathbb{P}\left[X\leq(1-\delta)\mu\right]\leq e^{-\delta^{2}\mu/2}.$$
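As a quick numerical sanity check of the upper-tail statement, the following Python sketch compares the exact tail of a binomial (the special case $p_{1}=\dots=p_{n}=p$) with both forms of the bound. The function names and the sample values $n=160$, $p=1/8$ are illustrative assumptions, not part of the lecture.

```python
import math

def binomial_upper_tail(n, p, threshold):
    """Exact P[X >= threshold] for X ~ Binomial(n, p)."""
    start = max(0, math.ceil(threshold))
    return sum(math.comb(n, j) * p**j * (1 - p)**(n - j)
               for j in range(start, n + 1))

def chernoff_upper(mu, delta):
    """(e^delta / (1+delta)^(1+delta))^mu, evaluated in log space."""
    return math.exp(mu * (delta - (1 + delta) * math.log1p(delta)))

def chernoff_upper_simplified(mu, delta):
    """The weaker but more convenient form exp(-delta^2 mu / (2 + delta))."""
    return math.exp(-delta**2 * mu / (2 + delta))

n, p = 160, 0.125          # hypothetical example values, so mu = 20
mu = n * p
for delta in (0.25, 0.5, 1.0, 2.0):
    exact = binomial_upper_tail(n, p, (1 + delta) * mu)
    print(f"delta={delta}: exact={exact:.3e}, "
          f"Chernoff={chernoff_upper(mu, delta):.3e}, "
          f"simplified={chernoff_upper_simplified(mu, delta):.3e}")
```

For each $\delta$ the three printed numbers should appear in increasing order: the exact tail, then the multiplicative bound, then its simplified form.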

Proof.

Let $t>0$ be a parameter that we choose later. Then

$$\mathbb{P}\left[X\geq(1+\delta)\mu\right]=\mathbb{P}\left[e^{tX}\geq e^{t(1+\delta)\mu}\right]\underset{\text{Markov's Inequality}}{\leq}\frac{\mathbb{E}\left[e^{tX}\right]}{e^{t(1+\delta)\mu}}.\tag{3.1}$$

The first equality uses that $x\mapsto e^{tx}$ is strictly increasing for $t>0$.

Now, we can write

$$\mathbb{E}\left[e^{tX}\right]=\mathbb{E}\left[e^{t\sum_{i}X_{i}}\right]=\mathbb{E}\left[\prod_{i=1}^{n}e^{tX_{i}}\right]\underset{\text{independence}}{=}\prod_{i=1}^{n}\mathbb{E}\left[e^{tX_{i}}\right].$$

Now, observe that for each $i$,

$$\mathbb{E}\left[e^{tX_{i}}\right]=p_{i}e^{t}+(1-p_{i})=1+p_{i}(e^{t}-1)\underset{1+x\leq e^{x}}{\leq}e^{p_{i}(e^{t}-1)}.$$

Plugging this back we obtain

$$\mathbb{E}\left[e^{tX}\right]\leq\prod_{i=1}^{n}e^{p_{i}(e^{t}-1)}=e^{\mu(e^{t}-1)}.$$

Putting back in (3.1), we obtain

$$\mathbb{P}\left[X\geq(1+\delta)\mu\right]\leq\frac{e^{\mu(e^{t}-1)}}{e^{t(1+\delta)\mu}}=e^{\mu(e^{t}-1-(1+\delta)t)}\underset{\text{set }t=\ln(1+\delta)}{=}\left(\frac{e^{\delta}}{(1+\delta)^{1+\delta}}\right)^{\mu}.$$

The other case can be proven similarly. ∎
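For completeness, here is a sketch of the lower-tail case, assuming $0<\delta<1$. It mirrors the argument above with $e^{-tX}$ in place of $e^{tX}$; the choice $t=-\ln(1-\delta)$ plays the role of $t=\ln(1+\delta)$.

```latex
\begin{align*}
\mathbb{P}\left[X \le (1-\delta)\mu\right]
  &= \mathbb{P}\left[e^{-tX} \ge e^{-t(1-\delta)\mu}\right]
   \le \frac{\mathbb{E}\left[e^{-tX}\right]}{e^{-t(1-\delta)\mu}}
   && \text{(Markov, any } t>0\text{)} \\
  &\le e^{\mu(e^{-t}-1) + t(1-\delta)\mu}
   && \text{(since } \mathbb{E}\left[e^{-tX_i}\right] = 1+p_i(e^{-t}-1) \le e^{p_i(e^{-t}-1)}\text{)} \\
  &= e^{\mu\left(-\delta - (1-\delta)\ln(1-\delta)\right)}
   = \left(\frac{e^{-\delta}}{(1-\delta)^{1-\delta}}\right)^{\mu}
   && \text{(set } t = -\ln(1-\delta)\text{)}
\end{align*}
```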

3.1 Giant Connected Components in Erdős-Rényi Graphs

In this section we prove the following theorem.

Theorem 3.2.

Let $\epsilon>0$ be a small enough constant. Let $G$ be an Erdős-Rényi random graph on $n$ vertices with parameter $p$.

1. Let $p=\frac{1-\epsilon}{n}$. Then with high probability (whp) all connected components of $G$ are of size at most $\frac{7}{\epsilon^{2}}\ln n$.

2. Let $p=\frac{1+\epsilon}{n}$. Then whp $G$ contains a path of length at least $\frac{\epsilon^{2}n}{5}$.

We analyze the DFS algorithm to prove the theorem. First, let us recall this algorithm. Fix the natural order $1<2<\dots<n$ on the vertices of $G$; we assume that the algorithm prioritizes vertices according to this order. DFS maintains three sets of vertices: $X$, the set of vertices whose exploration is complete (the explored vertices); $U$, the set of unvisited vertices; and $T=[n]\setminus(X\cup U)$, the set of active vertices, which are kept in a stack.

The algorithm starts with $X=T=\varnothing$ and $U=V=[n]$, and runs until $T\cup U=\varnothing$. At each round of the algorithm, if the set $T$ is non-empty, the algorithm queries $U$ for neighbors of the last vertex $v$ that has been added to $T$, scanning $U$ according to the natural order. If $v$ has a neighbor $u\in U$, the algorithm deletes $u$ from $U$ and inserts it into $T$. If $v$ does not have a neighbor in $U$, then $v$ is popped out of $T$ and is moved to $X$. If $T$ is empty, the algorithm chooses the first vertex of $U$ according to the natural order, deletes it from $U$ and pushes it into $T$. In order to complete the exploration of the graph, once the sets $T$ and $U$ have both become empty (at this stage all connected components of $G$ have been revealed), we make the algorithm query all remaining pairs of vertices in $X=V$ that were not queried before.

The following properties of DFS are immediate:

• At each round of the algorithm one vertex moves, either from $U$ to $T$, or from $T$ to $X$;

• At any time during the algorithm, it has already been revealed that the graph $G$ has no edges between the current set $X$ and the current set of unvisited vertices $U$;

• The set $T$ always spans a path (indeed, when a vertex $u$ is added to $T$, it happens because $u$ is a neighbor of the last vertex $v$ in $T$; thus, $u$ augments the path spanned by $T$, of which $v$ is the last vertex).

Let $N={n\choose 2}$. To prove the theorem we run DFS on a random input $G(n,p)$. That is, we feed the DFS algorithm a sequence of i.i.d. Bernoulli($p$) random variables $Y_{1},\dots,Y_{N}$, so that its $i$-th query is answered positively if $Y_{i}=1$ and negatively otherwise; the graph obtained this way is clearly distributed according to $G(n,p)$. Thus, studying the component structure of $G$ reduces to studying properties of the random sequence $Y=(Y_{1},\dots,Y_{N})$. In particular, observe crucially that as long as $U\neq\varnothing$, every positive answer to a query results in a vertex being moved from $U$ to $T$, and thus after $t$ queries, assuming $U\neq\varnothing$ still, we have $|X\cup T|\geq\sum_{i=1}^{t}Y_{i}$. (This inequality is in fact strict, as the first vertex of each connected component is moved from $U$ to $T$ "for free", i.e., without a positive answer to a query.) On the other hand, since the addition of every vertex to $T$, except the first one in a connected component, is caused by a positive answer to a query, we have at time $t$: $|T|\leq 1+\sum_{i=1}^{t}Y_{i}$.
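The exploration process can be sketched in Python as follows. This is a minimal illustration under stated assumptions rather than a definitive implementation: the function name `dfs_explore` and its return values are hypothetical, vertices are numbered $0,\dots,n-1$, each query draws a fresh Bernoulli($p$) answer on demand, and the final step of querying all remaining pairs is omitted because it does not affect the components.

```python
import random

def dfs_explore(n, p, seed=0):
    """DFS on G(n, p) where edges are revealed only when queried.

    Returns (component_sizes, longest_stack, queries, positives).
    """
    rng = random.Random(seed)
    in_U = [True] * n          # membership in U (unvisited vertices)
    unvisited_count = n
    stack = []                 # T; the last element is the most recent vertex
    scan_pos = [0] * n         # next candidate neighbor for each vertex
    first_unvisited = 0
    component_sizes, current_size = [], 0
    longest_stack = queries = positives = 0

    while unvisited_count > 0 or stack:
        if not stack:
            # T is empty: move the first vertex of U to T "for free".
            while not in_U[first_unvisited]:
                first_unvisited += 1
            in_U[first_unvisited] = False
            unvisited_count -= 1
            stack.append(first_unvisited)
            current_size = 1
        else:
            v = stack[-1]
            found = False
            # Scan U in the natural order, resuming where v left off,
            # so every pair is queried at most once.
            while scan_pos[v] < n and not found:
                u = scan_pos[v]
                scan_pos[v] += 1
                if not in_U[u]:
                    continue
                queries += 1
                if rng.random() < p:       # positive answer: Y_i = 1
                    positives += 1
                    in_U[u] = False
                    unvisited_count -= 1
                    stack.append(u)
                    current_size += 1
                    found = True
            if not found:
                stack.pop()                # v moves from T to X
                if not stack:              # the epoch ends
                    component_sizes.append(current_size)
        longest_stack = max(longest_stack, len(stack))
        assert len(stack) <= 1 + positives  # |T| <= 1 + sum of Y_i
    return component_sizes, longest_stack, queries, positives

sizes, longest, queries, positives = dfs_explore(2000, 1.5 / 2000, seed=1)
print(max(sizes), longest, queries, positives)
```

Since $T$ always spans a path, `longest_stack` is the number of vertices on the longest path found by the exploration, and `max(sizes)` is the size of the largest component.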

The following lemma gives us the tool that we need to prove the theorem.

Lemma 3.3.

Let $\epsilon>0$ be a small enough constant. Consider a sequence $Y_{1},\dots,Y_{N}$ of i.i.d. Bernoulli random variables with parameter $p$.

1. Let $p=\frac{1-\epsilon}{n}$ and $k=\frac{7}{\epsilon^{2}}\ln n$. Then, with probability at least $1-O(1/\sqrt{n})$, there is no interval of length $kn$ in $[N]$ in which at least $k$ of the $Y_{i}$ are equal to 1.

2. Let $p=\frac{1+\epsilon}{n}$ and $N_{0}=\frac{\epsilon n^{2}}{2}$. Then,

$$\mathbb{P}\left[\left|\sum_{i=1}^{N_{0}}Y_{i}-\frac{\epsilon(1+\epsilon)n}{2}\right|<n^{2/3}\right]\geq 1-o(1).$$

Proof.

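We first prove 1. Consider an interval $I$ of length $kn$ in $[N]$. Let $Y=\sum_{i\in I}Y_{i}$. Notice that $\mathbb{E}\left[Y\right]=knp=(1-\epsilon)k$. Since $\frac{1}{1-\epsilon}\geq 1+\epsilon$, the multiplicative Chernoff bound with $\delta=\epsilon$ gives

$$\mathbb{P}\left[Y\geq k\right]=\mathbb{P}\left[Y\geq\frac{\mathbb{E}\left[Y\right]}{np}\right]=\mathbb{P}\left[Y\geq\frac{\mathbb{E}\left[Y\right]}{1-\epsilon}\right]\leq\mathbb{P}\left[Y\geq(1+\epsilon)\mathbb{E}\left[Y\right]\right]\leq\exp\left(-\frac{\epsilon^{2}\mathbb{E}\left[Y\right]}{2+\epsilon}\right)=n^{-\frac{7(1-\epsilon)}{2+\epsilon}},$$

where the last equality follows from $\mathbb{E}\left[Y\right]=(1-\epsilon)k$ and $k=\frac{7}{\epsilon^{2}}\ln n$. For $\epsilon\leq 1/5$ we have $\frac{7(1-\epsilon)}{2+\epsilon}\geq\frac{5}{2}$, so this probability is at most $n^{-5/2}$. Since there are at most $N\leq n^{2}$ such intervals, a union bound shows that the probability that some interval contains at least $k$ ones is at most $n^{-1/2}$, and the claim follows.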

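To prove 2, let $Y=\sum_{i=1}^{N_{0}}Y_{i}$ and $\mu=\mathbb{E}\left[Y\right]$. Then,

$$\mu=N_{0}\cdot p=\frac{(1+\epsilon)\epsilon n}{2}.$$

Now, again by the multiplicative Chernoff bound, applied to both tails with $\delta=\frac{2n^{-1/3}}{\epsilon(1+\epsilon)}$ (so that $\delta\mu=n^{2/3}$, and $\delta\leq 1$ for large enough $n$),

$$\mathbb{P}\left[\left|\sum_{i=1}^{N_{0}}Y_{i}-\frac{\epsilon(1+\epsilon)n}{2}\right|\geq n^{2/3}\right]\leq 2\exp(-\delta^{2}\mu/3)\leq 2\exp(-n^{1/3})=o(1),$$

where the second inequality uses $\delta^{2}\mu/3=\frac{2n^{1/3}}{3\epsilon(1+\epsilon)}$ and $\epsilon(1+\epsilon)\leq 2/3$.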

∎
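To get a feel for the magnitudes in the lemma, the following Python sketch evaluates the two bounds for concrete values. The function names and the sample values $n=10^{6}$, $\epsilon=0.1$ are hypothetical; the second function includes a factor 2 to account for both tails and assumes $\delta\leq 1$.

```python
import math

def lemma_part1_union_bound(n, eps):
    """Union bound over all intervals of length k*n (Part 1)."""
    k = 7 / eps**2 * math.log(n)
    p = (1 - eps) / n
    mean = k * n * p                              # E[Y] for one interval
    per_interval = math.exp(-eps**2 * mean / (2 + eps))
    num_intervals = n * (n - 1) // 2              # at most N = C(n, 2)
    return num_intervals * per_interval

def lemma_part2_tail_bound(n, eps):
    """Two-sided Chernoff bound for a deviation of n^(2/3) (Part 2)."""
    mu = eps * (1 + eps) * n / 2                  # E[Y] = N_0 * p
    delta = n ** (2 / 3) / mu                     # so that delta * mu = n^(2/3)
    return 2 * math.exp(-delta**2 * mu / 3)       # valid when delta <= 1

n, eps = 10**6, 0.1                               # hypothetical example values
print(lemma_part1_union_bound(n, eps), 1 / math.sqrt(n))
print(lemma_part2_tail_bound(n, eps))
```

For these values the first bound is far below $1/\sqrt{n}$ and the second is astronomically small, which matches the asymptotic statements.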

We are now ready to prove the theorem.

Part 1.

Assume to the contrary that $G$ contains a connected component $C$ with more than $k=\frac{7}{\epsilon^{2}}\ln n$ vertices. Let us look at the epoch of the DFS in which $C$ was created (an epoch is the period between two consecutive emptyings of the stack $T$). Consider the moment inside this epoch when the algorithm has found the $(k+1)$-st vertex of $C$ and is about to move it to $T$. Denote $X_{C}=X\cap C$ at that moment. Then $|X_{C}\cup T|=k$, and thus the algorithm got exactly $k$ positive answers to its queries to the random variables $Y_{i}$ during the epoch, with each positive answer being responsible for revealing a new vertex of $C$ after the first vertex of $C$ was put into $T$ at the beginning of the epoch. During the epoch only pairs of vertices touching $X_{C}\cup T$ have been queried, and the number of such pairs is at most ${k\choose 2}+k(n-k)\leq kn$. It follows that the sequence $Y$ contains an interval of length at most $kn$ with at least $k$ ones, which contradicts Property 1 of Lemma 3.3 (a property that holds whp).

Part 2.

Now, assume that the sequence $Y$ satisfies Property 2 of Lemma 3.3. We claim that after the first $N_{0}=\frac{\epsilon n^{2}}{2}$ queries of the DFS algorithm, the set $T$ contains at least $\frac{\epsilon^{2}n}{5}$ vertices (with the contents of $T$ forming a path of the desired length at that moment).

First observe that $|X|<\frac{n}{3}$ at time $N_{0}$. Indeed, if $|X|\geq\frac{n}{3}$, then let us look at a moment $t\leq N_{0}$ where $|X|=\frac{n}{3}$. At that moment $|T|\leq 1+\sum_{i=1}^{t}Y_{i}\leq 1+\frac{\epsilon(1+\epsilon)n}{2}+n^{2/3}<\frac{n}{3}$ by Property 2 of Lemma 3.3. Then $|U|=n-|X|-|T|\geq\frac{n}{3}$, and the algorithm has examined all $|X|\cdot|U|\geq\frac{n^{2}}{9}>N_{0}$ pairs between $X$ and $U$ (and found them to be non-edges), a contradiction.

Getting back to time $N_{0}$: assume for contradiction that $|T|<\frac{\epsilon^{2}n}{5}$. Since also $|X|<\frac{n}{3}$, we have $U\neq\varnothing$. This means in particular that the algorithm is still revealing connected components of $G$, and each positive answer it got resulted in moving a vertex from $U$ to $T$ (some of these vertices may have already moved further from $T$ to $X$). By Property 2 of Lemma 3.3 the number of positive answers at that point is at least $\frac{\epsilon(1+\epsilon)n}{2}-n^{2/3}$. Hence, we have $|X\cup T|\geq\frac{\epsilon(1+\epsilon)n}{2}-n^{2/3}$. Since $|T|<\frac{\epsilon^{2}n}{5}$, it follows that $|X|\geq\frac{\epsilon n}{2}+\frac{3\epsilon^{2}n}{10}-n^{2/3}$. Moreover, all $|X|\cdot|U|$ pairs of vertices between $X$ and $U$ have already been queried (and received a negative answer). It follows that

$$\begin{aligned}
\frac{\epsilon n^{2}}{2}=N_{0}&\geq|X|\cdot|U|\geq|X|\cdot\left(n-|X|-\frac{\epsilon^{2}n}{5}\right)\\
&\geq\left(\frac{\epsilon n}{2}+\frac{3\epsilon^{2}n}{10}-n^{2/3}\right)\cdot\left(n-\frac{\epsilon n}{2}-\frac{\epsilon^{2}n}{2}+n^{2/3}\right)\\
&\geq\frac{\epsilon n^{2}}{2}+\frac{\epsilon^{2}n^{2}}{20}-O(\epsilon^{3})n^{2}>\frac{\epsilon n^{2}}{2},
\end{aligned}$$

a contradiction. Here the third inequality uses $|X|<\frac{n}{3}$, so that the product $|X|\cdot\left(n-|X|-\frac{\epsilon^{2}n}{5}\right)$ is increasing in $|X|$, and the last one holds for small enough $\epsilon$ and large enough $n$.
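The final strict inequality is asymptotic: the $n^{2/3}$ terms contribute about $-n^{5/3}$, which must be small compared with $\frac{\epsilon^{2}n^{2}}{20}$. The short Python sketch below (the function name `part2_gap` and the sample values are hypothetical) evaluates the lower bound on $|X|\cdot|U|$ minus $N_{0}$ and shows that the gap only becomes positive once $n$ is large relative to $1/\epsilon$.

```python
def part2_gap(n, eps):
    """Lower bound on |X| * |U| minus N_0 from the last chain of inequalities.

    Assumes |X| < n/3, so the product is minimized at the lower bound on |X|.
    """
    n_two_thirds = n ** (2 / 3)
    n0 = eps * n * n / 2
    x_lower = eps * n / 2 + 3 * eps**2 * n / 10 - n_two_thirds
    u_lower = n - x_lower - eps**2 * n / 5
    return x_lower * u_lower - n0

for n in (10**9, 10**12, 10**15):      # hypothetical sample sizes
    print(n, part2_gap(n, 0.1) > 0)
```

With $\epsilon=0.1$ the gap is still negative at $n=10^{9}$ and positive at $n=10^{15}$, which illustrates why the theorem is stated for fixed small $\epsilon$ and $n\to\infty$.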