CSE 525: Randomized Algorithms Spring 2025 Lecture 7: Negative Correlation and Applications Lecturer: Shayan Oveis Gharan 04/29/2025
Disclaimer: These notes have not been subjected to the usual scrutiny reserved for formal publications.
7.1 Positive Association
Theorem 7.1 (The Four Functions Theorem).
Let be non-negative functions defined on subsets of . If for any two subsets we have
then, for every two families of subsets , we have
where .
The following theorem, known as the FKG inequality, is a direct consequence of the above theorem:
Definition 7.2 (Log-supermodular probability distributions).
We say a probability distribution is log-supermodular if for any , we have
This property is also known as Positive Lattice Condition.
For a concrete example, consider the family of Erdos-Reyni random graphs. In such a case, for any set we have
We claim that this distribution is log-supermodular. Cancelling out the normalizing constant , we need to check for any two sets , and
But this holds simply because .
Definition 7.3 (Increasing functions).
We say a function is increasing if for any such that we have
We say is a decreasing function if the above inequality holds in the reverse direction.
For a concrete example, notice for any , is increasing and is decreasing.
But, more generally, consider the domain of the set of all possible edges in a graph with vertices. Then, for ,
are increasing, but,
is decreasing.
Theorem 7.4 (FKG Inequality).
Let be a log-supermodular probability distribution. Then, for any two increasing functions we have
i.e., is positively associated.
Proof.
We use the Four functions theorem with
We claim that these four functions satisfy the assumption of the Four functions theorem. In particular, for any by log-supermodularity of we have
Therefore, letting , we conclude
On the other hand,
Putting them together proves the theorem. ∎
Note that the above inequality also holds if both are decreasing functions. In case that is increasing and is decreasing, the inequality holds just in the opposite direction.
Consequently, FKG theorem implies that any pair of elements are positively correlated in a log-supermodular probability distribution,
More interestingly, we can use it to prove the following fact about graphs:
Fact 7.5.
For any , let be a random Erdos-Reyni graph with parameter .
7.2 Negatively Correlated Random Variables
We say that a collection of random variables are negatively correlated if it holds that for any subset :
Note that if are independent, then this holds with equality.
Furthermore, we say are pairwise negatively correlated if for all ,
Theorem 7.6 (Chernoff for negatively correlated random variables).
. Suppose are negatively correlated Bernoulli random variables (instead of independent), then the conclusion of the multiplicative Chernoff bound still holds.
Proof.
To see this, note that the one place we used independence in the proof of the Chernoff bound is in the calculation: When ,
The main observation is that the above statement still holds except the last identity will be an inequality. So the rest of the proof of the Chernoff bound follows. In particular, when are negatively correlated we show
Let be independent Bernoulli random variables with for each and define . For any nonnegative integer ,
where the sum is over all non-negative integer vectors such that .
On the other hand, since are independent,
Putting these together we obtain, for every ,
| (7.1) |
Lastly, using the Taylor expansion
Applying (7.1) to every monomial above, we get
as desired. ∎
Definition 7.7 (Generating Polynomial).
It is natural to express a probability distribution over subsets of by its generating polynomial. To do that we consider variables, and write
where .
For a concrete example, let be independent Bernoulli random variables where has success probability . Then, we can write the corresponding generating polynomial as follows:
The following facts about the generating polynomial are straightforward:
Fact 7.8.
Let be a probability distribution over with generating polynomial , then
-
•
, i.e., sum of the coefficients of is 1.
-
•
, i.e., the marginals can be deduced by take partial derivatives.
-
•
are negatively correlated if
-
•
Say we have two probability distributions over disjoint sets, then the product distribution is the probability distribution with generating polynomial .
-
•
If are pairwise negatively correlated then so is .
Next, we explain a few examples of negatively correlated random variables:
Example 1: Observe that any probability distribuition over subsets of size (exactly) one among objects is negatively correlated, namely
where . Following the above fact, product of these distributions are also negatively correlated.
As an application, recall that in lecture 4, we introduced a probability distribution over paths connecting the -th terminal pairs where we chose one path with its probability and we independently run the procedure for every . It follows that the resulting probability distribution over the random variables is negatively correlated. So, we could have directly apply the Chernoff bound instead of defining a new family of random variables .
Example 2: Edges of a uniform spanning tree One of the most interesting family of negative correlated probability distributions is the distribution of the set of edges of a uniform spanning tree. Namely, let be a connected undirected graph; assign a variable to every edge , then is the distribution with the following generating polynomial,
We will discuss ideas to prove this fact in the next lecture.
7.3 Towards a theory of Negative dependence
One of the ongoing research directions in probability theory is to study under what conditions one can expect negative correlation and and negative association.
Following the above discussion, a natural choice is the reverse of the positive lattice condition, namely negative lattice condition:
Unfortunately, it can be seen that this property does not even imply a pairwise negative correlation property:
Example 7.9.
Consider the distribution over with the following generating polynomial,
This distribution satisfies the NLC but it not negative correlated as .
In the next lecture we will introduce strongly Rayleigh distribution as a generic method to study negative dependence.