Hypergraph removal lemma explained

In graph theory, the hypergraph removal lemma states that when a hypergraph contains few copies of a given sub-hypergraph, then all of the copies can be eliminated by removing a small number of hyperedges. It is a generalization of the graph removal lemma. The special case in which the graph is a tetrahedron is known as the tetrahedron removal lemma. It was first proved by Nagle, Rödl, Schacht and Skokan[1] and, independently, by Gowers.[2]

The hypergraph removal lemma can be used to prove results such as Szemerédi's theorem and the multi-dimensional Szemerédi theorem.

Statement

The hypergraph removal lemma states that for any

\varepsilon,r,m>0

, there exists

\delta=\delta(\varepsilon,r,m)>0

such that for any

r

-uniform hypergraph

H

with

m

vertices the following is true: if

G

is any

n

-vertex

r

-uniform hypergraph with at most

\deltanv(H)

subgraphs isomorphic to

H

, then it is possible to eliminate all copies of

H

from

G

by removing at most

\varepsilonnr

hyperedges from

G

.

An equivalent formulation is that, for any graph

G

with

o(nv(H))

copies of

H

, we can eliminate all copies of

H

from

G

by removing

o(nr)

hyperedges.

Proof idea of the hypergraph removal lemma

The high level idea of the proof is similar to that of graph removal lemma. We prove a hypergraph version of Szemerédi's regularity lemma (partition hypergraphs into pseudorandom blocks) and a counting lemma (estimate the number of hypergraphs in an appropriate pseudorandom block). The key difficulty in the proof is to define the correct notion of hypergraph regularity. There were multiple attempts[3] [4] [5] [6] [7] [8] [9] [10] [11] [12] to define "partition" and "pseudorandom (regular) blocks" in a hypergraph, but none of them are able to give a strong counting lemma. The first correct definition of Szemerédi's regularity lemma for general hypergraphs is given by Rödl et al.

In Szemerédi's regularity lemma, the partitions are performed on vertices (1-hyperedge) to regulate edges (2-hyperedge). However, for

k>2

, if we simply regulate

k

-hyperedges using only 1-hyperedge, we will lose information of all

j

-hyperedges in the middle where

1<j<k

, and fail to find a counting lemma.[13] The correct version has to partition

(k-1)

-hyperedges in order to regulate

k

-hyperedges. To gain more control of the

(k-1)

-hyperedges, we can go a level deeper and partition on

(k-2)

-hyperedges to regulate them, etc. In the end, we will reach a complex structure of regulating hyperedges.

Proof idea for 3-uniform hypergraphs

For example, we demonstrate an informal 3-hypergraph version of Szemerédi's regularity lemma, first given by Frankl and Rödl.[14] Consider a partition of edges

E(Kn)=

(2)
G
1\cup...\cup
(2)
G
l
such that for most triples

(i,j,k),

there are a lot of triangles on top of
(2)
\left(G
k\right).
We say that
(2)
\left(G
k\right)
is "pseudorandom" in the sense that for all subgraphs
(2)
A
i\subset
(2)
G
i
with not too few triangles on top of
(2)
\left(A
k\right),
we have
(2)
\left|d\left(G
k\right)

-

(2)
d\left(A
k\right)\right|\le\varepsilon,

where

d(X,Y,Z)

denotes the proportion of

3

-uniform hyperedge in

G(3)

among all triangles on top of

(X,Y,Z)

.

We then subsequently define a regular partition as a partition in which the triples of parts that are not regular constitute at most an

\varepsilon

fraction of all triples of parts in the partition.

In addition to this, we need to further regularize

(2)
G
1,

...,

(2)
G
l
via a partition of the vertex set. As a result, we have the total data of hypergraph regularity as follows:
  1. a partition of

E(Kn)

into graphs such that

G(3)

sits pseudorandomly on top;
  1. a partition of

V(G)

such that the graphs in (1) are extremely pseudorandom (in a fashion resembling Szemerédi's regularity lemma).

After proving the hypergraph regularity lemma, we can prove a hypergraph counting lemma. The rest of proof proceeds similarly to that of Graph removal lemma.

Proof of Szemerédi's theorem

Let

rk(N)

be the size of the largest subset of

\{1,\ldots,N\}

that does not contain a length

k

arithmetic progression. Szemerédi's theorem states that,

rk(N)=o(N)

for any constant

k

. The high level idea of the proof is that, we construct a hypergraph from a subset without any length

k

arithmetic progression, then use graph removal lemma to show that this graph cannot have too many hyperedges, which in turn shows that the original subset cannot be too big.

Let

A\subset\{1,\ldots,N\}

be a subset that does not contain any length

k

arithmetic progression. Let

M=k2N+1

be a large enough integer. We can think of

A

as a subset of

Z/MZ

. Clearly, if

A

doesn't have length

k

arithmetic progression in

Z

, it also doesn't have length

k

arithmetic progression in

Z/MZ

.

We will construct a

k

-partite

(k-1)

-uniform hypergraph

G

from

A

with parts

V1,V2,\ldots,Vk

, all of which are

M

element vertex sets indexed by

Z/MZ

. For each

1\lei\lek

, we add a hyperedge among vertices

(vj\inVj)j

} if and only if

\sumj(j-i)vj\inA.

Let

H

be the complete

k

-partite

(k-1)

-uniform hypergraph. If

G

contains an isomorphic copy of

H

with vertices

v1,\ldots,vk

, then

\alphai=\sumj(j-i)vj\inA

for any

1\lei\lej

. However, note that

\alphai

is a length

k

arithmetic progression with common difference

\alphai+1-\alphai=-\sumjvj

. Since

A

has no length

k

arithmetic progression, it must be the case that

\alpha1==\alphak

, so

\sumjvj=0

.

Thus, for each hyperedge

(vj\inVj)j

}, we can find a unique copy of

H

that this edge lies in by finding

vi=-\sumjvj

. The number of copies of

H

in

G

equals
1
k

e(G)=O(Nk-1)=o(Nk)

. Therefore, by the hypergraph removal lemma, we can remove

o(Nk-1)

edges to eliminate all copies of

H

in

G

. Since every hyperedge of

G

is in a unique copy of

H

, to eliminate all copies of

H

in

G

, we need to remove at least

e(G)/k

edges. Thus,

e(G)=o(Nk-1)

.

The number of hyperedges in

G

is

kMk-2|A|=o(Nk-1)

, which concludes that

|A|=o(N)

.

This method usually does not give a good quantitative bound, since the hidden constants in hypergraph removal lemma involves the inverse Ackermann function. For a better quantitive bound, Leng, Sah, and Sawhney proved that

|A|\le

N
\exp(-(loglog
ck
N)
)
for some constant

ck

depending on

k

.[15] It is the best bound for

k\ge5

so far.

Applications

S

of

Zr

, any

\delta>0

and any

n

large enough, any subset of

[n]r

of size at least

\deltanr

contains a subset of the form

aS+d

, that is, a dilated and translated copy of

S

. Corners theorem is a special case when

S=\{(0,0),(0,1),(1,0)\}

.

See also

Notes and References

  1. Rodl. V.. Nagle. B.. Skokan. J.. Schacht. M.. Kohayakawa. Y.. 2005-05-26. From The Cover: The hypergraph regularity method and its applications. Proceedings of the National Academy of Sciences. 102. 23. 8109–8113. 10.1073/pnas.0502771102. 15919821. 1149431. 0027-8424. 2005PNAS..102.8109R. free.
  2. Gowers. William. 2007-11-01. Hypergraph regularity and the multidimensional Szemerédi theorem. Annals of Mathematics. 166. 3. 897–946. 10.4007/annals.2007.166.897. 0003-486X. 2007arXiv0710.3032G. 0710.3032.
  3. Haviland. Julie. Thomason. Andrew. May 1989. Pseudo-random hypergraphs. Discrete Mathematics. 75. 1–3. 255–278. 10.1016/0012-365x(89)90093-9. 0012-365X. free.
  4. Chung. F. R. K.. Graham. R. L.. 1989-11-01. Quasi-random hypergraphs. Proceedings of the National Academy of Sciences. 86. 21. 8175–8177. 1989PNAS...86.8175C. 10.1073/pnas.86.21.8175. 0027-8424. 298241. 16594074. free.
  5. Chung. Fan R. K.. 1990. Quasi-random classes of hypergraphs. Random Structures and Algorithms. 1. 4. 363–382. 10.1002/rsa.3240010401. 1042-9832.
  6. Chung. F. R. K.. Graham. R. L.. 1990. Quasi-random hypergraphs. Random Structures and Algorithms. 1. 1. 105–124. 10.1002/rsa.3240010108. 1042-9832. 298241. 16594074.
  7. Chung. F. R. K.. Graham. R. L.. January 1991. Quasi-Random Set Systems. Journal of the American Mathematical Society. 4. 1. 151. 10.2307/2939258. 0894-0347. 2939258. free.
  8. Kohayakawa. Yoshiharu. Rödl. Vojtěch. Skokan. Jozef. February 2002. Hypergraphs, Quasi-randomness, and Conditions for Regularity. . Series A. 97. 2. 307–352. 10.1006/jcta.2001.3217. 0097-3165. free.
  9. Frieze. Alan. Kannan. Ravi. 1999-02-01. Quick Approximation to Matrices and Applications. Combinatorica. 19. 2. 175–220. 10.1007/s004930050052. 0209-9683.
  10. Czygrinow. Andrzej. Rödl. Vojtech. January 2000. An Algorithmic Regularity Lemma for Hypergraphs. SIAM Journal on Computing. 30. 4. 1041–1066. 10.1137/s0097539799351729. 0097-5397.
  11. Chung. Fan R.K.. 2007-07-05. Regularity lemmas for hypergraphs and quasi-randomness. Random Structures & Algorithms. 2. 2. 241–252. 10.1002/rsa.3240020208. 1042-9832.
  12. Frankl. P.. Rödl. V.. December 1992. The Uniformity Lemma for hypergraphs. Graphs and Combinatorics. 8. 4. 309–312. 10.1007/bf02351586. 0911-0119.
  13. Nagle. Brendan. Rödl. Vojtěch. 2003-07-17. Regularity properties for triple systems. Random Structures & Algorithms. 23. 3. 264–332. 10.1002/rsa.10094. 1042-9832.
  14. Frankl. Peter. Rödl. Vojtěch. 2002-02-07. Extremal problems on set systems. Random Structures & Algorithms. 20. 2. 131–164. 10.1002/rsa.10017. 1042-9832.
  15. 2402.17995 . James . Leng . Ashwin . Sah . Improved Bounds for Szemerédi's Theorem . 2024 . Sawhney . Mehtaab.
  16. SOLYMOSI. J.. March 2004. A Note on a Question of Erdős and Graham. Combinatorics, Probability and Computing. 13. 2. 263–267. 10.1017/s0963548303005959. 0963-5483.
  17. Bergelson. Vitaly. Leibman. Alexander. Ziegler. Tamar. February 2011. The shifted primes and the multidimensional Szemerédi and polynomial Van der Waerden theorems. Comptes Rendus Mathématique. 349. 3–4. 123–125. 10.1016/j.crma.2010.11.028. 1631-073X. 1007.1839.
  18. Furstenberg. H.. Katznelson. Y.. December 1991. A density version of the Hales-Jewett theorem. Journal d'Analyse Mathématique. 57. 1. 64–119. 10.1007/bf03041066. free. 0021-7670.