Set packing explained
Set packing is a classical NP-complete problem in computational complexity theory and combinatorics, and was one of Karp's 21 NP-complete problems. Suppose one has a finite set S and a list of subsets of S. Then, the set packing problem asks if some k subsets in the list are pairwise disjoint (in other words, no two of them share an element).
More formally, given a universe
and a family
of subsets of
, a
packing is a subfamily
of sets such that all sets in
are pairwise disjoint. The size of the packing is
. In the set packing
decision problem, the input is a pair
and an integer
; the question is whetherthere is a set packing of size
or more. In the set packing
optimization problem, the input is a pair
, and the task is to find a set packing that uses the most sets.
The problem is clearly in NP since, given
subsets, we can easily verify that they are pairwise disjoint in
polynomial time.
The optimization version of the problem, maximum set packing, asks for the maximum number of pairwise disjoint sets in the list. It is a maximization problem that can be formulated naturally as an integer linear program, belonging to the class of packing problems.
Integer linear program formulation
The maximum set packing problem can be formulated as the following integer linear program.
maximize |
| | (maximize the total number of subsets) |
subject to |
| for all
| (selected sets have to be pairwise disjoint) |
|
| for all
. | (every set is either in the set packing or not) | |
Complexity
The set packing problem is not only NP-complete, but its optimization version (general maximum set packing problem) has been proven as difficult to approximate as the maximum clique problem; in particular, it cannot be approximated within any constant factor.[1] The best known algorithm approximates it within a factor of
.
[2] The weighted variant can also be approximated as well.
[3] Packing sets with a bounded size
The problem does have a variant which is more tractable. Given any positive integer k≥3, the k-set packing problem is a variant of set packing in which each set contains at most k elements.
When k=1, the problem is trivial. When k=2, the problem is equivalent to finding a maximum cardinality matching, which can be solved in polynomial time.
For any k≥3, the problem is NP-hard, as it is more general than 3-dimensional matching. However, there are constant-factor approximation algorithms:
- Cygan[4] presented an algorithm that, for any ε>0, attains a (k+1+ε)/3 approximation. The run-time is polynomial in the number of sets and elements, but doubly-exponential in 1/ε.
- Furer and Yu[5] presented an algorithm that attains the same approximation, but with run-time singly-exponential in 1/ε.
Packing sets with a bounded degree
In another more tractable variant, if no element occurs in more than d of the subsets, the answer can be approximated within a factor of d. This is also true for the weighted version.
Related problems
Equivalent problems
Hypergraph matching is equivalent to set packing: the sets correspond to the hyperedges.
The independent set problem is also equivalent to set packing – there is a one-to-one polynomial-time reduction between them:
- Given a set packing problem on a collection
, build a graph where for each set
there is a vertex
, and there is an edge between
and
iff
. Every independent set of vertices in the generated graph corresponds to a set packing in
.
- Given an independent vertex set problem on a graph
, build a collection of sets where for each vertex
there is a set
containing all edges adjacent to
. Every set packing in the generated collection corresponds to an independent vertex set in
.
This is also a bidirectional PTAS reduction, and it shows that the two problems are equally difficult to approximate.
In the special case when each set contains at most k elements (the k-set packing problem), the intersection graph is (k+1)-claw-free. This is because, if a set intersects some k+1 sets, then at least two of these sets intersect, so there cannot be a (k+1)-claw. So Maximum Independent Set in claw-free graphs[6] can be seen as a generalization of Maximum k-Set Packing.
Special cases
Graph matching is a special case of set packing in which the size of all sets is 2 (the sets correspond to the edges). In this special case, a maximum-size matching can be found in polynomial time.
3-dimensional matching is a special case in which the size of all sets is 3, and in addition, the elements are partitioned into 3 colors and each set contains exactly one element of each color. This special case is still NP-hard, though it has better constant-factor approximation algorithms than the general case.
Other related problems
In the set cover problem, we are given a family
of subsets of a universe
, and the goal is to determine whether we can choose
t sets that together contain every element of
. These sets may overlap. The optimization version finds the minimum number of such sets. The maximum set packing need not cover every possible element.
In the exact cover problem, every element of
should be contained in
exactly one of the subsets. Finding such an exact cover is an
NP-complete problem, even in the special case in which the size of all sets is 3 (this special case is called
exact 3 cover or
X3C). However, if we create a
singleton set for each element of
S and add these to the list, the resulting problem is about as easy as set packing.
Karp originally showed set packing NP-complete via a reduction from the clique problem.
See also: Packing in a hypergraph.
References
- "set packing". Dictionary of Algorithms and Data Structures, editor Paul E. Black, National Institute of Standards and Technology. Note that the definition here is somewhat different.
- Steven S. Skiena. "Set Packing". The Algorithm Design Manual.
- Pierluigi Crescenzi, Viggo Kann, Magnús Halldórsson, Marek Karpinski and Gerhard Woeginger. "Maximum Set Packing". A compendium of NP optimization problems. Last modified March 20, 2000.
- Book: . 1979 . Computers and Intractability: A Guide to the Theory of NP-Completeness . W.H. Freeman . 978-0-7167-1045-5. Computers and Intractability: A Guide to the Theory of NP-Completeness . A3.1: SP3, pg.221.
- Book: Vazirani, Vijay V. . Vijay Vazirani . Approximation Algorithms . 2001 . Springer-Verlag . 978-3-540-65367-7 .
External links
Notes and References
- . See in particular p. 21: "Maximum clique (and therefore also maximum independent set and maximum set packing) cannot be approximated to within
unless NP ⊂ ZPP."
- Halldórsson . Magnus M.. Kratochvíl . Jan. Telle . Jan Arne. Independent sets with domination constraints. 25th International Colloquium on Automata, Languages and Programming. Lecture Notes in Computer Science. 1443. Springer-Verlag. 176–185. 1998.
- Halldórsson . Magnus M.. 1999. Approximations of weighted independent set and hereditary subset problems. 5th Annual International Conference on Computing and Combinatorics. Lecture Notes in Computer Science. 1627. Springer-Verlag. 261–270.
- Book: Cygan, Marek . 2013 IEEE 54th Annual Symposium on Foundations of Computer Science . Improved Approximation for 3-Dimensional Matching via Bounded Pathwidth Local Search . October 2013 . https://ieeexplore.ieee.org/document/6686187 . 509–518 . 10.1109/FOCS.2013.61. 1304.1424 . 978-0-7695-5135-7 . 14160646 .
- Fürer . Martin . Yu . Huiwen . Combinatorial Optimization . Approximating the -set packing problem by local improvements . 2014 . Fouilhoux . Pierre . Gouveia . Luis Eduardo Neves . Mahjoub . A. Ridha . Paschos . Vangelis T. . https://link.springer.com/chapter/10.1007/978-3-319-09174-7_35 . Lecture Notes in Computer Science . 8596 . en . Cham . Springer International Publishing . 408–420 . 10.1007/978-3-319-09174-7_35 . 978-3-319-09174-7. 15815885 .
- Neuwohner . Meike . Bläser . Markus . Monmege . Benjamin . 2106.03545 . An improved approximation algorithm for the maximum weight independent set problem in -claw free graphs . 10.4230/LIPICS.STACS.2021.53 . 53:1–53:20 . Schloss Dagstuhl – Leibniz-Zentrum für Informatik . LIPIcs . 38th International Symposium on Theoretical Aspects of Computer Science, STACS 2021, March 16–19, 2021, Saarbrücken, Germany (Virtual Conference) . 187 . 2021. free .