The study of facility location problems (FLP), also known as location analysis, is a branch of operations research and computational geometry concerned with the optimal placement of facilities to minimize transportation costs while considering factors like avoiding placing hazardous materials near housing, and competitors' facilities. The techniques also apply to cluster analysis.
A simple facility location problem is the Weber problem, in which a single facility is to be placed, with the only optimization criterion being the minimization of the weighted sum of distances from a given set of point sites. More complex problems considered in this discipline include the placement of multiple facilities, constraints on the locations of facilities, and more complex optimization criteria.
In a basic formulation, the facility location problem consists of a set of potential facility sites L where a facility can be opened, and a set of demand points D that must be serviced. The goal is to pick a subset F of facilities to open, to minimize the sum of distances from each demand point to its nearest facility, plus the sum of opening costs of the facilities.
The facility location problem on general graphs is NP-hard to solve optimally, by reduction from (for example) the set cover problem. A number of approximation algorithms have been developed for the facility location problem and many of its variants.
Without assumptions on the set of distances between clients and sites (in particular, without assuming that the distances satisfy the triangle inequality), the problem is known as non-metric facility location and can be approximated to within a factor O(log n).[1] This factor is tight, via an approximation-preserving reduction from the set cover problem.
If we assume distances between clients and sites are undirected and satisfy the triangle inequality, we are talking about a metric facility location (MFL) problem. The MFL is still NP-hard and hard to approximate within factor better than 1.463.[2] The currently best known approximation algorithm achieves approximation ratio of 1.488.[3]
The minimax facility location problem seeks a location which minimizes the maximum distance to the sites, where the distance from one point to the sites is the distance from the point to its nearest site. A formal definition is as follows:
Given a point set,find a point set so that is minimized.
In the case of the Euclidean metric for, it is known as the smallest enclosing sphere problem or 1-center problem. Its study traced at least to the year of 1860.
It has been proven that exact solution of k-center problem is NP hard.[4] [5] [6] Approximation to the problem was found to be also NP hard when the error is small. The error level in the approximation algorithm is measured as an approximation factor, which is defined as the ratio between the approximation and the optimum. It's proved that the k-center problem approximation is NP hard when approximation factor is less than 1.822 (dimension = 2) or 2 (dimension > 2).[6]
Exact solver
There exist algorithms to produce exact solutions to this problem. One exact solver runs in time
nO(\sqrt{k)}
1 + ε approximation
1 + ε approximation is to find a solution with approximation factor no greater than 1 + ε. This approximation is NP hard as ε is arbitrary. One approach based on the coreset concept is proposed with execution complexity of
O(klogk/\varepsilon2) | |
O(2 |
dn)
O(kn)
Farthest-point clustering
For the hardness of the problem, it's impractical to get an exact solution or precise approximation. Instead, an approximation with factor = 2 is widely used for large k cases. The approximation is referred to as the farthest-point clustering (FPC) algorithm, or farthest-first traversal.[6] The algorithm is quite simple: pick any point from the set as one center; search for the farthest point from remaining set as another center; repeat the process until k centers are found.
It is easy to see that this algorithm runs in linear time. As approximation with factor less than 2 is proved to be NP hard, FPC was regarded as the best approximation one can find.
As per the performance of execution, the time complexity is later improved to O(n log k) with box decomposition technique.
The maxmin facility location or obnoxious facility location problem seeks a location which maximizes the minimum distance to the sites. In the case of the Euclidean metric, it is known as the largest empty sphere problem. The planar case (largest empty circle problem) may be solved in optimal time Θ(n log n).[7] [8]
Facility location problems are often solved as integer programs. In this context, facility location problems are often posed as follows: suppose there are
n
m
n
m
fi
i
i=1,...,n
cij
i
j
i=1,...,n
j=1,...,m
dj
j
j=1,...,m
ui
i
ui
i
In our initial formulation, introduce a binary variable
xi
i=1,...,n
xi=1
i
xi=0
yij
i=1,...,n
j=1,...,m
dj
i
Note that the second set of constraints ensures that if
xi=0
i
yij=0
j
i
A common case of the capacitated facility location problem above is the case when
ui=+infty
i=1,...,n
j
yij
zij
zij=1
j
i
zij=0
where
M
M
M=m
xi=1
zij
Another formulation possibility for the uncapacitated facility location problem is to disaggregate the capacity constraints (the big-
M
M
In healthcare, incorrect facility location decisions have a serious impact on the community beyond simple cost and service metrics; for instance, hard-to-access healthcare facilities are likely to be associated with increased morbidity and mortality. From this perspective, facility location modeling for healthcare is more critical than similar modeling for other areas.[11]
Municipal solid waste management still remains a challenge for developing countries because of increasing waste production and high costs associated with waste management. Through the formulation and exact resolution of a facility location problem it is possible to optimize the location of landfills for waste disposal.[12]
A particular subset of cluster analysis problems can be viewed as facility location problems. In a centroid-based clustering problem, the objective is to partition
n
To see how one might view (read "transform" or "reduce") a centroid-based clustering problem as a (metric) facility location problem, view each data point in the former as a demand point in the latter. Suppose that the data to be clustered are elements of a metric space
M
M
p
p
M
L
c\ell,
k
L'\subseteqL
k
k
d
\ell*
d
\ell*:=argmin\ell\{c\ell,\}
c\ell,
The popular algorithms textbook Algorithm Design [14] provides a related problem-description and an approximation algorithm. The authors refer to the metric facility location problem (i.e. the centroid-based clustering problem or the metric
k
Furthermore, see that in our above definition of the facility location problem that the objective function
f
f