Collision detection explained

Collision detection is the computational problem of detecting an intersection of two or more objects in virtual space. More precisely, it deals with the questions of if, when and where two or more objects intersect. Collision detection is a classic problem of computational geometry with applications in computer graphics, physical simulation, video games, robotics (including autonomous driving) and computational physics. Collision detection algorithms can be divided into operating on 2D or 3D spatial objects.[1]

Overview

Collision detection is closely linked to calculating the distance between objects, as two objects (or more) intersect when the distance between them reaches zero or even becomes negative.[2] Negative distance indicates that one object has penetrated another. Performing collision detection requires more context than just the distance between the objects.

Accurately identifying the points of contact on both objects' surfaces is also essential for the computation of a physically accurate collision response. The complexity of this task increases with the level of detail in the objects' representations: the more intricate the model, the greater the computational cost.[3]

Collision detection frequently involves dynamic objects, adding a temporal dimension to distance calculations. Instead of simply measuring distance between static objects, collision detection algorithms often aim to determine whether the objects’ motion will bring them to a point in time when their distance is zero—an operation that adds significant computational overhead.[4]

In collision detection involving multiple objects, a naive approach would require detecting collisions for all pairwise combinations of objects. As the number of objects increases, the number of required comparisons grows rapidly: for

n

objects, / intersection tests are needed with a naive approach. This quadratic growth makes such an approach computationally expensive as

n

increases.[5]

Due to the complexity mentioned above, collision detection is computationally intensive process. Nevertheless, it is essential for interactive applications like video games, robotics, and real-time physics engines. To manage these computational demands, extensive efforts have gone into optimizing collision detection algorithms.

A commonly used approach towards accelerating the required computations is to divide the process into two phases: the broad phase and the narrow phase.[6] The broad phase aims to answer the question of whether objects might collide, using a conservative but efficient approach to rule out pairs that clearly do not intersect, thus avoiding unnecessary calculations.

Objects that cannot be definitively separated in the broad phase are passed to the narrow phase. Here, more precise algorithms determine whether these objects actually intersect. If they do, the narrow phase often calculates the exact time and location of the intersection.

Broad phase

This phase aims at quickly finding objects or parts of objects for which it can be quickly determined that no further collision test is needed. A useful property of such approach is that it is output sensitive. In the context of collision detection this means that the time complexity of the collision detection is proportional to the number of objects that are close to each other. An early example of that is the I-COLLIDE where the number of required narrow phase collision tests was

O(n+m)

where

n

is the number of objects and

m

is the number of objects at close proximity. This is a significant improvement over the quadratic complexity of the naive approach.

Spatial partitioning

Several approaches can grouped under the spatial partitioning umbrella, which includes octrees (for 3D), quadtrees (for 2D) binary space partitioning (or BSP trees) and other, similar approaches. If one splits space into a number of simple cells, and if two objects can be shown not to be in the same cell, then they need not be checked for intersection. Dynamic scenes and deformable objects require updating the partitioning which can add overhead.

Bounding volume hierarchy

Bounding Volume Hierarchy (BVH) a tree structure over a set of bounding volumes. Collision is determined by doing a tree traversal starting from the root. If the bounding volume of the root doesn't intersect with the object of interest, the traversal can be stopped. If, however there is an intersection, the traversal proceeds and checks the branches for each there is an intersection. Branches for which there is no intersection with the bounding volume can be culled from further intersection test. Therefore multiple objects can be determined to not intersect at once. BVH can be used with deformable objects such as cloth or soft-bodies but the volume hierarchy has to be adjusted as the shape deforms. For deformable objects we need to be concerned about self-collisions or self intersections. BVH can be used for that end as well. Collision between two objects is computed by computing intersection between the bounding volumes of the root of the tree as there are collision we dive into the sub-trees that intersect. Exact collisions between the actual objects, or its parts (often triangles of a triangle mesh) need to be computed only between intersecting leaves.[7] The same approach works for pair wise collision and self-collisions.

Exploiting temporal coherence

During the broad-phase, when the objects in the world move or deform, the data-structures used to cull collisions have to be updated. In cases where the changes between two frames or time-steps are small and the objects can approximated well with an axis-aligned bounding boxes the sweep and prune algorithm can be a suitable approach.

Several key observation make the implementation efficient: Two bounding-boxes intersect if, and only if, there is overlap along all three axes; overlap can be determined, for each axis separately, by sorting the intervals for all the boxes; and lastly, between two frames updates are typically small (making sorting algorithms optimized for almost-sorted lists suitable for this application). The algorithm keeps track of currently intersecting boxes, and as objects moves, re-sorting the intervals helps keep track of the status.[8]

Pairwise pruning

Once we've selected a pair of physical bodies for further investigation, we need to check for collisions more carefully. However, in many applications, individual objects (if they are not too deformable) are described by a set of smaller primitives, mainly triangles. So now, we have two sets of triangles,

S={S1,S2,...,Sn}

and

T={T1,T2,...,Tn}

(for simplicity, we will assume that each set has the same number of triangles.)

The obvious thing to do is to check all triangles

Sj

against all triangles

Tk

for collisions, but this involves

n2

comparisons, which is highly inefficient. If possible, it is desirable to use a pruning algorithm to reduce the number of pairs of triangles we need to check.

The most widely used family of algorithms is known as the hierarchical bounding volumes method. As a preprocessing step, for each object (in our example,

S

and

T

) we will calculate a hierarchy of bounding volumes. Then, at each time step, when we need to check for collisions between

S

and

T

, the hierarchical bounding volumes are used to reduce the number of pairs of triangles under consideration. For simplicity, we will give an example using bounding spheres, although it has been noted that spheres are undesirable in many cases.

If

E

is a set of triangles, we can pre-calculate a bounding sphere

B(E)

. There are many ways of choosing

B(E)

, we only assume that

B(E)

is a sphere that completely contains

E

and is as small as possible.

Ahead of time, we can compute

B(S)

and

B(T)

. Clearly, if these two spheres do not intersect (and that is very easy to test), then neither do

S

and

T

. This is not much better than an n-body pruning algorithm, however.

If

E={E1,E2,...,Em}

is a set of triangles, then we can split it into two halves

L(E):={E1,E2,...,Em/2

} and

R(E):={Em/2+1,...,Em-1,Em}

. We can do this to

S

and

T

, and we can calculate (ahead of time) the bounding spheres

B(L(S)),B(R(S))

and

B(L(T)),B(R(T))

. The hope here is that these bounding spheres are much smaller than

B(S)

and

B(T)

. And, if, for instance,

B(S)

and

B(L(T))

do not intersect, then there is no sense in checking any triangle in

S

against any triangle in

L(T)

.

As a precomputation, we can take each physical body (represented by a set of triangles) and recursively decompose it into a binary tree, where each node

N

represents a set of triangles, and its two children represent

L(N)

and

R(N)

. At each node in the tree, we can pre-compute the bounding sphere

B(N)

.

When the time comes for testing a pair of objects for collision, their bounding sphere tree can be used to eliminate many pairs of triangles.

Many variants of the algorithms are obtained by choosing something other than a sphere for

B(T)

. If one chooses axis-aligned bounding boxes, one gets AABBTrees. Oriented bounding box trees are called OBBTrees. Some trees are easier to update if the underlying object changes. Some trees can accommodate higher order primitives such as splines instead of simple triangles.

Narrow phase

Objects that cannot be definitively separated in the broad phase are passed to the narrow phase. In this phase, the objects under consideration are relatively close to each other. Still, attempts to quickly determine if a full intersection is needed are employed first. This step is sometimes referred to as mid-phase. Once these tests passed (e.g the pair of objects may be colliding) more precise algorithms determine whether these objects actually intersect. If they do, the narrow phase often calculates the exact time and location of the intersection.

Bounding volumes

A quick way to potentially avoid a needless expensive computation is to check if the bounding volume enclosing the two objects intersect. If they don't, there is not need to check the actual objects. However, if the bounding volume intersect, the more expensive computation has to be performed. In order for the bounding-volume test to add value, two properties need to be balanced: a) the cost of intersecting the bounding volume needs to be low and b) the bounding volume needs to be tight enough so that the number of 'false positive' intersection will be low. A false positive intersection in this case means that the bounding volume intersect but the actual objects do not. Different bounding volume types offer different trade-offs for these properties.

Axis-Align Bounding Boxes (AABB) and cuboids are popular due to their simplicity and quick intersection tests. [9] Bounding volumes such as Oriented Bounding Boxes (OBB), K-DOPs and Convex-hulls offer a tighter approximation of the enclosed shape at the expense of a more elaborate intersection test.

Bounding volumes are typically used in the early (pruning) stage of collision detection, so that only objects with overlapping bounding volumes need be compared in detail.[10] Computing collision or overlap between bounding volumes involves additional computations, therefore, in order for it to beneficial we need the bounding volume to be relatively tight and the computation overhead to due the collisions to be low.

Exact pairwise collision detection

Objects for which pruning approaches could not rule out the possibility of a collision have to undergo an exact collision detection computation.

Collision detection between convex objects

According to the separating planes theorem, for any two disjoint convex objects, there exists a plane so that one object lies completely on one side of that plane, and the other object lies on the opposite side of that plane. This property allows the development of efficient collision detection algorithms between convex objects. Several algorithms are available for finding the closest points on the surface of two convex polyhedral objects - and determining collision. Early work by Ming C. Lin[11] that used a variation on the simplex algorithm from linear programming and the Gilbert-Johnson-Keerthi distance algorithm[12] are two such examples. These algorithms approach constant time when applied repeatedly to pairs of stationary or slow-moving objects, and every step is initialized from the previous collision check.

The result of all this algorithmic work is that collision detection can be done efficiently for thousands of moving objects in real time on typical personal computers and game consoles.

A priori pruning

Where most of the objects involved are fixed, as is typical of video games, a priori methods using precomputation can be used to speed up execution.

Pruning is also desirable here, both n-body pruning and pairwise pruning, but the algorithms must take time and the types of motions used in the underlying physical system into consideration.

When it comes to the exact pairwise collision detection, this is highly trajectory dependent, and one almost has to use a numerical root-finding algorithm to compute the instant of impact.

As an example, consider two triangles moving in time

{v1(t),v2(t),v3(t)}

and

{v4(t),v5(t),v6(t)}

. At any point in time, the two triangles can be checked for intersection using the twenty planes previously mentioned. However, we can do better, since these twenty planes can all be tracked in time. If

P(u,v,w)

is the plane going through points

u,v,w

in

R3

then there are twenty planes

P(vi(t),vj(t),vk(t))

to track. Each plane needs to be tracked against three vertices, this gives sixty values to track. Using a root finder on these sixty functions produces the exact collision times for the two given triangles and the two given trajectory. We note here that if the trajectories of the vertices are assumed to be linear polynomials in

t

then the final sixty functions are in fact cubic polynomials, and in this exceptional case, it is possible to locate the exact collision time using the formula for the roots of the cubic. Some numerical analysts suggest that using the formula for the roots of the cubic is not as numerically stable as using a root finder for polynomials.

Triangle centroid segments

A triangle mesh object is commonly used in 3D body modeling. Normally the collision function is a triangle to triangle intercept or a bounding shape associated with the mesh. A triangle centroid is a center of mass location such that it would balance on a pencil tip. The simulation need only add a centroid dimension to the physics parameters. Given centroid points in both object and target it is possible to define the line segment connecting these two points.

The position vector of the centroid of a triangle is the average of the position vectors of its vertices. So if its vertices have Cartesian coordinates

(x1,y1,z1)

,

(x2,y2,z2)

and

(x3,y3,z3)

then the centroid is
\left((x1+x2+x3),
3
(y1+y2+y3),
3
(z1+z2+z3)
3

\right)

.

Here is the function for a line segment distance between two 3D points.

distance=\sqrt{(z2-

2
z
1)

+(x2-

2
x
1)

+(y2-

2}
y
1)

Here the length/distance of the segment is an adjustable "hit" criteria size of segment. As the objects approach the length decreases to the threshold value. A triangle sphere becomes the effective geometry test. A sphere centered at the centroid can be sized to encompass all the triangle's vertices.

Usage

Collision detection in computer simulation

Physical simulators differ in the way they react on a collision. Some use the softness of the material to calculate a force, which will resolve the collision in the following time steps like it is in reality. This is very CPU intensive for low softness materials. Some simulators estimate the time of collision by linear interpolation, roll back the simulation, and calculate the collision by the more abstract methods of conservation laws.

Some iterate the linear interpolation (Newton's method) to calculate the time of collision with a much higher precision than the rest of the simulation. Collision detection utilizes time coherence to allow even finer time steps without much increasing CPU demand, such as in air traffic control.

After an inelastic collision, special states of sliding and resting can occur and, for example, the Open Dynamics Engine uses constraints to simulate them. Constraints avoid inertia and thus instability. Implementation of rest by means of a scene graph avoids drift.

In other words, physical simulators usually function one of two ways: where the collision is detected a posteriori (after the collision occurs) or a priori (before the collision occurs). In addition to the a posteriori and a priori distinction, almost all modern collision detection algorithms are broken into a hierarchy of algorithms. Often the terms "discrete" and "continuous" are used rather than a posteriori and a priori.

A posteriori (discrete) versus a priori (continuous)

In the a posteriori case, the physical simulation is advanced by a small step, then checked to see if any objects are intersecting or visibly considered intersecting. At each simulation step, a list of all intersecting bodies is created, and the positions and trajectories of these objects are "fixed" to account for the collision. This method is called a posteriori because it typically misses the actual instant of collision, and only catches the collision after it has actually happened.

In the a priori methods, there is a collision detection algorithm which will be able to predict very precisely the trajectories of the physical bodies. The instants of collision are calculated with high precision, and the physical bodies never actually interpenetrate. This is called a priori because the collision detection algorithm calculates the instants of collision before it updates the configuration of the physical bodies.

The main benefits of the a posteriori methods are as follows. In this case, the collision detection algorithm need not be aware of the myriad of physical variables; a simple list of physical bodies is fed to the algorithm, and the program returns a list of intersecting bodies. The collision detection algorithm doesn't need to understand friction, elastic collisions, or worse, nonelastic collisions and deformable bodies. In addition, the a posteriori algorithms are in effect one dimension simpler than the a priori algorithms. An a priori algorithm must deal with the time variable, which is absent from the a posteriori problem.

On the other hand, a posteriori algorithms cause problems in the "fixing" step, where intersections (which aren't physically correct) need to be corrected. Moreover, if the discrete step is too large, the collision could go undetected, resulting in an object which passes through another if it is sufficiently fast or small.

The benefits of the a priori algorithms are increased fidelity and stability. It is difficult (but not completely impossible) to separate the physical simulation from the collision detection algorithm. However, in all but the simplest cases, the problem of determining ahead of time when two bodies will collide (given some initial data) has no closed form solution—a numerical root finder is usually involved.

Some objects are in resting contact, that is, in collision, but neither bouncing off, nor interpenetrating, such as a vase resting on a table. In all cases, resting contact requires special treatment: If two objects collide (a posteriori) or slide (a priori) and their relative motion is below a threshold, friction becomes stiction and both objects are arranged in the same branch of the scene graph.

Video games

Video games have to split their very limited computing time between several tasks. Despite this resource limit, and the use of relatively primitive collision detection algorithms, programmers have been able to create believable, if inexact, systems for use in games.

For a long time, video games had a very limited number of objects to treat, and so checking all pairs was not a problem. In two-dimensional games, in some cases, the hardware was able to efficiently detect and report overlapping pixels between sprites on the screen.[13] In other cases, simply tiling the screen and binding each sprite into the tiles it overlaps provides sufficient pruning, and for pairwise checks, bounding rectangles or circles called hitboxes are used and deemed sufficiently accurate.

Three-dimensional games have used spatial partitioning methods for

n

-body pruning, and for a long time used one or a few spheres per actual 3D object for pairwise checks. Exact checks are very rare, except in games attempting to simulate reality closely. Even then, exact checks are not necessarily used in all cases.

Because games do not need to mimic actual physics, stability is not as much of an issue. Almost all games use a posteriori collision detection, and collisions are often resolved using very simple rules. For instance, if a character becomes embedded in a wall, they might be simply moved back to their last known good location. Some games will calculate the distance the character can move before getting embedded into a wall, and only allow them to move that far.

In many cases for video games, approximating the characters by a point is sufficient for the purpose of collision detection with the environment. In this case, binary space partitioning trees provide a viable, efficient and simple algorithm for checking if a point is embedded in the scenery or not. Such a data structure can also be used to handle "resting position" situation gracefully when a character is running along the ground. Collisions between characters, and collisions with projectiles and hazards, are treated separately.

A robust simulator is one that will react to any input in a reasonable way. For instance, if we imagine a high speed racecar video game, from one simulation step to the next, it is conceivable that the cars would advance a substantial distance along the race track. If there is a shallow obstacle on the track (such as a brick wall), it is not entirely unlikely that the car will completely leap over it, and this is very undesirable. In other instances, the "fixing" that posteriori algorithms require isn't implemented correctly, resulting in bugs that can trap characters in walls or allow them to pass through them and fall into an endless void where there may or may not be a deadly bottomless pit, sometimes referred to as "black hell", "blue hell", or "green hell", depending on the predominant color. These are the hallmarks of a failing collision detection and physical simulation system. is an infamous example of a game with a failing or possibly missing collision detection system.

Hitbox

A hitbox is an invisible shape commonly used in video games for real-time collision detection; it is a type of bounding box. It is often a rectangle (in 2D games) or cuboid (in 3D) that is attached to and follows a point on a visible object (such as a model or a sprite). Circular or spheroidial shapes are also common, though they are still most often called "boxes". It is common for animated objects to have hitboxes attached to each moving part to ensure accuracy during motion.[14]

Hitboxes are used to detect "one-way" collisions such as a character being hit by a punch or a bullet. They are unsuitable for the detection of collisions with feedback (e.g. bumping into a wall) due to the difficulty experienced by both humans and AI in managing a hitbox's ever-changing locations; these sorts of collisions are typically handled with much simpler axis-aligned bounding boxes instead. Players may use the term "hitbox" to refer to these types of interactions regardless.

A hurtbox is a hitbox used to detect incoming sources of damage. In this context, the term hitbox is typically reserved for those which deal damage. For example, an attack may only land if the hitbox around an attacker's punch connects with one of the opponent's hurtboxes on their body, while opposing hitboxes colliding may result in the players trading or cancelling blows, and opposing hurtboxes do not interact with each other. The term is not standardized across the industry; some games reverse their definitions of hitbox and hurtbox, while others only use "hitbox" for both sides.

See also

External links

Notes and References

  1. Collision Detection for Deformable Objects. 2005. 10.1111/j.1467-8659.2005.00829.x. Teschner. M.. Kimmerle. S.. Heidelberger. B.. Zachmann. G.. Raghupathi. L.. Fuhrmann. A.. Cani. M.-P.. Faure. F.. Magnenat-Thalmann. N.. Strasser. W.. Volino. P.. Computer Graphics Forum. 24. 61–81. 1359430. 10.1.1.58.2505.
  2. Book: Handbook of discrete and computational geometry . 2018 . CRC Press, Taylor & Francis Group, a Chapman & Hall book . 978-1-4987-1139-5 . Goodman . Jacob E. . 3rd . Discrete mathematics and its applications . Boca Raton London New York . 39 . O'Rourke . Joseph . Tóth . Csaba D..
  3. Book: Andrews . Sheldon . Erleben . Kenny . Ferguson . Zachary . Contact and friction simulation for computer graphics . 2022-08-02 . ACM SIGGRAPH 2022 Courses . https://dl.acm.org/doi/10.1145/3532720.3535640 . en . ACM . 1–172 . 10.1145/3532720.3535640 . 978-1-4503-9362-1.
  4. Book: Hadap . Sunil . Eberle . Dave . Volino . Pascal . Lin . Ming C. . Redon . Stephane . Ericson . Christer . Collision detection and proximity queries . 2004-08-08 . ACM SIGGRAPH 2004 Course Notes . https://dl.acm.org/doi/10.1145/1103900.1103915 . en . ACM . 15 . 10.1145/1103900.1103915 . 978-1-4503-7801-7.
  5. Book: Cohen . Jonathan D. . Lin . Ming C. . Manocha . Dinesh . Ponamgi . Madhav . I-COLLIDE: An interactive and exact collision detection system for large-scale environments . 1995 . Proceedings of the 1995 symposium on Interactive 3D graphics - SI3D '95 . http://portal.acm.org/citation.cfm?doid=199404.199437 . en . ACM Press . 189–ff . 10.1145/199404.199437 . 978-0-89791-736-0.
  6. Book: Akenine-Möller . Tomas . Real-time rendering . Haines . Eric . Hoffman . Naty . Pesce . Angelo . Iwanicki . Michał . Hillaire . Sébastien . 2018 . CRC Press, Taylor & Francis Group . 978-1-138-62700-0 . 4th . An A K Peters book . Boca Raton London New York.
  7. Klosowski . James T . Held . Martin . Mitchell . Joseph S.B. . Sowizral . Henry . Zikan . Karel . 1998 . Efficient collision detection using bounding volume hierarchies of k-DOPs . IEEE Transactions on Visualization and Computer Graphics . IEEE . 4 . 1 . 21–36 . 10.1109/2945.675649.
  8. Book: Ericson, Christer . Real-time collision detection . 22 December 2004. Elsevier . 978-1-55860-732-3 . Nachdr. . Morgan Kaufmann series in interactive 3D technology . Amsterdam Heidelberg . 329–338.
  9. Web site: Caldwell, Douglas R. . 2005-08-29 . Unlocking the Mysteries of the Bounding Box . dead . https://web.archive.org/web/20120728180104/http://www.stonybrook.edu/libmap/coordinates/seriesa/no2/a2.htm . 2012-07-28 . 2014-05-13 . US Army Engineer Research & Development Center, Topographic Engineering Center, Research Division, Information Generation and Management Branch.
  10. Gan B, Dong Q . 2022 . An improved optimal algorithm for collision detection of hybrid hierarchical bounding box . Evolutionary Intelligence . 15 . 4 . 2515–2527 . 10.1007/s12065-020-00559-6.
  11. Web site: Lin, Ming C . 1993 . Efficient Collision Detection for Animation and Robotics (thesis) . https://web.archive.org/web/20140728124049/https://wwwx.cs.unc.edu/~geom/papers/documents/dissertations/lin93.pdf . 2014-07-28 . University of California, Berkeley.
  12. Gilbert . E.G. . Johnson . D.W. . Keerthi . S.S. . 1988 . A fast procedure for computing the distance between complex objects in three-dimensional space . IEEE Journal on Robotics and Automation . 4 . 2 . 193–203 . 10.1109/56.2083.
  13. Web site: Components of the Amiga: The MC68000 and the Amiga Custom Chips. Chapter 1. Reference manual. https://web.archive.org/web/20180717093216/http://amigadev.elowar.com/read/ADCD_2.1/Hardware_Manual_guide/node0004.html#line95. 2018-07-17. live. 2018-07-17. Additionally, you can use system hardware to detect collisions between objects and have your program react to such collisions.. 2.1.
  14. Web site: Hitbox. Valve Developer Community. Valve. 18 September 2011.