First-class constraint should not be confused with Primary constraint.
In physics, a first-class constraint is a dynamical quantity in a constrained Hamiltonian system whose Poisson bracket with all the other constraints vanishes on the constraint surface in phase space (the surface implicitly defined by the simultaneous vanishing of all the constraints). To calculate the first-class constraint, one assumes that there are no second-class constraints, or that they have been calculated previously, and their Dirac brackets generated.[1]
First- and second-class constraints were introduced by as a way of quantizing mechanical systems such as gauge theories where the symplectic form is degenerate.[2]
The terminology of first- and second-class constraints is confusingly similar to that of primary and secondary constraints, reflecting the manner in which these are generated. These divisions are independent: both first- and second-class constraints can be either primary or secondary, so this gives altogether four different classes of constraints.
Consider a Poisson manifold M with a smooth Hamiltonian over it (for field theories, M would be infinite-dimensional).
Suppose we have some constraints
fi(x)=0,
\{fi
n | |
\} | |
i=1 |
These will only be defined chartwise in general. Suppose that everywhere on the constrained set, the n derivatives of the n functions are all linearly independent and also that the Poisson brackets
\{fi,fj\}
\{fi,H\}
This means we can write
\{fi,fj\}=\sumk
k | |
c | |
ij |
fk
k | |
c | |
ij |
\{fi,H\}=\sumj
j | |
v | |
i |
fj
j | |
v | |
i |
This can be done globally, using a partition of unity. Then, we say we have an irreducible first-class constraint (irreducible here is in a different sense from that used in representation theory).
For a more elegant way, suppose given a vector bundle over
lM
n
V
\nablaf
TlM
V
g
(\Deltaf)g
The ordinary Poisson bracket is only defined over
Cinfty(M)
Assume also that under this Poisson bracket,
\{f,f\}=0
\{g,g\}=0
\{f,H\}=0
What does it all mean intuitively? It means the Hamiltonian and constraint flows all commute with each other on the constrained subspace; or alternatively, that if we start on a point on the constrained subspace, then the Hamiltonian and constraint flows all bring the point to another point on the constrained subspace.
Since we wish to restrict ourselves to the constrained subspace only, this suggests that the Hamiltonian, or any other physical observable, should only be defined on that subspace. Equivalently, we can look at the equivalence class of smooth functions over the symplectic manifold, which agree on the constrained subspace (the quotient algebra by the ideal generated by the 's, in other words).
The catch is, the Hamiltonian flows on the constrained subspace depend on the gradient of the Hamiltonian there, not its value. But there's an easy way out of this.
Look at the orbits of the constrained subspace under the action of the symplectic flows generated by the 's. This gives a local foliation of the subspace because it satisfies integrability conditions (Frobenius theorem). It turns out if we start with two different points on a same orbit on the constrained subspace and evolve both of them under two different Hamiltonians, respectively, which agree on the constrained subspace, then the time evolution of both points under their respective Hamiltonian flows will always lie in the same orbit at equal times. It also turns out if we have two smooth functions A1 and B1, which are constant over orbits at least on the constrained subspace (i.e. physical observables) (i.e. ==0 over the constrained subspace)and another two A2 and B2, which are also constant over orbits such that A1 and B1 agrees with A2 and B2 respectively over the restrained subspace, then their Poisson brackets and are also constant over orbits and agree over the constrained subspace.
In general, one cannot rule out "ergodic" flows (which basically means that an orbit is dense in some open set), or "subergodic" flows (which an orbit dense in some submanifold of dimension greater than the orbit's dimension). We can't have self-intersecting orbits.
For most "practical" applications of first-class constraints, we do not see such complications: the quotient space of the restricted subspace by the f-flows (in other words, the orbit space) is well behaved enough to act as a differentiable manifold, which can be turned into a symplectic manifold by projecting the symplectic form of M onto it (this can be shown to be well defined). In light of the observation about physical observables mentioned earlier, we can work with this more "physical" smaller symplectic manifold, but with 2n fewer dimensions.
In general, the quotient space is a bit difficult to work with when doing concrete calculations (not to mention nonlocal when working with diffeomorphism constraints), so what is usually done instead is something similar. Note that the restricted submanifold is a bundle (but not a fiber bundle in general) over the quotient manifold. So, instead of working with the quotient manifold, we can work with a section of the bundle instead. This is called gauge fixing.
The major problem is this bundle might not have a global section in general. This is where the "problem" of global anomalies comes in, for example. A global anomaly is different from the Gribov ambiguity, which is when a gauge fixing doesn't work to fix a gauge uniquely, in a global anomaly, there is no consistent definition of the gauge field. A global anomaly is a barrier to defining a quantum gauge theory discovered by Witten in 1980.
What have been described are irreducible first-class constraints. Another complication is that Δf might not be right invertible on subspaces of the restricted submanifold of codimension 1 or greater (which violates the stronger assumption stated earlier in this article). This happens, for example in the cotetrad formulation of general relativity, at the subspace of configurations where the cotetrad field and the connection form happen to be zero over some open subset of space. Here, the constraints are the diffeomorphism constraints.
One way to get around this is this: For reducible constraints, we relax the condition on the right invertibility of Δf into this one: Any smooth function that vanishes at the zeros of f is the fiberwise contraction of f with (a non-unique) smooth section of a
\bar{V}
\bar{V}
First of all, we will assume the action is the integral of a local Lagrangian that only depends up to the first derivative of the fields. The analysis of more general cases, while possible is more complicated. When going over to the Hamiltonian formalism, we find there are constraints. Recall that in the action formalism, there are on shell and off shell configurations. The constraints that hold off shell are called primary constraints while those that only hold on shell are called secondary constraints.
Consider the dynamics of a single point particle of mass with no internal degrees of freedom moving in a pseudo-Riemannian spacetime manifold with metric g. Assume also that the parameter describing the trajectory of the particle is arbitrary (i.e. we insist upon reparametrization invariance). Then, its symplectic space is the cotangent bundle T*S with the canonical symplectic form .
If we coordinatize T * S by its position in the base manifold and its position within the cotangent space p, then we have a constraint
f = m2 -g(x)-1(p,p) = 0.
The Hamiltonian is, surprisingly enough, = 0. In light of the observation that the Hamiltonian is only defined up to the equivalence class of smooth functions agreeing on the constrained subspace, we can use a new Hamiltonian '= instead. Then, we have the interesting case where the Hamiltonian is the same as a constraint! See Hamiltonian constraint for more details.
Consider now the case of a Yang–Mills theory for a real simple Lie algebra (with a negative definite Killing form) minimally coupled to a real scalar field, which transforms as an orthogonal representation with the underlying vector space under in (- 1) + 1 Minkowski spacetime. For in, we write
as
for simplicity. Let A be the -valued connection form of the theory. Note that the A here differs from the A used by physicists by a factor of and . This agrees with the mathematician's convention.
The action is given by
dA+A\wedgeA
Dσ = dσ - A[σ]and is the orthogonal form for .
What is the Hamiltonian version of this model? Well, first, we have to split A noncovariantly into a time component and a spatial part . Then, the resulting symplectic space has the conjugate variables, (taking values in the underlying vector space of
\bar{\rho}
\vec{D} ⋅ \vec{\pi}A-\rho'(\pi\sigma,\sigma)=0
\rho:L ⊗ V → V
\rho':\bar{V} ⊗ V → L
Hf=\intdd-1x
1 | |
2 |
\alpha-1(\pi\sigma,\pi
\alpha(\vec{D}\sigma ⋅ \vec{D}\sigma)- | |||||
|
g2 | |
2 |
η(\vec{\pi}A,\vec{\pi}
|
η(B ⋅ B)-η(\pi\phi,f)-<\pi\sigma,\phi[\sigma]>-η(\phi,\vec{D} ⋅ \vec{\pi}A).
The last two terms are a linear combination of the Gaussian constraints and we have a whole family of (gauge equivalent)Hamiltonians parametrized by . In fact, since the last three terms vanish for the constrained states, we may drop them.
In a constrained Hamiltonian system, a dynamical quantity is second-class if its Poisson bracket with at least one constraint is nonvanishing. A constraint that has a nonzero Poisson bracket with at least one other constraint, then, is a second-class constraint.
See Dirac brackets for diverse illustrations.
Before going on to the general theory, consider a specific example step by step to motivate the general analysis.
Start with the action describing a Newtonian particle of mass constrained to a spherical surface of radius within a uniform gravitational field . When one works in Lagrangian mechanics, there are several ways to implement a constraint: one can switch to generalized coordinates that manifestly solve the constraint, or one can use a Lagrange multiplier while retaining the redundant coordinates so constrained.
In this case, the particle is constrained to a sphere, therefore the natural solution would be to use angular coordinates to describe the position of the particle instead of Cartesian and solve (automatically eliminate) the constraint in that way (the first choice). For pedagogical reasons, instead, consider the problem in (redundant) Cartesian coordinates, with a Lagrange multiplier term enforcing the constraint.
The action is given by
S=\intdtL=\intdt\left[
m | ( | |
2 |
x |
| |||
| |||
| ||||
(x2+y2+z2-R2)\right]
Of course, as indicated, we could have just used different, non-redundant, spherical coordinates and written it as
S=\intdt\left[
mR2 | ( | |
2 |
\theta |
2+\sin
| |||
2)+mgR\cos(\theta)\right]
The conjugate momenta are given by
p | |||
|
p | |||
|
p | |||
|
pλ=0
The Hamiltonian is given by
H=\vec{p} ⋅
\vec{r |
We cannot eliminate at this stage yet. We are here treating as a shorthand for a function of the symplectic space which we have yet to determine and not as an independent variable. For notational consistency, define from now on. The above Hamiltonian with the term is the "naive Hamiltonian". Note that since, on-shell, the constraint must be satisfied, one cannot distinguish, on-shell, between the naive Hamiltonian and the above Hamiltonian with the undetermined coefficient, .
.
We require, on the grounds of consistency, that the Poisson bracket of all the constraints with the Hamiltonian vanish at the constrained subspace. In other words, the constraints must not evolve in time if they are going to be identically zero along the equations of motion.
From this consistency condition, we immediately get the secondary constraint
\begin{align} 0&=\{H,pλ\}PB\\ &=\sumi
\partialH | |
\partialqi |
\partialpλ | - | |
\partialpi |
\partialH | |
\partialpi |
\partialpλ | \\ &= | |
\partialqi |
\partialH | \\ &= | |
\partialλ |
1 | |
2 |
(r2-R2)\\ &\Downarrow\\ 0&=r2-R2\end{align}
This constraint should be added into the Hamiltonian with an undetermined (not necessarily constant) coefficient 2, enlarging the Hamiltonian to
H=
p2 | |
2m |
+mgz-
λ | |
2 |
(r2-R2)+u1pλ+u2(r2-R2)~.
Similarly, from this secondary constraint, we find the tertiary constraint
\begin{align} 0&=\{H,r2-R
2\} | |
PB |
2\} | |
\\ &=\{H,x | |
PB |
2\} | |
+\{H,y | |
PB |
2\} | ||
+\{H,z | \\ &= | |
PB |
\partialH | 2x+ | |
\partialpx |
\partialH | 2y+ | |
\partialpy |
\partialH | 2z\\ &= | |
\partialpz |
2 | |
m |
(pxx+pyy+pzz)\\ &\Downarrow\\ 0&=\vecp ⋅ \vecr\end{align}
Again, one should add this constraint into the Hamiltonian, since, on-shell, no one can tell the difference. Therefore, so far, the Hamiltonian looks like
H=
p2 | |
2m |
+mgz-
λ | |
2 |
(r2-R2)+u1pλ+u2(r2-R2)+u3\vec{p} ⋅ \vec{r}~,
Note that, frequently, all constraints that are found from consistency conditions are referred to as secondary constraints and secondary, tertiary, quaternary, etc., constraints are not distinguished.
We keep turning the crank, demanding this new constraint have vanishing Poisson bracket
0=\{\vec{p} ⋅ \vec{r},H\}PB=
p2 | |
m |
-mgz+λr2-2u2r2.
We might despair and think that there is no end to this, but because one of the new Lagrange multipliers has shown up, this is not a new constraint, but a condition that fixes the Lagrange multiplier:
u2=
λ | |
2 |
+
1 | \left( | |
r2 |
p2 | - | |
2m |
1 | |
2 |
mgz\right).
Plugging this into our Hamiltonian gives us (after a little algebra)
H=
p2 | (2- | |
2m |
R2 | |
r2 |
)+
1 | mgz(1+ | |
2 |
R2 | |
r2 |
)+u1pλ+u3\vecp ⋅ \vecr
Now that there are new terms in the Hamiltonian, one should go back and check the consistency conditions for the primary and secondary constraints. The secondary constraint's consistency condition gives
2 | |
m |
\vec{r} ⋅ \vec{p}+2u3r2=0.
u3=-
\vec{r | |
⋅ \vec{p}}{m |
r2}~.
Putting it all together,
H=\left(2- | R2 | \right) |
r2 |
p2 | |
2m |
+
1 | \left(1+ | |
2 |
R2 | |
r2 |
\right)mgz-
(\vec{r | |
⋅ \vec{p}) |
2}{mr2}+u1pλ
\vec{r |
Before analyzing the Hamiltonian, consider the three constraints,
\varphi1=pλ, \varphi2=r2-R2, \varphi3=\vec{p} ⋅ \vec{r}.
\{\varphi2,\varphi3\}=2r2 ≠ 0.
Here, we have a symplectic space where the Poisson bracket does not have "nice properties" on the constrained subspace. However, Dirac noticed that we can turn the underlying differential manifold of the symplectic space into a Poisson manifold using his eponymous modified bracket, called the Dirac bracket, such that this Dirac bracket of any (smooth) function with any of the second-class constraints always vanishes.
Effectively, these brackets (illustrated for this spherical surface in the Dirac bracket article) project the system back onto the constraints surface.If one then wished to canonically quantize this system, then one need promote the canonical Dirac brackets,[3] not the canonical Poisson brackets to commutation relations.
Examination of the above Hamiltonian shows a number of interesting things happening. One thing to note is that, on-shell when the constraints are satisfied, the extended Hamiltonian is identical to the naive Hamiltonian, as required. Also, note that dropped out of the extended Hamiltonian. Since is a first-class primary constraint, it should be interpreted as a generator of a gauge transformation. The gauge freedom is the freedom to choose, which has ceased to have any effect on the particle's dynamics. Therefore, that dropped out of the Hamiltonian, that 1 is undetermined, and that = pλ is first-class, are all closely interrelated.
Note that it would be more natural not to start with a Lagrangian with a Lagrange multiplier, but instead take as a primary constraint and proceed through the formalism: The result would the elimination of the extraneous dynamical quantity. However, the example is more edifying in its current form.
See also: Dirac bracket.
Another example we will use is the Proca action. The fields are
A\mu=(\vec{A},\phi)
S=\intddxdt\left[
1 | |
2 |
E2-
1 | |
4 |
BijBij-
m2 | |
2 |
A2+
m2 | |
2 |
\phi2\right]
\vec{E}\equiv-\nabla\phi-
\vec{A |
Bij\equiv
\partialAj | |
\partialxi |
-
\partialAi | |
\partialxj |
(\vec{A},-\vec{E})
(\phi,\pi)
\pi ≈ 0
\nabla ⋅ \vec{E}+m2\phi ≈ 0
H=\intddx\left[
1 | |
2 |
E2+
1 | |
4 |
BijBij-\pi\nabla ⋅ \vec{A}+\vec{E} ⋅ \nabla\phi+
m2 | |
2 |
A2-
m2 | |
2 |
\phi2\right]
L(q,
q), |