In linear algebra, Cramer's rule is an explicit formula for the solution of a system of linear equations with as many equations as unknowns, valid whenever the system has a unique solution. It expresses the solution in terms of the determinants of the (square) coefficient matrix and of matrices obtained from it by replacing one column by the column vector of right-sides of the equations. It is named after Gabriel Cramer, who published the rule for an arbitrary number of unknowns in 1750,[1] [2] although Colin Maclaurin also published special cases of the rule in 1748,[3] and possibly knew of it as early as 1729.[4] [5] [6]
Cramer's rule, implemented in a naive way, is computationally inefficient for systems of more than two or three equations.[7] In the case of equations in unknowns, it requires computation of determinants, while Gaussian elimination produces the result with the same computational complexity as the computation of a single determinant.[8] [9] Cramer's rule can also be numerically unstable even for 2×2 systems.[10] However, Cramer's rule can be implemented with the same complexity as Gaussian elimination,[11] [12] (consistently requires twice as many arithmetic operations and has the same numerical stability when the same permutation matrices are applied).
Consider a system of linear equations for unknowns, represented in matrix multiplication form as follows:
Ax=b
where the matrix has a nonzero determinant, and the vector
x=(x1,\ldots,
T | |
x | |
n) |
xi=
\det(Ai) | |
\det(A) |
i=1,\ldots,n
where
Ai
A more general version of Cramer's rule[13] considers the matrix equation
AX=B
where the matrix has a nonzero determinant, and, are matrices. Given sequences
1\leqi1<i2< … <ik\leqn
1\leqj1<j2< … <jk\leqm
XI,J
I:=(i1,\ldots,ik)
J:=(j1,\ldots,jk)
AB(I,J)
is
js
s=1,\ldots,k
\detXI,J=
\det(AB(I,J)) | |
\det(A) |
.
In the case
k=1
The rule holds for systems of equations with coefficients and unknowns in any field, not just in the real numbers.
The proof for Cramer's rule uses the following properties of the determinants: linearity with respect to any given column and the fact that the determinant is zero whenever two columns are equal, which is implied by the property that the sign of the determinant flips if you switch two columns.
Fix the index of a column, and consider that the entries of the other columns have fixed values. This makes the determinant a function of the entries of the th column. Linearity with respect of this column means that this function has the form
Dj(a1,j,\ldots,an,j)=C1,ja1,j+ … ,Cn,jan,j,
Ci,j
\det(A)=Dj(a1,j,\ldots,an,j)=C1,ja1,j+ … ,Cn,jan,j
Ci,j
If the function
Dj
Now consider a system of linear equations in unknowns
x1,\ldots,xn
\begin{matrix} a11x1+a12x2+ … +a1nxn&=&b1\\ a21x1+a22x2+ … +a2nxn&=&b2\\ &\vdots&\\ an1x1+an2x2+ … +annxn&=&bn. \end{matrix}
If one combines these equations by taking times the first equation, plus times the second, and so forth until times the last, then for every the resulting coefficient of becomes
Dj(a1,k,\ldots,an,k).
xj
\det(A).
Dj(b1,\ldots,bn),
\det(A)xj=Dj(b1,\ldots,bn),
xj
x | ||||
|
1,\ldots,bn).
As, by construction, the numerator is the determinant of the matrix obtained from by replacing column by, we get the expression of Cramer's rule as a necessary condition for a solution.
It remains to prove that these values for the unknowns form a solution. Let be the matrix that has the coefficients of
Dj
j=1,\ldots,n
x=
1{\det(A)}Mb | |
A\left( | 1{\det(A)}M\right)b=b. |
A\left( | 1{\det(A)}M\right)=I |
n, |
In
The above properties of the functions
Dj
\left( | 1{\det(A)}M\right)A=I |
n. |
For other proofs, see below.
See main article: article. Let be an matrix with entries in a field . Then
A\operatorname{adj}(A)=\operatorname{adj}(A)A=\det(A)I
where denotes the adjugate matrix, is the determinant, and is the identity matrix. If is nonzero, then the inverse matrix of is
A-1=
1 | |
\det(A) |
\operatorname{adj}(A).
This gives a formula for the inverse of, provided . In fact, this formula works whenever is a commutative ring, provided that is a unit. If is not a unit, then is not invertible over the ring (it may be invertible over a larger ring in which some non-unit elements of may be invertible).
Consider the linear system
\left\{\begin{matrix} a1x+b1y&={\color{red}c1}\\ a2x+b2y&={\color{red}c2} \end{matrix}\right.
which in matrix format is
\begin{bmatrix}a1&b1\ a2&b2\end{bmatrix}\begin{bmatrix}x\ y\end{bmatrix}=\begin{bmatrix}{\color{red}c1}\ {\color{red}c2}\end{bmatrix}.
Assume is nonzero. Then, with the help of determinants, and can be found with Cramer's rule as
\begin{align} x&=
\begin{vmatrix | |
\color{red |
{c1}}&b1\ {\color{red}{c2}}&b2\end{vmatrix}}{\begin{vmatrix}a1&b1\ a2&b2\end{vmatrix}}={{\color{red}c1}b2-b1{\color{red}c2}\overa1b2-b1a2}, y=
\begin{vmatrix | |
a |
1&{\color{red}{c1}}\ a2&{\color{red}{c2}}\end{vmatrix}}{\begin{vmatrix}a1&b1\ a2&b2\end{vmatrix}}={a1{\color{red}c2}-{\color{red}c1}a2\overa1b2-b1a2} \end{align}.
The rules for matrices are similar. Given
\left\{\begin{matrix} a1x+b1y+c1z&={\color{red}d1}\\ a2x+b2y+c2z&={\color{red}d2}\\ a3x+b3y+c3z&={\color{red}d3} \end{matrix}\right.
which in matrix format is
\begin{bmatrix}a1&b1&c1\ a2&b2&c2\ a3&b3&c3\end{bmatrix}\begin{bmatrix}x\ y\ z\end{bmatrix}=\begin{bmatrix}{\color{red}d1}\ {\color{red}d2}\ {\color{red}d3}\end{bmatrix}.
Then the values of and can be found as follows:
x=
\begin{vmatrix | |
\color{red |
d1}&b1&c1\ {\color{red}d2}&b2&c2\ {\color{red}d3}&b3&c3\end{vmatrix}}{\begin{vmatrix}a1&b1&c1\ a2&b2&c2\ a3&b3&c3\end{vmatrix}}, y=
\begin{vmatrix | |
a |
1&{\color{red}d1}&c1\ a2&{\color{red}d2}&c2\ a3&{\color{red}d3}&c3\end{vmatrix}}{\begin{vmatrix}a1&b1&c1\ a2&b2&c2\ a3&b3&c3\end{vmatrix}},and z=
\begin{vmatrix | |
a |
1&b1&{\color{red}d1}\ a2&b2&{\color{red}d2}\ a3&b3&{\color{red}d3}\end{vmatrix}}{\begin{vmatrix}a1&b1&c1\ a2&b2&c2\ a3&b3&c3\end{vmatrix}}.
Cramer's rule is used in the Ricci calculus in various calculations involving the Christoffel symbols of the first and second kind.[14]
In particular, Cramer's rule can be used to prove that the divergence operator on a Riemannian manifold is invariant with respect to change of coordinates. We give a direct proof, suppressing the role of the Christoffel symbols.Let
(M,g)
(x1,x2,...,xn)
A=Ai
\partial | |
\partialxi |
Theorem.
The divergence of
A
\operatorname{div}A=
1 | |
\sqrt{\detg |
is invariant under change of coordinates.
Let
(x1,x2,\ldots,xn)\mapsto(\barx1,\ldots,\barxn)
A=\barAk
\partial | |
\partial\barxk |
\barAk=
\partial\barxk | |
\partialxj |
Aj
g=gmkdxm ⊗ dxk=\bar{g}ijd\barxi ⊗ d\barxj
\bar{g}ij=
\partialxm | |
\partial\barxi |
\partialxk | |
\partial\barxj |
gmk
\barg=\left(
\partialx | |
\partial\bar{x |
\det\barg=\left(\det\left(
\partialx | |
\partial\bar{x |
Now one computes
\begin{align} \operatorname{div}A&=
1 | |
\sqrt{\detg |
1 | |
\sqrt{\det\barg |
\partial\barxk | |
\partialxi |
\partial | \left( | |
\partial\barxk |
\partialxi | \det\left( | |
\partial\barx\ell |
\partialx | |
\partial\bar{x |
\partial | \det\left( | |
\partial\barx\ell |
\partialx | |
\partial\bar{x |
\begin{align} | \partial | \det\left( |
\partial\barx\ell |
\partialx | |
\partial\bar{x |
M(i|j)
\left( | \partialx |
\partial\bar{x |
i
j
(-1)i+j | |||
|
\right)}\detM(i|j)
(j,i)
\left( | \partial\bar{x |
(\ast)=\det\left( | \partialx |
\partial\bar{x |
Consider the two equations
F(x,y,u,v)=0
G(x,y,u,v)=0
x=X(u,v)
y=Y(u,v).
An equation for
\dfrac{\partialx}{\partialu}
First, calculate the first derivatives of F, G, x, and y:
\begin{align} dF&=
\partialF | |
\partialx |
dx+
\partialF | |
\partialy |
dy+
\partialF | |
\partialu |
du+
\partialF | |
\partialv |
dv=0\\[6pt] dG&=
\partialG | |
\partialx |
dx+
\partialG | |
\partialy |
dy+
\partialG | |
\partialu |
du+
\partialG | |
\partialv |
dv=0\\[6pt] dx&=
\partialX | |
\partialu |
du+
\partialX | |
\partialv |
dv\\[6pt] dy&=
\partialY | |
\partialu |
du+
\partialY | |
\partialv |
dv. \end{align}
Substituting dx, dy into dF and dG, we have:
\begin{align} dF&=\left(
\partialF | |
\partialx |
\partialx | + | |
\partialu |
\partialF | |
\partialy |
\partialy | |
\partialu |
+
\partialF | |
\partialu |
\right)du+\left(
\partialF | |
\partialx |
\partialx | + | |
\partialv |
\partialF | |
\partialy |
\partialy | + | |
\partialv |
\partialF | |
\partialv |
\right)dv=0\ [6pt] dG&=\left(
\partialG | |
\partialx |
\partialx | + | |
\partialu |
\partialG | |
\partialy |
\partialy | + | |
\partialu |
\partialG | |
\partialu |
\right)du+\left(
\partialG | |
\partialx |
\partialx | + | |
\partialv |
\partialG | |
\partialy |
\partialy | + | |
\partialv |
\partialG | |
\partialv |
\right)dv=0. \end{align}
Since u, v are both independent, the coefficients of du, dv must be zero. So we can write out equations for the coefficients:
\begin{align} | \partialF |
\partialx |
\partialx | + | |
\partialu |
\partialF | |
\partialy |
\partialy | |
\partialu |
&=-
\partialF | \\[6pt] | |
\partialu |
\partialG | |
\partialx |
\partialx | + | |
\partialu |
\partialG | |
\partialy |
\partialy | |
\partialu |
&=-
\partialG | \\[6pt] | |
\partialu |
\partialF | |
\partialx |
\partialx | + | |
\partialv |
\partialF | |
\partialy |
\partialy | |
\partialv |
&=-
\partialF | \\[6pt] | |
\partialv |
\partialG | |
\partialx |
\partialx | + | |
\partialv |
\partialG | |
\partialy |
\partialy | |
\partialv |
&=-
\partialG | |
\partialv |
. \end{align}
Now, by Cramer's rule, we see that:
\partialx | |
\partialu |
=
\begin{vmatrix | |||
|
&
\partialF | \ - | |
\partialy |
\partialG | |
\partialu |
&
\partialG | \end{vmatrix}}{\begin{vmatrix} | |
\partialy |
\partialF | |
\partialx |
&
\partialF | \ | |
\partialy |
\partialG | |
\partialx |
&
\partialG | |
\partialy |
\end{vmatrix}}.
This is now a formula in terms of two Jacobians:
\partialx | |
\partialu |
=-
| |||||
|
.
Similar formulas can be derived for
\partialx | |
\partialv |
,
\partialy | |
\partialu |
,
\partialy | |
\partialv |
.
Cramer's rule can be used to prove that an integer programming problem whose constraint matrix is totally unimodular and whose right-hand side is integer, has integer basic solutions. This makes the integer program substantially easier to solve.
Cramer's rule is used to derive the general solution to an inhomogeneous linear differential equation by the method of variation of parameters.
Cramer's rule has a geometric interpretation that can be considered also a proof or simply giving insight about its geometric nature. These geometric arguments work in general and not only in the case of two equations with two unknowns presented here.
Given the system of equations
\begin{matrix}a11x1+a12x2&=b1\\a21x1+a22x2&=b2\end{matrix}
it can be considered as an equation between vectors
x1\binom{a11
The area of the parallelogram determined by
\binom{a11
\binom{a12
\begin{vmatrix}a11&a12\\a21&a22\end{vmatrix}.
In general, when there are more variables and equations, the determinant of vectors of length will give the volume of the parallelepiped determined by those vectors in the -th dimensional Euclidean space.
Therefore, the area of the parallelogram determined by
x1\binom{a11
\binom{a12
x1
\binom{b1}{b2}=x1\binom{a11
\binom{a12
Equating the areas of this last and the second parallelogram gives the equation
\begin{vmatrix}b1&a12\\b2&a22\end{vmatrix}=\begin{vmatrix}a11x1&a12\\a21x1&a22\end{vmatrix}=x1\begin{vmatrix}a11&a12\\a21&a22\end{vmatrix}
from which Cramer's rule follows.
This is a restatement of the proof above in abstract language.
Consider the map
x=(x1,\ldots,xn)\mapsto
1 | |
\detA |
\left(\det(A1),\ldots,\det(An)\right),
Ai
A
x
i
i
A
i
ei=(0,\ldots,1,\ldots,0)
i
A
A-1
A
Rn
A
A short proof of Cramer's rule [15] can be given by noticing that
x1
X1=\begin{bmatrix} x1&0&0& … &0\\ x2&1&0& … &0\\ x3&0&1& … &0\\ \vdots&\vdots&\vdots&\ddots&\vdots\\ xn&0&0& … &1 \end{bmatrix}
On the other hand, assuming that our original matrix is invertible, this matrix
X1
A-1b,A-1v2,\ldots,A-1vn
vn
A1
b,v2,\ldots,vn
-1 | |
X | |
1=A |
A1
x1=\det(X1)=\det(A-1)\det(A1)=
\det(A1) | |
\det(A) |
.
The proof for other
xj
A system of equations is said to be inconsistent when there are no solutions and it is called indeterminate when there is more than one solution. For linear equations, an indeterminate system will have infinitely many solutions (if it is over an infinite field), since the solutions can be expressed in terms of one or more parameters that can take arbitrary values.
Cramer's rule applies to the case where the coefficient determinant is nonzero. In the 2×2 case, if the coefficient determinant is zero, then the system is incompatible if the numerator determinants are nonzero, or indeterminate if the numerator determinants are zero.
For 3×3 or higher systems, the only thing one can say when the coefficient determinant equals zero is that if any of the numerator determinants are nonzero, then the system must be inconsistent. However, having all determinants zero does not imply that the system is indeterminate. A simple example where all determinants vanish (equal zero) but the system is still incompatible is the 3×3 system x+y+z=1, x+y+z=2, x+y+z=3.