Outer product explained

In linear algebra, the outer product of two coordinate vectors is the matrix whose entries are all products of an element in the first vector with an element in the second vector. If the two coordinate vectors have dimensions n and m, then their outer product is an n × m matrix. More generally, given two tensors (multidimensional arrays of numbers), their outer product is a tensor. The outer product of tensors is also referred to as their tensor product, and can be used to define the tensor algebra.

The outer product contrasts with:

Definition

Given two vectors of size

m x 1

and

n x 1

respectively

\mathbf = \begin u_1 \\ u_2 \\ \vdots \\ u_m \end,\quad\mathbf = \begin v_1 \\ v_2 \\ \vdots \\ v_n \endtheir outer product, denoted

uv,

is defined as the

m x n

matrix

A

obtained by multiplying each element of

u

by each element of [1]

\mathbf \otimes \mathbf = \mathbf = \begin u_1v_1 & u_1v_2 & \dots & u_1v_n \\ u_2v_1 & u_2v_2 & \dots & u_2v_n \\ \vdots & \vdots & \ddots & \vdots \\ u_mv_1 & u_mv_2 & \dots & u_mv_n \end

Or, in index notation:

(\mathbf \otimes \mathbf)_ = u_i v_j

Denoting the dot product by

,

if given an

n x 1

vector

w,

then

(uv)w=(vw)u.

If given a

1 x m

vector

x,

then

x(uv)=(xu)v\operatorname{T

}.

If

u

and

v

are vectors of the same dimension bigger than 1, then

\det(uv)=0

.

The outer product

uv

is equivalent to a matrix multiplication

uv\operatorname{T

}, provided that

u

is represented as a

m x 1

column vector and

v

as a

n x 1

column vector (which makes

v\operatorname{T

} a row vector).[2] [3] For instance, if

m=4

and

n=3,

then

\mathbf \otimes \mathbf = \mathbf\mathbf^\textsf = \beginu_1 \\ u_2 \\ u_3 \\ u_4\end \beginv_1 & v_2 & v_3\end = \begin u_1 v_1 & u_1 v_2 & u_1 v_3 \\ u_2 v_1 & u_2 v_2 & u_2 v_3 \\ u_3 v_1 & u_3 v_2 & u_3 v_3 \\ u_4 v_1 & u_4 v_2 & u_4 v_3 \end.

For complex vectors, it is often useful to take the conjugate transpose of

v,

denoted

v\dagger

or

\left(vsf{T}\right)*

:

\mathbf \otimes \mathbf = \mathbf \mathbf^\dagger = \mathbf \left(\mathbf^\textsf\right)^*.

Contrast with Euclidean inner product

If

m=n,

then one can take the matrix product the other way, yielding a scalar (or

1 x 1

matrix):

\left\langle\mathbf, \mathbf\right\rangle = \mathbf^\textsf \mathbfwhich is the standard inner product for Euclidean vector spaces,[3] better known as the dot product. The dot product is the trace of the outer product.[4] Unlike the dot product, the outer product is not commutative.

Multiplication of a vector

w

by the matrix

uv

can be written in terms of the inner product, using the relation

\left(uv\right)w=u\left\langlev,w\right\rangle

.

The outer product of tensors

Given two tensors

u,v

with dimensions

(k1,k2,...,km)

and

(l1,l2,...,ln)

, their outer product

uv

is a tensor with dimensions

(k1,k2,...,km,l1,l2,...,ln)

and entries

(\mathbf \otimes \mathbf)_ = u_ v_

For example, if

A

is of order 3 with dimensions

(3,5,7)

and

B

is of order 2 with dimensions

(10,100),

then their outer product

C

is of order 5 with dimensions

(3,5,7,10,100).

If

A

has a component and

B

has a component, then the component of

C

formed by the outer product is .

Connection with the Kronecker product

The outer product and Kronecker product are closely related; in fact the same symbol is commonly used to denote both operations.

If

u=\begin{bmatrix}1&2&3\end{bmatrix}sf{T}

and

v=\begin{bmatrix}4&5\end{bmatrix}sf{T}

, we have:

\begin \mathbf \otimes_\text \mathbf &= \begin 4 \\ 5 \\ 8 \\ 10 \\ 12 \\ 15\end, & \mathbf \otimes_\text \mathbf &= \begin 4 & 5 \\ 8 & 10 \\ 12 & 15\end\end

In the case of column vectors, the Kronecker product can be viewed as a form of vectorization (or flattening) of the outer product. In particular, for two column vectors

u

and

v

, we can write:

\mathbf \otimes_ \mathbf = \operatorname(\mathbf \otimes_\text \mathbf)

(The order of the vectors is reversed on the right side of the equation.)

Another similar identity that further highlights the similarity between the operations is

\mathbf \otimes_ \mathbf^\textsf = \mathbf u \mathbf^\textsf = \mathbf \otimes_ \mathbf

where the order of vectors needs not be flipped. The middle expression uses matrix multiplication, where the vectors are considered as column/row matrices.

Connection with the matrix product

Given a pair of matrices

A

of size

m x p

and

B

of size

p x n

, consider the matrix product

C=AB

defined as usual as a matrix of size

m x n

.

Now let

col
a
k
be the

k

-th column vector of

A

and let
row
b
k
be the

k

-th row vector of

B

. Then

C

can be expressed as a sum of column-by-row outer products:

\mathbf = \mathbf\, \mathbf =\left(\sum_^p _\, _\right)_ =\begin & & \\ \mathbf a^\text_ & \cdots & \mathbf a^\text_ \\ & & \end\begin & \mathbf b^\text_ & \\ & \vdots & \\ & \mathbf b^\text_ & \end= \sum_^p \mathbf a^\text_k \mathbf b^\text_kThis expression has duality with the more common one as a matrix built with row-by-column inner product entries (or dot product):

Cij=

row
\langle{a
i,b
col
j
}\rangle

This relation is relevant[5] in the application of the Singular Value Decomposition (SVD) (and Spectral Decomposition as a special case). In particular, the decomposition can be interpreted as the sum of outer products of each left (

uk

) and right (

vk

) singular vectors, scaled by the corresponding nonzero singular value

\sigmak

:

\mathbf = \mathbf = \sum_^(\mathbf_k \otimes \mathbf_k) \, \sigma_k

This result implies that

A

can be expressed as a sum of rank-1 matrices with spectral norm

\sigmak

in decreasing order. This explains the fact why, in general, the last terms contribute less, which motivates the use of the truncated SVD as an approximation. The first term is the least squares fit of a matrix to an outer product of vectors.

Properties

The outer product of vectors satisfies the following properties:

\begin (\mathbf \otimes \mathbf)^\textsf &= (\mathbf \otimes \mathbf) \\ (\mathbf + \mathbf) \otimes \mathbf &= \mathbf \otimes \mathbf + \mathbf \otimes \mathbf \\ \mathbf \otimes (\mathbf + \mathbf) &= \mathbf \otimes \mathbf + \mathbf \otimes \mathbf \\ c (\mathbf \otimes \mathbf) &= (c\mathbf) \otimes \mathbf = \mathbf \otimes (c\mathbf)\end

The outer product of tensors satisfies the additional associativity property:

(\mathbf \otimes \mathbf) \otimes \mathbf = \mathbf \otimes (\mathbf \otimes \mathbf)

Rank of an outer product

If u and v are both nonzero, then the outer product matrix uvT always has matrix rank 1. Indeed, the columns of the outer product are all proportional to u. Thus they are all linearly dependent on that one column, hence the matrix is of rank one.

("Matrix rank" should not be confused with "tensor order", or "tensor degree", which is sometimes referred to as "rank".)

Definition (abstract)

Let and be two vector spaces. The outer product of

v\inV

and

w\inW

is the element

vw\inVW

.

If is an inner product space, then it is possible to define the outer product as a linear map . In this case, the linear map

x\mapsto\langlev,x\rangle

is an element of the dual space of, as this maps linearly a vector into its underlying field, of which

\langlev,x\rangle

is an element. The outer product is then given by

(\mathbf w \otimes \mathbf v) (\mathbf x) = \left\langle \mathbf v, \mathbf x \right\rangle \mathbf w.

This shows why a conjugate transpose of is commonly taken in the complex case.

In programming languages

In some programming languages, given a two-argument function f (or a binary operator), the outer product, f, of two one-dimensional arrays, A and B, is a two-dimensional array C such that C[i, j] = f(A[i], B[j]). This is syntactically represented in various ways: in APL, as the infix binary operator ∘.f; in J, as the postfix adverb f/; in R, as the function outer(A, B, f) or the special %o%;[6] in Mathematica, as Outer[f, A, B]. In MATLAB, the function kron(A, B) is used for this product. These often generalize to multi-dimensional arguments, and more than two arguments.

In the Python library NumPy, the outer product can be computed with function np.outer.[7] In contrast, np.kron results in a flat array. The outer product of multidimensional arrays can be computed using np.multiply.outer.

Applications

As the outer product is closely related to the Kronecker product, some of the applications of the Kronecker product use outer products. These applications are found in quantum theory, signal processing, and image compression.[8]

Spinors

Suppose so that and are in . Then the outer product of these complex 2-vectors is an element of, the 2 × 2 complex matrices:

\begin sw & tw \\ sz & tz \end.The determinant of this matrix is because of the commutative property of .

In the theory of spinors in three dimensions, these matrices are associated with isotropic vectors due to this null property. Élie Cartan described this construction in 1937,[9] but it was introduced by Wolfgang Pauli in 1927 so that has come to be called Pauli algebra.

Concepts

The block form of outer products is useful in classification. Concept analysis is a study that depends on certain outer products:

When a vector has only zeros and ones as entries, it is called a logical vector, a special case of a logical matrix. The logical operation and takes the place of multiplication. The outer product of two logical vectors and is given by the logical matrix

\left(aij\right)=\left(ui\landvj\right)

. This type of matrix is used in the study of binary relations, and is called a rectangular relation or a cross-vector.[10]

See also

Products

Duality

Further reading

Notes and References

  1. Book: Encyclopaedia of Physics . 2nd . R. G. . Lerner . Rita G. Lerner . G. L. . Trigg . VHC . 1991 . 0-89573-752-3 . registration .
  2. Book: Linear Algebra . 4th . S. . Lipschutz . M. . Lipson . Schaum’s Outlines . McGraw-Hill . 2009 . 978-0-07-154352-1.
  3. Web site: Keller . Frank . February 23, 2020 . Algebraic Properties of Matrices; Transpose; Inner and Outer Product . https://web.archive.org/web/20171215061654/http://www.inf.ed.ac.uk/teaching/courses/cfcs1/lectures/cfcs_l10.pdf . 2017-12-15 . live . September 6, 2020 . inf.ed.ac.uk.
  4. Book: Stengel, Robert F. . Optimal Control and Estimation . New York . Dover Publications . 1994 . 26 . 0-486-68200-5 .
  5. Book: Bau III . David . Trefethen . Lloyd N. . Lloyd N. Trefethen . Numerical linear algebra . Society for Industrial and Applied Mathematics . Philadelphia . 978-0-89871-361-9 . 1997.
  6. Web site: outer function R Documentation . 2020-09-07 . rdocumentation.org.
  7. Web site: numpy.outer — NumPy v1.19 Manual . 2020-09-07 . numpy.org.
  8. Book: Steeb . Willi-Hans . Matrix Calculus and Kronecker Product: A Practical Approach to Linear and Multilinear Algebra . Hardy . Yorick . World Scientific . 2011 . 978-981-4335-31-7 . 2 . Applications (Chapter 3).
  9. [Élie Cartan]
  10. [Ki-Hang Kim]