Projected normal distribution explained

Projected normal distribution
Type:density
Notation:

l{PN}n(\boldsymbol\mu,\boldsymbol\Sigma)

Parameters:

\boldsymbol\mu\in\Rn

(location)

\boldsymbol\Sigma\in\Rn

(scale)
Support:

\boldsymbol\theta\in[0,\pi]n x [0,2\pi)

| pdf = complicated, see text

In directional statistics, the projected normal distribution (also known as offset normal distribution, angular normal distribution or angular Gaussian distribution) is a probability distribution over directions that describes the radial projection of a random variable with n-variate normal distribution over the unit (n-1)-sphere.

Definition and properties

Given a random variable

\boldsymbolX\in\Rn

that follows a multivariate normal distribution

l{N}n(\boldsymbol\mu,\boldsymbol\Sigma)

, the projected normal distribution

l{PN}n(\boldsymbol\mu,\boldsymbol\Sigma)

represents the distribution of the random variable

\boldsymbolY=

\boldsymbolX
\lVert\boldsymbolX\rVert
obtained projecting

\boldsymbolX

over the unit sphere. In the general case, the projected normal distribution can be asymmetric and multimodal. In case

\boldsymbol\mu

is orthogonal to an eigenvector of

\boldsymbol\Sigma

, the distribution is symmetric. The first version of such distribution was introduced in Pukkila and Rao (1988).

Density function

The density of the projected normal distribution

l{PN}n(\boldsymbol\mu,\boldsymbol\Sigma)

can be constructed from the density of its generator n-variate normal distribution

l{N}n(\boldsymbol\mu,\boldsymbol\Sigma)

by re-parametrising to n-dimensional spherical coordinates and then integrating over the radial coordinate.

In spherical coordinates with radial component

r\in[0,infty)

and angles

\boldsymbol\theta=(\theta1,...,\thetan-1)\in[0,\pi]n x [0,2\pi)

, a point

\boldsymbolx=(x1,...,xn)\in\Rn

can be written as

\boldsymbolx=r\boldsymbolv

, with

\lVert\boldsymbolv\rVert=1

. The joint density becomes

p(r,\boldsymbol\theta|\boldsymbol\mu,\boldsymbol\Sigma)=

rn-1
\sqrt{|\boldsymbol\Sigma|

(2

n
2
\pi)
}e^

and the density of

l{PN}n(\boldsymbol\mu,\boldsymbol\Sigma)

can then be obtained as

p(\boldsymbol\theta|\boldsymbol\mu,\boldsymbol\Sigma)=

infty
\int
0

p(r,\boldsymbol\theta|\boldsymbol\mu,\boldsymbol\Sigma)dr.

The same density had been previously obtained in Pukkila and Rao (1988, Eq. (2.4)) using a different notation.

Circular distribution

Parametrising the position on the unit circle in polar coordinates as

\boldsymbolv=(\cos\theta,\sin\theta)

, the density function can be written with respect to the parameters

\boldsymbol\mu

and

\boldsymbol\Sigma

of the initial normal distribution as

p(\theta|\boldsymbol\mu,\boldsymbol\Sigma)=

-1\boldsymbol\mu\top\boldsymbol\Sigma-1\boldsymbol\mu
2
e
2\pi\sqrt{|\boldsymbol\Sigma|

\boldsymbolv\top\boldsymbol\Sigma-1\boldsymbolv} \left(1+T(\theta)

\Phi(T(\theta))
\phi(T(\theta))

\right)I[0,(\theta)

where

\phi

and

\Phi

are the density and cumulative distribution of a standard normal distribution,

T(\theta)=

\boldsymbolv\top\boldsymbol\Sigma-1\boldsymbol\mu
\sqrt{\boldsymbolv\top\boldsymbol\Sigma-1\boldsymbolv
}, and

I

is the indicator function.

In the circular case, if the mean vector

\boldsymbol\mu

is parallel to the eigenvector associated to the largest eigenvalue of the covariance, the distribution is symmetric and has a mode at

\theta=\alpha

and either a mode or an antimode at

\theta=\alpha+\pi

, where

\alpha

is the polar angle of

\boldsymbol\mu=(r\cos\alpha,r\sin\alpha)

. If the mean is parallel to the eigenvector associated to the smallest eigenvalue instead, the distribution is also symmetric but has either a mode or an antimode at

\theta=\alpha

and an antimode at

\theta=\alpha+\pi

.

Spherical distribution

Parametrising the position on the unit sphere in spherical coordinates as

\boldsymbolv=(\cos\theta1\sin\theta2,\sin\theta1\sin\theta2,\cos\theta2)

where

\boldsymbol\theta=(\theta1,\theta2)

are the azimuth

\theta1\in[0,2\pi)

and inclination

\theta2\in[0,\pi]

angles respectively, the density function becomes

p(\boldsymbol\theta|\boldsymbol\mu,\boldsymbol\Sigma)=

-1\boldsymbol\mu\top\boldsymbol\Sigma-1\boldsymbol\mu
2
e
\sqrt{|\boldsymbol\Sigma|

\left(2\pi\boldsymbolv\top\boldsymbol\Sigma-1\boldsymbolv

3
2
\right)
}\left(\frac + T(\boldsymbol \theta) \left(1 + T(\boldsymbol \theta) \frac \right) \right)I_