Stochastic matrix explained

In mathematics, a stochastic matrix is a square matrix used to describe the transitions of a Markov chain. Each of its entries is a nonnegative real number representing a probability.[1] [2] It is also called a probability matrix, transition matrix, substitution matrix, or Markov matrix. The stochastic matrix was first developed by Andrey Markov at the beginning of the 20th century, and has found use throughout a wide variety of scientific fields, including probability theory, statistics, mathematical finance and linear algebra, as well as computer science and population genetics. There are several different definitions and types of stochastic matrices:

\le1.

In the same vein, one may define a probability vector as a vector whose elements are nonnegative real numbers which sum to 1. Thus, each row of a right stochastic matrix (or column of a left stochastic matrix) is a probability vector. Right stochastic matrices act upon row vectors of probabilities by multiplication from the right (hence their name) and the matrix entry in the -th row and -th column is the probability of transition from state to state . Left stochastic matrices act upon column vectors of probabilities by multiplication from the left (hence their name) and the matrix entry in the -th row and -th column is the probability of transition from state to state .

This article uses the right/row stochastic matrix convention.

History

The stochastic matrix was developed alongside the Markov chain by Andrey Markov, a Russian mathematician and professor at St. Petersburg University who first published on the topic in 1906.[3] His initial intended uses were for linguistic analysis and other mathematical subjects like card shuffling, but both Markov chains and matrices rapidly found use in other fields.[4]

Stochastic matrices were further developed by scholars such as Andrey Kolmogorov, who expanded their possibilities by allowing for continuous-time Markov processes.[5] By the 1950s, articles using stochastic matrices had appeared in the fields of econometrics[6] and circuit theory.[7] In the 1960s, stochastic matrices appeared in an even wider variety of scientific works, from behavioral science[8] to geology[9] [10] to residential planning.[11] In addition, much mathematical work was also done through these decades to improve the range of uses and functionality of the stochastic matrix and Markovian processes more generally.

From the 1970s to present, stochastic matrices have found use in almost every field that requires formal analysis, from structural science[12] to medical diagnosis[13] to personnel management.[14] In addition, stochastic matrices have found wide use in land change modeling, usually under the term Markov matrix.[15]

Definition and properties

A stochastic matrix describes a Markov chain over a finite state space with cardinality .

If the probability of moving from to in one time step is, the stochastic matrix is given by using as the -th row and -th column element, e.g.,

P=\left[\begin{matrix} P1,1&P1,2&...&P1,j&...&P1,\alpha\\ P2,1&P2,2&...&P2,j&...&P2,\alpha\\ \vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ Pi,1&Pi,2&...&Pi,j&...&Pi,\alpha\\ \vdots&\vdots&\ddots&\vdots&\ddots&\vdots\\ P\alpha,1&P\alpha,2&...&P\alpha,j&...&P\alpha,\alpha\\ \end{matrix}\right].

Since the total of transition probability from a state to all other states must be 1,

\alpha
\sum
j=1

Pi,j=1;

thus this matrix is a right stochastic matrix.

The above elementwise sum across each row of may be more concisely written as, where is the -dimensional column vector of all ones. Using this, it can be seen that the product of two right stochastic matrices and is also right stochastic: . In general, the -th power of a right stochastic matrix is also right stochastic. The probability of transitioning from to in two steps is then given by the -th element of the square of :

\left(P2\right)i,j.

In general, the probability transition of going from any state to another state in a finite Markov chain given by the matrix in steps is given by .

An initial probability distribution of states, specifying where the system might be initially and with what probabilities, is given as a row vector.

A stationary probability vector is defined as a distribution, written as a row vector, that does not change under application of the transition matrix; that is, it is defined as a probability distribution on the set

Notes and References

  1. Book: 10.1007/0-387-21525-5_1 . S. R. . Asmussen. Markov Chains . Applied Probability and Queues . Stochastic Modelling and Applied Probability . 51 . 3–8 . 2003 . 978-0-387-00211-8 .
  2. Book: Lawler, Gregory F. . Greg Lawler . Introduction to Stochastic Processes . CRC Press . 2006 . 1-58488-651-X . 2nd . en.
  3. Hayes . Brian . 2013 . First links in the Markov chain . American Scientist . 101 . 2. 92–96 . 10.1511/2013.101.92 .
  4. Charles Miller Grinstead; James Laurie Snell (1997). Introduction to Probability. American Mathematical Soc. pp. 464–466. .
  5. Kendall . D. G. . Batchelor . G. K. . Bingham . N. H. . Hayman . W. K. . Hyland . J. M. E. . Lorentz . G. G. . Moffatt . H. K. . Parry . W. . Razborov . A. A. . Robinson . C. A. . Whittle . P. . 1990 . Andrei Nikolaevich Kolmogorov (1903–1987) . Bulletin of the London Mathematical Society . 22 . 1. 33 . 10.1112/blms/22.1.31 .
  6. Solow. Robert. 1 January 1952. On the Structure of Linear Models. Econometrica. 20. 1. 29–46. 10.2307/1907805. 1907805.
  7. Sittler. R.. 1 December 1956. Systems Analysis of Discrete Markov Processes. IRE Transactions on Circuit Theory. 3. 4. 257–266. 10.1109/TCT.1956.1086324. 0096-2007.
  8. Evans. Selby. 1 July 1967. Vargus 7: Computed patterns from markov processes. Behavioral Science. en. 12. 4. 323–328. 10.1002/bs.3830120407. 1099-1743.
  9. Gingerich. P. D.. 1 January 1969. Markov analysis of cyclic alluvial sediments. Journal of Sedimentary Research. en-US. 39. 1. 330–332. 10.1306/74d71c4e-2b21-11d7-8648000102c1865d. 1527-1404. 1969JSedR..39..330G.
  10. Krumbein. W. C.. Dacey. Michael F.. 1 March 1969. Markov chains and embedded Markov chains in geology. Journal of the International Association for Mathematical Geology. en. 1. 1. 79–96. 10.1007/BF02047072. 0020-5958.
  11. Wolfe. Harry B.. 1 May 1967. Models for Conditioning Aging of Residential Structures. Journal of the American Institute of Planners. 33. 3. 192–196. 10.1080/01944366708977915. 0002-8991.
  12. A Markov matrix for fatigue load simulation and rainflow range evaluation . en. 10.1016/0167-4730(89)90025-8. 6. 2–4. Structural Safety. 247–258 . Krenk . S.. November 1989 .
  13. Beck. J.Robert. Pauker. Stephen G.. 1 December 1983. The Markov Process in Medical Prognosis. Medical Decision Making. en. 3. 4. 419–458. 10.1177/0272989X8300300403. 6668990. 0272-989X.
  14. Gotz. Glenn A.. McCall. John J.. 1 March 1983. Sequential Analysis of the Stay/Leave Decision: U.S. Air Force Officers. Management Science. 29. 3. 335–351. 10.1287/mnsc.29.3.335. 0025-1909.
  15. Kamusoko. Courage. Aniya. Masamu. Adi. Bongo. Manjoro. Munyaradzi. 1 July 2009. Rural sustainability under threat in Zimbabwe – Simulation of future land use/cover changes in the Bindura district based on the Markov-cellular automata model. Applied Geography. 29. 3. 435–447. 10.1016/j.apgeog.2008.10.002.