In mathematics, the prime number theorem (PNT) describes the asymptotic distribution of the prime numbers among the positive integers. It formalizes the intuitive idea that primes become less common as they become larger by precisely quantifying the rate at which this occurs. The theorem was proved independently by Jacques Hadamard and Charles Jean de la Vallée Poussin in 1896 using ideas introduced by Bernhard Riemann (in particular, the Riemann zeta function).
The first such distribution found is $\pi(N)\sim\frac{N}{\log(N)}$, where $\pi(N)$ is the prime-counting function (the number of primes less than or equal to $N$) and $\log(N)$ is the natural logarithm of $N$. This means that for large enough $N$, the probability that a random integer not greater than $N$ is prime is very close to $1/\log(N)$. Consequently, a random integer with at most $2n$ digits (for large enough $n$) is about half as likely to be prime as a random integer with at most $n$ digits. For example, among the positive integers of at most 1000 digits, about one in 2300 is prime ($\log(10^{1000})\approx 2302.6$), whereas among positive integers of at most 2000 digits, about one in 4600 is prime ($\log(10^{2000})\approx 4605.2$). In other words, the average gap between consecutive prime numbers among the first $N$ integers is roughly $\log(N)$.[1]
Let $\pi(x)$ be the prime-counting function defined to be the number of primes less than or equal to $x$, for any real number $x$. For example, $\pi(10)=4$ because there are four prime numbers (2, 3, 5 and 7) less than or equal to 10. The prime number theorem then states that $x/\log x$ is a good approximation to $\pi(x)$ (where log here means the natural logarithm), in the sense that the limit of the quotient of the two functions $\pi(x)$ and $x/\log x$ as $x$ increases without bound is 1:
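$$\lim_{x\to\infty}\frac{\pi(x)}{\left[\frac{x}{\log(x)}\right]}=1,$$
known as the asymptotic law of distribution of prime numbers. Using asymptotic notation, this result can be restated as
$$\pi(x)\sim\frac{x}{\log x}.$$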
This notation (and the theorem) does not say anything about the limit of the difference of the two functions as $x$ increases without bound. Instead, the theorem states that $x/\log x$ approximates $\pi(x)$ in the sense that the relative error of this approximation approaches 0 as $x$ increases without bound.
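The statement about the quotient can be checked numerically for small values. The following is a minimal sketch, not an efficient prime-counting method: the helper names `prime_count` and `pnt_ratio` are illustrative choices, and the sieve is only practical for modest limits.

```python
import math


def prime_count(limit: int) -> int:
    """Return pi(limit), the number of primes <= limit, via a sieve of Eratosthenes."""
    if limit < 2:
        return 0
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    for p in range(2, math.isqrt(limit) + 1):
        if is_prime[p]:
            # Cross out p*p, p*p + p, ... (smaller multiples were handled earlier).
            is_prime[p * p :: p] = bytearray(len(range(p * p, limit + 1, p)))
    return sum(is_prime)


def pnt_ratio(x: int) -> float:
    """Return pi(x) / (x / log x); the theorem says this tends to 1."""
    return prime_count(x) * math.log(x) / x


for exponent in range(1, 7):
    x = 10 ** exponent
    print(f"x = 10^{exponent}: pi(x) = {prime_count(x)}, ratio = {pnt_ratio(x):.4f}")
```

The ratio approaches 1 only slowly, which is consistent with the theorem being a statement about relative error rather than about the difference of the two functions.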
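The prime number theorem is equivalent to the statement that the $n$th prime number $p_n$ satisfies
$$p_n\sim n\log(n),$$
the asymptotic notation meaning, again, that the relative error of this approximation approaches 0 as $n$ increases without bound.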
On the other hand, the following asymptotic relations are logically equivalent:[3]
\begin{align}
\lim_{x\to\infty}\frac{\pi(x)\log x}{x}&=1,\text{ and}\\
\lim_{x\to\infty}\frac{\pi(x)\log\pi(x)}{x}&=1.
\end{align}
As outlined below, the prime number theorem is also equivalent to
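$$\lim_{x\to\infty}\frac{\vartheta(x)}{x}=\lim_{x\to\infty}\frac{\psi(x)}{x}=1,$$
where $\vartheta$ and $\psi$ are the first and the second Chebyshev functions respectively, and to
$$\lim_{x\to\infty}\frac{M(x)}{x}=0,$$
where $M(x)=\sum_{n\le x}\mu(n)$ is the Mertens function and $\mu$ is the Möbius function.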
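The summatory function $M(x)=\sum_{n\le x}\mu(n)$ appearing in the last equivalence is easy to tabulate for small arguments. The sketch below is only an illustration under simple assumptions: the function name `mertens` is hypothetical, and the Möbius values are obtained with a straightforward sieve rather than any optimized method.

```python
def mertens(limit: int) -> int:
    """Return M(limit), the sum of the Moebius function mu(n) for 1 <= n <= limit."""
    mu = [1] * (limit + 1)
    is_prime = [True] * (limit + 1)
    for p in range(2, limit + 1):
        if is_prime[p]:
            # Each prime divisor flips the sign of mu on its multiples.
            for multiple in range(p, limit + 1, p):
                if multiple > p:
                    is_prime[multiple] = False
                mu[multiple] = -mu[multiple]
            # Numbers divisible by p^2 are not squarefree, so mu vanishes there.
            for multiple in range(p * p, limit + 1, p * p):
                mu[multiple] = 0
    return sum(mu[1:])


for x in (10, 100, 1000, 10_000):
    print(x, mertens(x), mertens(x) / x)
```

The printed quotients are small compared with 1, in line with the claim that $M(x)/x$ tends to 0.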
Based on the tables by Anton Felkel and Jurij Vega, Adrien-Marie Legendre conjectured in 1797 or 1798 that $\pi(a)$ is approximated by the function $a/(A\log a+B)$, where $A$ and $B$ are unspecified constants. In the second edition of his book on number theory (1808) he then made a more precise conjecture, with $A=1$ and $B=-1.08366$. Carl Friedrich Gauss considered the same question at age 15 or 16 "in the year 1792 or 1793", according to his own recollection in 1849.[4] In 1838 Peter Gustav Lejeune Dirichlet came up with his own approximating function, the logarithmic integral $\operatorname{li}(x)$ (under the slightly different form of a series, which he communicated to Gauss). Both Legendre's and Dirichlet's formulas imply the same conjectured asymptotic equivalence of $\pi(x)$ and $x/\log(x)$ stated above, although it turned out that Dirichlet's approximation is considerably better if one considers the differences instead of quotients.
In two papers from 1848 and 1850, the Russian mathematician Pafnuty Chebyshev attempted to prove the asymptotic law of distribution of prime numbers. His work is notable for the use of the zeta function $\zeta(s)$, for real values of the argument "$s$", as in works of Leonhard Euler, as early as 1737. Chebyshev's papers predated Riemann's celebrated memoir of 1859, and he succeeded in proving a slightly weaker form of the asymptotic law, namely, that if the limit as $x$ goes to infinity of $\pi(x)/(x/\log(x))$ exists at all, then it is necessarily equal to one.[5] He was able to prove unconditionally that this ratio is bounded below by 0.92129 and above by 1.10555, for all sufficiently large $x$.[6] Although Chebyshev's paper did not prove the Prime Number Theorem, his estimates for $\pi(x)$ were strong enough for him to prove Bertrand's postulate that there exists a prime number between $n$ and $2n$ for any integer $n\ge2$.
An important paper concerning the distribution of prime numbers was Riemann's 1859 memoir "On the Number of Primes Less Than a Given Magnitude", the only paper he ever wrote on the subject. Riemann introduced new ideas into the subject, chiefly that the distribution of prime numbers is intimately connected with the zeros of the analytically extended Riemann zeta function of a complex variable. In particular, it is in this paper that the idea to apply methods of complex analysis to the study of the real function $\pi(x)$ originates. Extending Riemann's ideas, two proofs of the asymptotic law of the distribution of prime numbers were found independently by Jacques Hadamard and Charles Jean de la Vallée Poussin and appeared in the same year (1896). Both proofs used methods from complex analysis, establishing as a main step of the proof that the Riemann zeta function $\zeta(s)$ is nonzero for all complex values of the variable $s$ that have the form $s=1+it$ with $t>0$.[7]
During the 20th century, the theorem of Hadamard and de la Vallée Poussin also became known as the Prime Number Theorem. Several different proofs of it were found, including the "elementary" proofs of Atle Selberg and Paul Erdős (1949). Hadamard's and de la Vallée Poussin's original proofs are long and elaborate; later proofs introduced various simplifications through the use of Tauberian theorems but remained difficult to digest. A short proof was discovered in 1980 by the American mathematician Donald J. Newman.[8] [9] Newman's proof is arguably the simplest known proof of the theorem, although it is non-elementary in the sense that it uses Cauchy's integral theorem from complex analysis.
Here is a sketch of the proof referred to in one of Terence Tao's lectures.[10] Like most proofs of the PNT, it starts out by reformulating the problem in terms of a less intuitive, but better-behaved, prime-counting function. The idea is to count the primes (or a related set such as the set of prime powers) with weights to arrive at a function with smoother asymptotic behavior. The most common such generalized counting function is the Chebyshev function $\psi(x)$, defined by
$$\psi(x)=\sum_{k\ge1}\ \sum_{\substack{p^k\le x\\ p\text{ is prime}}}\log p.$$
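This is sometimes written as
$$\psi(x)=\sum_{n\le x}\Lambda(n),$$
where $\Lambda(n)$ is the von Mangoldt function, namely
$$\Lambda(n)=\begin{cases}\log p&\text{if }n=p^k\text{ for some prime }p\text{ and integer }k\ge1,\\ 0&\text{otherwise.}\end{cases}$$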
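It is now relatively easy to check that the PNT is equivalent to the claim that
$$\lim_{x\to\infty}\frac{\psi(x)}{x}=1.$$
Indeed, this follows from the easy estimates
$$\psi(x)=\sum_{\substack{p\le x\\ p\text{ is prime}}}\log p\left\lfloor\frac{\log x}{\log p}\right\rfloor\le\sum_{\substack{p\le x\\ p\text{ is prime}}}\log x=\pi(x)\log x$$
and (using big O notation) for any $\varepsilon>0$,
$$\psi(x)\ge\sum_{\substack{x^{1-\varepsilon}\le p\le x\\ p\text{ is prime}}}\log p\ge\sum_{\substack{x^{1-\varepsilon}\le p\le x\\ p\text{ is prime}}}(1-\varepsilon)\log x=(1-\varepsilon)\left(\pi(x)+O\left(x^{1-\varepsilon}\right)\right)\log x.$$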
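To make the weighted count concrete, the sketch below evaluates the Chebyshev function by adding $\log p$ once for every prime power $p^k\le x$. It is a minimal illustration, assuming a plain sieve is acceptable; the name `chebyshev_psi` is an illustrative choice.

```python
import math


def chebyshev_psi(x: int) -> float:
    """Return psi(x): the sum of log p over all prime powers p^k <= x."""
    if x < 2:
        return 0.0
    is_prime = bytearray([1]) * (x + 1)
    is_prime[0] = is_prime[1] = 0
    for p in range(2, math.isqrt(x) + 1):
        if is_prime[p]:
            is_prime[p * p :: p] = bytearray(len(range(p * p, x + 1, p)))
    total = 0.0
    for p in range(2, x + 1):
        if is_prime[p]:
            # Add log p for each of p, p^2, p^3, ... not exceeding x.
            power = p
            while power <= x:
                total += math.log(p)
                power *= p
    return total


for x in (10, 1000, 100_000):
    print(x, chebyshev_psi(x), chebyshev_psi(x) / x)
```

For $x=10$ the prime powers are 2, 4, 8, 3, 9, 5 and 7, so $\psi(10)=\log(2^3\cdot3^2\cdot5\cdot7)=\log 2520$.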
The next step is to find a useful representation for $\psi(x)$. Let $\zeta(s)$ be the Riemann zeta function. It can be shown that $\zeta(s)$ is related to the von Mangoldt function $\Lambda(n)$, and hence to $\psi(x)$, via the relation
$$-\frac{\zeta'(s)}{\zeta(s)}=\sum_{n=1}^{\infty}\Lambda(n)\,n^{-s}.$$
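A delicate analysis of this equation and related properties of the zeta function, using the Mellin transform and Perron's formula, shows that for non-integer $x$ the equation
$$\psi(x)=x-\log(2\pi)-\sum_{\rho:\,\zeta(\rho)=0}\frac{x^{\rho}}{\rho}$$
holds, where the sum is over all zeros (trivial and nontrivial) of the zeta function. The term $x$, claimed to be the correct asymptotic order of $\psi(x)$, appears on the right-hand side, followed by terms that are presumably of lower order.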
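The next step in the proof involves a study of the zeros of the zeta function. The trivial zeros −2, −4, −6, −8, ... can be handled separately:
$$\sum_{n=1}^{\infty}\frac{1}{2n\,x^{2n}}=-\frac{1}{2}\log\left(1-\frac{1}{x^{2}}\right),$$
which vanishes for large $x$. A nontrivial zero $\rho$ with $\operatorname{Re}(\rho)=1$ would contribute a term $x^{\rho}/\rho$ of the same order as the main term $x$, so we need to show that all zeros have real part strictly less than 1.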
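To do this, we take for granted that $\zeta(s)$ is meromorphic in the half-plane $\operatorname{Re}(s)>0$, and is analytic there except for a simple pole at $s=1$, and that there is a product formula
$$\zeta(s)=\prod_{p}\frac{1}{1-p^{-s}}$$
for $\operatorname{Re}(s)>1$. This product formula follows from the existence of unique prime factorization of integers, and shows that $\zeta(s)$ is never zero in this region, so that its logarithm is defined there and
$$\log\zeta(s)=-\sum_{p}\log\left(1-p^{-s}\right)=\sum_{p,n}\frac{p^{-ns}}{n}.$$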
Write $s=x+iy$; then
$$|\zeta(x+iy)|=\exp\left(\sum_{n,p}\frac{\cos(ny\log p)}{n\,p^{nx}}\right).$$
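Now observe the identity
$$3+4\cos\phi+\cos2\phi=2(1+\cos\phi)^{2}\ge0,$$
so that
$$\left|\zeta(x)^{3}\zeta(x+iy)^{4}\zeta(x+2iy)\right|=\exp\left(\sum_{n,p}\frac{3+4\cos(ny\log p)+\cos(2ny\log p)}{n\,p^{nx}}\right)\ge1$$
for all $x>1$. Suppose now that $\zeta(1+iy)=0$. Certainly $y$ is not zero, since $\zeta(s)$ has a simple pole at $s=1$. Let $x$ tend to 1 from above. Since $\zeta(s)$ has a simple pole at $s=1$ and $\zeta(x+2iy)$ stays analytic, the left-hand side of the previous inequality tends to 0, a contradiction.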
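The trigonometric identity used in this argument follows from the double-angle formula $\cos2\phi=2\cos^{2}\phi-1$; the short derivation below spells out the steps:

```latex
\begin{align}
3+4\cos\phi+\cos2\phi
  &= 3+4\cos\phi+\left(2\cos^{2}\phi-1\right)\\
  &= 2+4\cos\phi+2\cos^{2}\phi\\
  &= 2\left(1+\cos\phi\right)^{2}\ \ge\ 0 .
\end{align}
```

Applying it with $\phi=ny\log p$ shows that every term of the sum in the exponent is non-negative, which is what gives the lower bound 1.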
Finally, we can conclude that the PNT is heuristically true. To rigorously complete the proof there are still serious technicalities to overcome, due to the fact that the summation over zeta zeros in the explicit formula for $\psi(x)$ does not converge absolutely but only conditionally and in a "principal value" sense. There are several ways around this problem but many of them require rather delicate complex-analytic estimates. Edwards's book[11] provides the details. Another method is to use Ikehara's Tauberian theorem, though this theorem is itself quite hard to prove. D. J. Newman observed that the full strength of Ikehara's theorem is not needed for the prime number theorem, and one can get away with a special case that is much easier to prove.
D. J. Newman gives a quick proof of the prime number theorem (PNT). The proof is "non-elementary" by virtue of relying on complex analysis, but uses only elementary techniques from a first course in the subject: Cauchy's integral formula, Cauchy's integral theorem and estimates of complex integrals. Here is a brief sketch of this proof; see the cited references for the complete details.
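The proof uses the same preliminaries as in the previous section except that, instead of the function $\psi$, the Chebyshev function $\vartheta(x)=\sum_{p\le x}\log p$ is used, which is obtained by dropping some of the terms from the series for $\psi$. Similar to the argument in the previous proof based on Tao's lecture, we can show that $\vartheta(x)\le\pi(x)\log x$, and $\vartheta(x)\ge(1-\varepsilon)\left(\pi(x)+O\left(x^{1-\varepsilon}\right)\right)\log x$ for any $0<\varepsilon<1$. Thus, the PNT is equivalent to $\lim_{x\to\infty}\vartheta(x)/x=1$. Likewise, instead of $-\frac{\zeta'(s)}{\zeta(s)}$ the function $\Phi(s)=\sum_{p}\log p\,p^{-s}$ is used, which is obtained by dropping some terms in the series for $-\frac{\zeta'(s)}{\zeta(s)}$. The functions $\Phi(s)$ and $-\zeta'(s)/\zeta(s)$ differ by a function holomorphic on $\operatorname{Re}s=1$. Since, as was shown in the previous section, $\zeta(s)$ has no zeros on the line $\operatorname{Re}s=1$, the function $\Phi(s)-\frac{1}{s-1}$ has no singularities on $\operatorname{Re}s=1$.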
One further piece of information needed in Newman's proof, and which is the key to the estimates in his simple method, is that $\vartheta(x)/x$ is bounded. This can be proved by an elementary method due to Chebyshev.
Integration by parts shows how $\vartheta(x)$ and $\Phi(s)$ are related. For $\operatorname{Re}s>1$,
$$\Phi(s)=\int_{1}^{\infty}x^{-s}\,d\vartheta(x)=s\int_{1}^{\infty}\vartheta(x)\,x^{-s-1}\,dx=s\int_{0}^{\infty}\vartheta(e^{t})\,e^{-st}\,dt.$$
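Newman's method proves the PNT by showing that the integral
$$I=\int_{0}^{\infty}\left(\frac{\vartheta(e^{t})}{e^{t}}-1\right)dt$$
converges, and therefore the integrand goes to zero as $t\to\infty$, which is the PNT. In general, the convergence of an improper integral does not imply that the integrand goes to zero at infinity, since it may oscillate, but since $\vartheta$ is increasing, it is easy to show in this case.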
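To show the convergence of $I$, for $\operatorname{Re}z>0$ let
$$g_{T}(z)=\int_{0}^{T}f(t)\,e^{-zt}\,dt\quad\text{and}\quad g(z)=\int_{0}^{\infty}f(t)\,e^{-zt}\,dt,\quad\text{where}\quad f(t)=\frac{\vartheta(e^{t})}{e^{t}}-1.$$
Then
$$\lim_{T\to\infty}g_{T}(z)=g(z)=\frac{\Phi(s)}{s}-\frac{1}{s-1}\quad\text{where }z=s-1,$$
which is equal to a function holomorphic on the line $\operatorname{Re}z=0$.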
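The convergence of the integral $I$, and thus the PNT, is proved by showing that $\lim_{T\to\infty}g_{T}(0)=g(0)$. This involves an interchange of limits, since the claim can be written as $\lim_{T\to\infty}\lim_{z\to0}g_{T}(z)=\lim_{z\to0}\lim_{T\to\infty}g_{T}(z)$, and the result is therefore classified as a Tauberian theorem.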
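The difference $g(0)-g_{T}(0)$ is expressed using Cauchy's integral formula and then shown to be small for $T$ large by estimating the integrand. Fix $R>0$ and $\delta>0$ such that $g(z)$ is holomorphic in the region where $|z|\le R$ and $\operatorname{Re}z\ge-\delta$, and let $C$ be the boundary of this region. Since 0 is in the interior of the region, Cauchy's integral formula gives
$$g(0)-g_{T}(0)=\frac{1}{2\pi i}\int_{C}\left(g(z)-g_{T}(z)\right)\frac{dz}{z}=\frac{1}{2\pi i}\int_{C}\left(g(z)-g_{T}(z)\right)F(z)\frac{dz}{z},$$
where $F(z)=e^{zT}\left(1+\frac{z^{2}}{R^{2}}\right)$ is the factor introduced by Newman, which does not change the integral since $F$ is entire and $F(0)=1$.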
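To estimate the integral, break the contour $C$ into two parts, $C=C_{+}+C_{-}$, where $C_{+}=C\cap\left\{z\mid\operatorname{Re}z>0\right\}$ and $C_{-}=C\cap\left\{z\mid\operatorname{Re}z\le0\right\}$. Then
$$g(0)-g_{T}(0)=\int_{C_{+}}\int_{T}^{\infty}H(t,z)\,dt\,dz-\int_{C_{-}}\int_{0}^{T}H(t,z)\,dt\,dz+\int_{C_{-}}g(z)F(z)\frac{dz}{2\pi iz},$$
where $H(t,z)=f(t)\,e^{-tz}F(z)/(2\pi iz)$. Since $\vartheta(x)/x$, and hence $f(t)$, is bounded, let $B$ be an upper bound for the absolute value of $f(t)$. This bound together with the estimate $|F|\le2\exp(T\operatorname{Re}z)\,|\operatorname{Re}z|/R$ for $|z|=R$ gives that the first integral is $\le B/R$ in absolute value. The integrand over $C_{-}$ in the second integral is entire, so by Cauchy's integral theorem the contour $C_{-}$ can be deformed to a semicircle of radius $R$ in the left half-plane without changing the integral, and the same argument as for the first integral shows that the absolute value of the second integral is $\le B/R$. Finally, letting $T\to\infty$, the third integral goes to zero since $e^{zT}$, and hence $F$, goes to zero on the contour. Combining the two estimates and the limit gives
$$\limsup_{T\to\infty}\left|g(0)-g_{T}(0)\right|\le\frac{2B}{R}.$$
This holds for any $R$, so $\lim_{T\to\infty}g_{T}(0)=g(0)$, and the PNT follows.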
In a handwritten note on a reprint of his 1838 paper "Sur l'usage des séries infinies dans la théorie des nombres", which he mailed to Gauss, Dirichlet conjectured (under a slightly different form appealing to a series rather than an integral) that an even better approximation to $\pi(x)$ is given by the offset logarithmic integral function $\operatorname{Li}(x)$, defined by
$$\operatorname{Li}(x)=\int_{2}^{x}\frac{dt}{\log t}=\operatorname{li}(x)-\operatorname{li}(2).$$
Indeed, this integral is strongly suggestive of the notion that the "density" of primes around $t$ should be $1/\log t$. This function is related to the logarithm by the asymptotic expansion
$$\operatorname{Li}(x)\sim\frac{x}{\log x}\sum_{k=0}^{\infty}\frac{k!}{(\log x)^{k}}=\frac{x}{\log x}+\frac{x}{(\log x)^{2}}+\frac{2x}{(\log x)^{3}}+\cdots$$
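The quality of the logarithmic-integral approximation can be illustrated numerically. The following sketch approximates $\operatorname{Li}(x)$ with Simpson's rule after the substitution $t=e^{u}$; the function name `offset_log_integral`, the step count, and the choice of quadrature are assumptions made for illustration. The values of $\pi(x)$ are the ones listed in this article's table.

```python
import math


def offset_log_integral(x: float, steps: int = 20_000) -> float:
    """Approximate Li(x) = integral from 2 to x of dt / log t using Simpson's rule."""
    # Substituting t = e^u turns the integrand into e^u / u on [log 2, log x].
    lower, upper = math.log(2.0), math.log(x)
    if steps % 2:
        steps += 1  # Simpson's rule needs an even number of subintervals.
    h = (upper - lower) / steps

    def integrand(u: float) -> float:
        return math.exp(u) / u

    total = integrand(lower) + integrand(upper)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(lower + i * h)
    return total * h / 3


# pi(x) values taken from the comparison table in this article.
KNOWN_PI = {10**3: 168, 10**6: 78_498}

for x, pi_x in KNOWN_PI.items():
    print(x, pi_x, round(x / math.log(x), 1), round(offset_log_integral(x), 1))
```

For both sample points the integral lands much closer to $\pi(x)$ than $x/\log x$ does, as the text describes.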
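So, the prime number theorem can also be written as $\pi(x)\sim\operatorname{Li}(x)$. In fact, in another paper in 1899 de la Vallée Poussin proved that
$$\pi(x)=\operatorname{Li}(x)+O\left(xe^{-a\sqrt{\log x}}\right)\quad\text{as }x\to\infty$$
for some positive constant $a$, where $O(\ldots)$ is the big O notation. This has been improved to
$$\pi(x)=\operatorname{li}(x)+O\left(x\exp\left(-\frac{A(\log x)^{3/5}}{(\log\log x)^{1/5}}\right)\right),$$
where $A=0.2098$.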
In 2016, Trudgian proved an explicit upper bound for the difference between $\pi(x)$ and $\operatorname{li}(x)$:
$$\left|\pi(x)-\operatorname{li}(x)\right|\le0.2795\,\frac{x}{(\log x)^{3/4}}\exp\left(-\sqrt{\frac{\log x}{6.455}}\right)$$
for $x\ge229$.
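The connection between the Riemann zeta function and $\pi(x)$ is one reason the Riemann hypothesis has considerable importance in number theory: if established, it would yield a far better estimate of the error involved in the prime number theorem than is available today. More specifically, Helge von Koch showed in 1901[14] that if the Riemann hypothesis is true, the error term in the above relation can be improved to
$$\pi(x)=\operatorname{Li}(x)+O\left(\sqrt{x}\log x\right).$$
Assuming the Riemann hypothesis, Lowell Schoenfeld made the constant in the big O notation explicit:
$$\left|\pi(x)-\operatorname{li}(x)\right|<\frac{\sqrt{x}\log x}{8\pi}$$
for all $x\ge2657$. He also derived a similar bound for the Chebyshev prime-counting function $\psi$:
$$\left|\psi(x)-x\right|<\frac{\sqrt{x}\,(\log x)^{2}}{8\pi}$$
for all $x\ge73.2$. In the other direction, Littlewood showed that the error cannot be much smaller than $\sqrt{x}$:
$$\left|\pi(x)-\operatorname{li}(x)\right|=\Omega\left(\sqrt{x}\,\frac{\log\log\log x}{\log x}\right).$$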
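The logarithmic integral $\operatorname{li}(x)$ is larger than $\pi(x)$ for "small" values of $x$. This is because it is (in some sense) counting not primes, but prime powers, where a power $p^{n}$ of a prime $p$ is counted as $\tfrac{1}{n}$ of a prime. This suggests that $\operatorname{li}(x)$ should usually be larger than $\pi(x)$ by roughly
$$\tfrac{1}{2}\operatorname{li}(\sqrt{x}),$$
and in particular should always be larger than $\pi(x)$. However, in 1914, Littlewood proved that $\pi(x)-\operatorname{li}(x)$ changes sign infinitely often.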
In the first half of the twentieth century, some mathematicians (notably G. H. Hardy) believed that there exists a hierarchy of proof methods in mathematics depending on what sorts of numbers (integers, reals, complex) a proof requires, and that the prime number theorem (PNT) is a "deep" theorem by virtue of requiring complex analysis.[19] This belief was somewhat shaken by a proof of the PNT based on Wiener's Tauberian theorem, though Wiener's proof ultimately relies on properties of the Riemann zeta function on the line $\operatorname{Re}(s)=1$.
In March 1948, Atle Selberg established, by "elementary" means, the asymptotic formula
$$\vartheta(x)\log(x)+\sum_{p\le x}\log(p)\,\vartheta\left(\frac{x}{p}\right)=2x\log(x)+O(x),$$
where
$$\vartheta(x)=\sum_{p\le x}\log(p)$$
for primes $p$.
There is some debate about the significance of Erdős and Selberg's result. There is no rigorous and widely accepted definition of the notion of elementary proof in number theory, so it is not clear exactly in what sense their proof is "elementary". Although it does not use complex analysis, it is in fact much more technical than the standard proof of PNT. One possible definition of an "elementary" proof is "one that can be carried out in first-order Peano arithmetic." There are number-theoretic statements (for example, the Paris–Harrington theorem) provable using second-order but not first-order methods, but such theorems are rare to date. Erdős and Selberg's proof can certainly be formalized in Peano arithmetic, and in 1994, Charalambos Cornaros and Costas Dimitracopoulos proved that their proof can be formalized in a very weak fragment of PA, namely $I\Delta_{0}+\exp$.[21] However, this does not address the question of whether or not the standard proof of PNT can be formalized in PA.
A more recent "elementary" proof of the prime number theorem uses ergodic theory, due to Florian Richter.[22] The prime number theorem is obtained there in an equivalent form that the Cesàro sum of the values of the Liouville function is zero. The Liouville function is
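$(-1)^{\omega(n)}$, where $\omega(n)$ is the number of prime factors, counted with multiplicity, of the integer $n$. This form of the prime number theorem is then obtained from an ergodic theorem: let $X$ be a compact metric space, $T$ a continuous self-map of $X$, and $\mu$ a $T$-invariant Borel probability measure for which $T$ is uniquely ergodic. Then, for every $f\in C(X)$,
$$\frac{1}{N}\sum_{n=1}^{N}f\left(T^{\omega(n)}x\right)\to\int_{X}f\,d\mu\quad\text{for all }x\in X.$$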
In 2005, Avigad et al. employed the Isabelle theorem prover to devise a computer-verified variant of the Erdős–Selberg proof of the PNT.[23] This was the first machine-verified proof of the PNT. Avigad chose to formalize the Erdős–Selberg proof rather than an analytic one because while Isabelle's library at the time could implement the notions of limit, derivative, and transcendental function, it had almost no theory of integration to speak of.[23]
In 2009, John Harrison employed HOL Light to formalize a proof employing complex analysis.[24] By developing the necessary analytic machinery, including the Cauchy integral formula, Harrison was able to formalize "a direct, modern and elegant proof instead of the more involved 'elementary' Erdős–Selberg argument".
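Let $\pi_{d,a}(x)$ denote the number of primes in the arithmetic progression $a,a+d,a+2d,a+3d,\ldots$ that are less than $x$. Dirichlet and Legendre conjectured, and de la Vallée Poussin proved, that if $a$ and $d$ are coprime, then
$$\pi_{d,a}(x)\sim\frac{\operatorname{Li}(x)}{\varphi(d)},$$
where $\varphi$ is Euler's totient function. In other words, the primes are distributed evenly among the residue classes $[a]$ modulo $d$ with $\gcd(a,d)=1$.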
The Siegel–Walfisz theorem gives a good estimate for the distribution of primes in residue classes.
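Bennett et al.[26] proved the following estimate that has explicit constants $A$ and $B$ (Theorem 1.3): Let $d\ge3$ be an integer and let $a$ be an integer that is coprime to $d$. Then there are positive constants $A$ and $B$ such that
$$\left|\pi_{d,a}(x)-\frac{\operatorname{Li}(x)}{\varphi(d)}\right|<\frac{A\,x}{(\log x)^{2}}\quad\text{for all }x\ge B,$$
where
$$A=\frac{1}{840}\ \text{ if }3\le d\le10^{4}\quad\text{and}\quad A=\frac{1}{160}\ \text{ if }d>10^{4},$$
and
$$B=8\cdot10^{9}\ \text{ if }3\le d\le10^{5}\quad\text{and}\quad B=\exp\left(0.03\sqrt{d}\,(\log d)^{3}\right)\ \text{ if }d>10^{5}.$$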
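Although we have in particular
$$\pi_{4,1}(x)\sim\pi_{4,3}(x),$$
empirically the primes congruent to 3 modulo 4 are more numerous and are nearly always ahead in this "prime number race"; the first reversal occurs at $x=26861$. However, Littlewood showed in 1914 that there are infinitely many sign changes for the function
$$\pi_{4,1}(x)-\pi_{4,3}(x),$$
so the lead in the race switches back and forth infinitely many times.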
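The comparison of the two residue classes modulo 4 can be explored with a short experiment. The sketch below counts primes congruent to 1 and to 3 modulo 4 side by side and reports the first point at which the class 1 takes the lead; the function name `first_lead_of_class_one` and the search limit are illustrative assumptions.

```python
import math
from typing import Optional


def first_lead_of_class_one(limit: int) -> Optional[int]:
    """Return the smallest x <= limit with pi_{4,1}(x) > pi_{4,3}(x), or None."""
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    for p in range(2, math.isqrt(limit) + 1):
        if is_prime[p]:
            is_prime[p * p :: p] = bytearray(len(range(p * p, limit + 1, p)))
    ones = threes = 0  # running counts of primes = 1 and = 3 (mod 4)
    for n in range(3, limit + 1, 2):
        if is_prime[n]:
            if n % 4 == 1:
                ones += 1
            else:
                threes += 1
            if ones > threes:
                return n
    return None


print(first_lead_of_class_one(30_000))
```

Below the returned value the class 3 modulo 4 is never behind, which illustrates how lopsided the race is for small $x$ even though the two counts are asymptotically equal.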
The prime number theorem is an asymptotic result. It gives an ineffective bound on $\pi(x)$ as a direct consequence of the definition of the limit: for all $\varepsilon>0$, there is an $S$ such that for all $x>S$,
$$(1-\varepsilon)\frac{x}{\log x}<\pi(x)<(1+\varepsilon)\frac{x}{\log x}.$$
However, better bounds on $\pi(x)$ are known, for instance Pierre Dusart's
$$\frac{x}{\log x}\left(1+\frac{1}{\log x}\right)<\pi(x)<\frac{x}{\log x}\left(1+\frac{1}{\log x}+\frac{2.51}{(\log x)^{2}}\right).$$
The first inequality holds for all $x\ge599$ and the second one for $x\ge355991$.
The proof by de la Vallée Poussin implies the following bound: for every $\varepsilon>0$, there is an $S$ such that for all $x>S$,
$$\frac{x}{\log x-(1-\varepsilon)}<\pi(x)<\frac{x}{\log x-(1+\varepsilon)}.$$
The value $\varepsilon=3$ gives a weak but sometimes useful bound for $x\ge55$:[31]
$$\frac{x}{\log x+2}<\pi(x)<\frac{x}{\log x-4}.$$
In Pierre Dusart's thesis there are stronger versions of this type of inequality that are valid for larger $x$. Later in 2010, Dusart proved:[32]
\begin{align}
\frac{x}{\log x-1}&<\pi(x)&&\text{for }x\ge5393,\text{ and}\\
\pi(x)&<\frac{x}{\log x-1.1}&&\text{for }x\ge60184.
\end{align}
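Explicit bounds of this kind can be spot-checked against exact prime counts. The following is a minimal sketch that samples values of $x$ and tests the two inequalities $x/(\log x-1)<\pi(x)$ for $x\ge5393$ and $\pi(x)<x/(\log x-1.1)$ for $x\ge60184$; the function names and the sampling step are assumptions, and a passing check is of course no substitute for the proof.

```python
import math


def prefix_prime_counts(limit: int) -> list:
    """Return a list `counts` with counts[x] = pi(x) for 0 <= x <= limit."""
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    for p in range(2, math.isqrt(limit) + 1):
        if is_prime[p]:
            is_prime[p * p :: p] = bytearray(len(range(p * p, limit + 1, p)))
    counts, running = [], 0
    for flag in is_prime:
        running += flag
        counts.append(running)
    return counts


def dusart_bounds_hold(limit: int, step: int = 997) -> bool:
    """Check both inequalities at sampled points x = 5393, 5393 + step, ... <= limit."""
    pi = prefix_prime_counts(limit)
    for x in range(5393, limit + 1, step):
        if not x / (math.log(x) - 1) < pi[x]:
            return False  # lower bound violated
        if x >= 60184 and not pi[x] < x / (math.log(x) - 1.1):
            return False  # upper bound violated
    return True


print(dusart_bounds_hold(1_000_000))
```

Sampling keeps the run time small; replacing `step` by 1 tests every integer in the range at a higher cost.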
Note that the first of these obsoletes the $\varepsilon>0$ condition on the lower bound.
As a consequence of the prime number theorem, one gets an asymptotic expression for the $n$th prime number, denoted by $p_{n}$:
$$p_{n}\sim n\log n.$$
A better approximation is
$$\frac{p_{n}}{n}=\log n+\log\log n-1+\frac{\log\log n-2}{\log n}-\frac{(\log\log n)^{2}-6\log\log n+11}{2(\log n)^{2}}+o\left(\frac{1}{(\log n)^{2}}\right).$$
Rosser's theorem states that
$$p_{n}>n\log n.$$
This can be improved by the following pair of bounds:
\begin{align}
\log n+\log\log n-1&<\frac{p_{n}}{n}&&\text{for }n\ge2,\\
\frac{p_{n}}{n}&<\log n+\log\log n-0.9484&&\text{for }n\ge39017.
\end{align}
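The bounds on $p_{n}/n$ can be compared with actual primes for moderate $n$. The sketch below is a rough check under stated assumptions: the names `primes_up_to` and `nth_prime_bounds_hold` are hypothetical, and the sieve size relies on the classical estimate $p_{n}<n(\log n+\log\log n)$ for $n\ge6$.

```python
import math


def primes_up_to(limit: int) -> list:
    """Return all primes <= limit in increasing order."""
    is_prime = bytearray([1]) * (limit + 1)
    is_prime[0] = is_prime[1] = 0
    for p in range(2, math.isqrt(limit) + 1):
        if is_prime[p]:
            is_prime[p * p :: p] = bytearray(len(range(p * p, limit + 1, p)))
    return [n for n, flag in enumerate(is_prime) if flag]


def nth_prime_bounds_hold(max_n: int) -> bool:
    """Check the lower bound for 2 <= n <= max_n and the upper bound for n >= 39017."""
    # Sieve far enough to contain p_{max_n}; 15 covers the cases max_n < 6.
    limit = max(15, int(max_n * (math.log(max_n) + math.log(math.log(max_n)))) + 1)
    primes = primes_up_to(limit)
    for n in range(2, max_n + 1):
        ratio = primes[n - 1] / n  # p_n / n, with primes[0] = p_1 = 2
        base = math.log(n) + math.log(math.log(n))
        if not base - 1 < ratio:
            return False
        if n >= 39017 and not ratio < base - 0.9484:
            return False
    return True


print(nth_prime_bounds_hold(100_000))
```

The gap between the two bounds is only about $0.05\,n$, so for $n$ in this range they pin down $p_{n}$ to within a fraction of a percent.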
The table compares exact values of $\pi(x)$ to the two approximations $x/\log x$ and $\operatorname{li}(x)$. The approximation difference columns are rounded to the nearest integer, but the "% error" columns are computed based on the unrounded approximations. The last column, $x/\pi(x)$, is the average prime gap below $x$.
$x$ | $\pi(x)$ | $\pi(x)-x/\log x$ | $\operatorname{li}(x)-\pi(x)$ | % error of $x/\log x$ | % error of $\operatorname{li}(x)$ | $x/\pi(x)$
---|---|---|---|---|---|---
$10$ | 4 | 0 | 2 | 8.22% | 42.606% | 2.500
$10^{2}$ | 25 | 3 | 5 | 14.06% | 18.597% | 4.000
$10^{3}$ | 168 | 23 | 10 | 14.85% | 5.561% | 5.952
$10^{4}$ | 1,229 | 143 | 17 | 12.37% | 1.384% | 8.137
$10^{5}$ | 9,592 | 906 | 38 | 9.91% | 0.393% | 10.425
$10^{6}$ | 78,498 | 6,116 | 130 | 8.11% | 0.164% | 12.739
$10^{7}$ | 664,579 | 44,158 | 339 | 6.87% | 0.051% | 15.047
$10^{8}$ | 5,761,455 | 332,774 | 754 | 5.94% | 0.013% | 17.357
$10^{9}$ | 50,847,534 | 2,592,592 | 1,701 | 5.23% | $3.34\times10^{-3}$% | 19.667
$10^{10}$ | 455,052,511 | 20,758,029 | 3,104 | 4.66% | $6.82\times10^{-4}$% | 21.975
$10^{11}$ | 4,118,054,813 | 169,923,159 | 11,588 | 4.21% | $2.81\times10^{-4}$% | 24.283
$10^{12}$ | 37,607,912,018 | 1,416,705,193 | 38,263 | 3.83% | $1.02\times10^{-4}$% | 26.590
$10^{13}$ | 346,065,536,839 | 11,992,858,452 | 108,971 | 3.52% | $3.14\times10^{-5}$% | 28.896
$10^{14}$ |  | 102,838,308,636 | 314,890 | 3.26% | $9.82\times10^{-6}$% | 31.202
$10^{15}$ |  | 891,604,962,452 | 1,052,619 | 3.03% | $3.52\times10^{-6}$% | 33.507
$10^{16}$ |  |  | 3,214,632 | 2.83% | $1.15\times10^{-6}$% | 35.812
$10^{17}$ |  |  | 7,956,589 | 2.66% | $3.03\times10^{-7}$% | 38.116
$10^{18}$ |  |  | 21,949,555 | 2.51% | $8.87\times10^{-8}$% | 40.420
$10^{19}$ |  |  | 99,877,775 | 2.36% | $4.26\times10^{-8}$% | 42.725
$10^{20}$ |  |  | 222,744,644 | 2.24% | $1.01\times10^{-8}$% | 45.028
$10^{21}$ |  |  | 597,394,254 | 2.13% | $2.82\times10^{-9}$% | 47.332
$10^{22}$ |  |  | 1,932,355,208 | 2.03% | $9.59\times10^{-10}$% | 49.636
$10^{23}$ |  |  | 7,250,186,216 | 1.94% | $3.76\times10^{-10}$% | 51.939
$10^{24}$ |  |  | 17,146,907,278 | 1.86% | $9.31\times10^{-11}$% | 54.243
$10^{25}$ |  |  | 55,160,980,939 | 1.78% | $3.21\times10^{-11}$% | 56.546
$10^{26}$ |  |  | 155,891,678,121 | 1.71% | $9.17\times10^{-12}$% | 58.850
$10^{27}$ |  |  | 508,666,658,006 | 1.64% | $3.11\times10^{-12}$% | 61.153
$10^{28}$ |  |  |  | 1.58% | $9.05\times10^{-13}$% | 63.456
$10^{29}$ |  |  |  | 1.53% | $2.99\times10^{-13}$% | 65.759
The value for $\pi(10^{24})$ was originally computed assuming the Riemann hypothesis;[36] it has since been verified unconditionally.[37]
There is an analogue of the prime number theorem that describes the "distribution" of irreducible polynomials over a finite field; the form it takes is strikingly similar to the case of the classical prime number theorem.
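To state it precisely, let $F=\operatorname{GF}(q)$ be the finite field with $q$ elements, for some fixed $q$, and let $N_{n}$ be the number of monic irreducible polynomials over $F$ whose degree is equal to $n$. That is, we are looking at polynomials with coefficients chosen from $F$, which cannot be written as products of polynomials of smaller degree. In this setting, these polynomials play the role of the prime numbers, since all other monic polynomials are built up of products of them. One can then prove that
$$N_{n}\sim\frac{q^{n}}{n}.$$
If we make the substitution $x=q^{n}$, then the right-hand side is just
$$\frac{x}{\log_{q}x},$$
which makes the analogy clearer. Since there are precisely $q^{n}$ monic polynomials of degree $n$ (including the reducible ones), this can be rephrased as follows: if a monic polynomial of degree $n$ is selected randomly, then the probability of it being irreducible is about $\tfrac{1}{n}$.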
One can even prove an analogue of the Riemann hypothesis, namely that
$$N_{n}=\frac{q^{n}}{n}+O\left(\frac{q^{n/2}}{n}\right).$$
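The proofs of these statements are far simpler than in the classical case. They involve a short, combinatorial argument,[38] summarised as follows: every element of the degree $n$ extension of $F$ is a root of some irreducible polynomial whose degree $d$ divides $n$; by counting these roots in two different ways one establishes that
$$q^{n}=\sum_{d\mid n}d\,N_{d},$$
where the sum is over all divisors $d$ of $n$. Möbius inversion then yields
$$N_{n}=\frac{1}{n}\sum_{d\mid n}\mu\left(\frac{n}{d}\right)q^{d},$$
where $\mu(k)$ is the Möbius function. The main term occurs for $d=n$, and it is not difficult to bound the remaining terms. The "Riemann hypothesis" statement depends on the fact that the largest proper divisor of $n$ can be no larger than $n/2$.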
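The Möbius-inversion formula for the number of monic irreducible polynomials is simple enough to evaluate directly. The sketch below is an illustration only; the function names `mobius` and `count_monic_irreducibles` are hypothetical, and the Möbius function is computed by plain trial division.

```python
def mobius(k: int) -> int:
    """Return the Moebius function mu(k) for k >= 1 using trial division."""
    result = 1
    p = 2
    while p * p <= k:
        if k % p == 0:
            k //= p
            if k % p == 0:
                return 0  # p^2 divides the original argument
            result = -result
        p += 1
    if k > 1:
        result = -result  # one remaining prime factor
    return result


def count_monic_irreducibles(q: int, n: int) -> int:
    """Return N_n = (1/n) * sum over d | n of mu(n/d) * q^d."""
    total = sum(mobius(n // d) * q ** d for d in range(1, n + 1) if n % d == 0)
    return total // n  # the sum is always divisible by n


for n in range(1, 9):
    exact = count_monic_irreducibles(2, n)
    print(n, exact, round(2 ** n / n, 2))
```

Over the field with two elements the counts for degrees 1 through 8 are 2, 1, 2, 3, 6, 9, 18 and 30, and they track $q^{n}/n$ closely, as the asymptotic statement predicts.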