Zipf-Mandelbrot law

The Zipf-Mandelbrot law (also known as the Pareto-Zipf law) is a power-law distribution on ranked data, named after the Harvard linguistic professor George Kingsley Zipf (1902-1950) who suggested regularity in texts, and the mathematician Benoit Mandelbrot (born November 20, 1924), who generalized it.

The distribution of words ranked by their frequency in a random corpus of text is generally a power-law distribution, known as Zipf's law.

If one plots the frequency rank of words contained in a large corpus of text data versus the number of occurences or actual frequencies, one obtains a power-law distribution, with exponent close to one (but see Gelbukh and Sidoro 2001).

External References

Z. K. Silagadze: Citations and the Zipf-Mandelbrot's law
NIST: Zipf's law
W. Li's References on Zipf's law