You're reading: Irregulars

Cushing your luck: properties of randomly chosen numbers

Long-time Aperiodical muse David Cushing has made a bet with us that he can give us an interesting post every Friday for the next ten weeks. Every week that he sends a post, we buy him a bar of chocolate. Every week that he doesn’t send us a post, he buys us a bar of chocolate. For his first trick, David is going to do some unnatural things with the natural numbers.

The greatest common divisor (gcd) of two or more integers is the greatest integer that evenly divides those integers. For example, the gcd of 8 and 12 is 4 (usually written as gcd(8,12)=4). Two integers are called coprime (or “relatively prime”) if their gcd is equal to 1.

A reasonable question to ask is,

Given two randomly chosen integers a and b, what is the probability that gcd(a,b)=1?

Randomly chosen integers” is a bit of a dodgy term but we’ll just ignore that and hope for the best.

If no prime divides n, then n=1, by the fundamental theorem of arithmetic. So we want to find the probability that no prime divides gcd(a,b). Let a and b be two randomly chosen integers and p a fixed prime. If p divides both a and b then gcd(a,b)p. We are interested in the probability that this does not occur for any p.

Now I need to say what I mean by “randomly chosen”. It’s very handwavey, but we will calculate the probability for an integer chosen uniformly at random between 1 and N, and then let N tend to infinity. We won’t actually do that, but that’s what we’re thinking of.

Every pth integer is divisible by p, so the probability that p divides a and b is 1p2, and the probability that it divides at most one of a or b is 11p2. That’s a good start, but what we want is this probability over all primes. The probabilities for each prime are independent of each other, so you can just multiply them all together. Therefore the probability that gcd(a,b)=1 is

Pr(gcd(a,b)=1)=p prime(11p2).

We now have the problem of evaluating this number. For those in the know, our expression looks pretty darn close to the definition of the Riemann zeta function:

ζ(s)=p prime(11ps)1=n=1ns.

(We used the Euler product formula to relate the product we had with the usual definition of the zeta function, which is a sum.)

By comparing the two equations, we can see that Pr(gcd(a,b)=1)=1ζ(2). Thankfully, ζ(2) is simply the sum of the reciprocals of the square numbers, which we know is π26 ((This infinite sum is the subject of the Basel problem, solved by Euler in 1735)). Therefore the probability that two randomly chosen integers are coprime is the reciprocal of that: 6π2. So we are done!

Pr(gcd(a,b)=1)=6π20.6079

Well, sort of. Now that we know the probability the gcd of two randomly chosen integers is 1, why not ask a few more general questions? What is the probability that the gcd of two randomly chosen integers is a prime number p? More generally, what is the probability that the gcd of two randomly chosen integers is some kN, and what’s the probability that it’s any prime?

First of all, what’s the probability that gcd(a,b) is a prime p? That means:

  • p divides both a and b
  • p2, and any other prime, only divide at most one of a and b.

As we said before, Pr(pgcd(a,b))=1p2. We want to exclude the cases when p2 divides both a and b, which happens 1p4 of the time. For any other prime q, we want the same thing as before: Pr(qgcd(a,b))=11q2. So, putting it all together, we have:

Pr(gcd(a,b)=p)=(1p21p4)qp prime(11q2)=1p2(11p2)qp prime(11q2)=1p2q prime(11q2)=6p2π2.

Next, we can apply a similar argument to find the probability that gcd(a,b)=pn. pn divides both numbers 1(pn)2 of the time, and we need to exclude the case where pn+1 divides both numbers, which happens 1(pn+1)2 of the time. Like last time, we can take a factor of 1p2n out of this, to get:

Pr(gcd(a,b)=pn)=1p2nq prime(11q2)=6(pn)2π2

It should be clear that you can take a couple of steps from here to derive that, for any kN,

Pr(gcd(a,b)=k)=6k2π2

So what’s the probability that the gcd of two randomly chosen integers is any prime? Well, that’s simply

p primePr(gcd(a,b)=p)=p prime6p2π2=6π2p prime1p2

To proceed any further we need to evaluate the sum of the reciprocals of the squares of the primes. When we did that with all the natural numbers, we used ζ(2). It turns out that there’s a related function which sums just over the primes, called the prime zeta function:

P(s)=p primeps.

Replacing our sum with P(2), we get:

Pr(gcd(a,b) prime)=6π2P(2)6π2×0.452240.274914.

So about a quarter.

Let’s end with one more generalisation. Let’s say we have 3 randomly chosen integers. No, scrap that. Let’s say we have 4 randomly chosen integers. No, that’s still not right… I know – let’s say we have n randomly chosen integers!

Using similar arguments as above we can get that the probability of the gcd of n integers chosen at random to be prime is P(n)ζ(n).

A nice simple(ish) answer and we are all done.

References

Riemann zeta function

Proof of the Euler product formula for the Riemann zeta function

Prime zeta function

There are three different proofs that n1n2=π26 in Proofs from THE BOOK by Martin Aigner and Günter Ziegler. This was known as the Basel problem. Some more history is given in the comprehensive historical survey The Ubiquitous π by Dario Castellanos.

The Wolfram MathWorld page on “relatively prime” contains the excellent observation that 6π2 is the fraction of points in the 2d lattice that you can see when standing at the origin. It also gives probabilities for pairs of numbers chosen from a few different principal ideal domains.

“On the distribution of the greatest common divisor” by Persi Diaconis and Paul Erdős does all this rigorously, with error terms and everything.

One Response to “Cushing your luck: properties of randomly chosen numbers”

(will not be published)

LATEX: You can use LaTeX in your comments. e.g. $ e^{\pi i} $ for inline maths; \[ e^{\pi i} \] for display-mode (on its own line) maths.

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>