Difference between revisions of "Trailing zeroes in factorial"

From PEGWiki
Jump to: navigation, search
(Created page with "A well-known programming exercise involves finding the number of trailing zeroes at the end of the decimal representation of <math>n!</math>, for some given value of <math>n</mat...")
(No difference)

Revision as of 19:28, 28 February 2012

A well-known programming exercise involves finding the number of trailing zeroes at the end of the decimal representation of n!, for some given value of n.[1][2]

One possible approach, of course, is to explicitly calculate the factorial. The only problem with this is that n! may be very large for reasonably sized values of n. For example, 100! > 10^{100}, so bignum arithmetic is needed for this kind of approach. This tends to be slow and inexpedient.

A much cleverer solution—one that does not involve calculating the factorial explicitly—can be obtained by noting that what is really desired is the largest k such that 10^k divides n!. The prime factorization of 10^k is 2^k 5^k, so really all we need to do is find out how many factors of 2 are in n!, and how many factors of 5 are in n!, and the smaller of the two gives the answer. For example, 10! has prime factorization 2^8 \cdot 3^4 \cdot 5^2 \cdot 7, so it contains 8 factors of 2 and 2 factors of 5. Consistent with these figures, there are 2 zeroes at the end of its decimal representation (3628800).

How do we calculate this? A workable approach is to simply determine how many factors of 2 and 5 are in each of the numbers 1, 2, ..., n. When we multiply those numbers to get n!, we of course add the exponents in their prime factorizations; so the sum of the numbers of factors of 2 in each of the integers 1, 2, ..., n gives us the number of factors of 2 in n!; and we do something similar for factors of 5. Determining how many times 2 or 5 divides a number is as easy as actually performing the division over and over again (until a non-integer result is obtained).

There is a cleverer solution yet, though. When we multiply all the numbers from 1 to n, exactly \lfloor n/2 \rfloor of them will be even, so each one of those contributes a factor of 2. However, the ones that are divisible by 4 actually contribute two factors of 2, so we have to add \lfloor n/4 \rfloor to the result (we counted those numbers once in \lfloor n/2 \rfloor, and now we are counting them again in \lfloor n/4 \rfloor). But then there may be some numbers that are divisible by 8, which contribute at least three factors of 2, so we have to add \lfloor n/8 \rfloor; and so on until we reach some power of 2 that is greater than n (so that it obviously divides none of 1, 2, ..., n). We can then do something similar for factors of 5. This gives us a formula:

k = \min\left(\left\lfloor \frac{n}{2} \right\rfloor + \left\lfloor \frac{n}{4} \right\rfloor + \left\lfloor \frac{n}{8} \right\rfloor + ..., \left\lfloor \frac{n}{5} \right\rfloor + \left\lfloor \frac{n}{25} \right\rfloor + \left\lfloor \frac{n}{125} \right\rfloor + ...\right)

It is now fairly obvious by inspection that the second argument—the number of factors of 5—is always the smaller of the two.

This formula can be used as-is, hence:

#include <stdio.h>
int main()
{
    int n;
    scanf("%d", &n);
    int p = 5;
    int k = 0;
    while (p <= n)
    {
        k += n/p;
        p *= 5;
    }
    printf("There are %d zeroes at the end of %d!\n", k, n);
    return 0;
}

The formula can also be re-worked into the following form. The mathematical reasoning behind the correctness of this implementation is left as an exercise for the reader.

#include <stdio.h>
int main()
{
    int n;
    scanf("%d", &n);
    int k = 0;
    while (n > 0)
        k += n /= 5;
    printf("There are %d zeroes at the end of %d!\n", k, n);
    return 0;
}

Functional style (Haskell):

zeroes n = sum $ tail $ takeWhile (>0) (iterate (flip div 5) n)

Bases other than 10 may be trickier to handle. For example, consider the problem of finding the number of zeroes at the end of n! when expressed in base 12 (with prime factorization 2^2 \cdot 3). In this case we have to count factors of 2 and factors of 3, but it takes two factors of 2 to make one factor of 12, so we have to divide by two and then compare it with the number of factors of 3 (the smaller of the two results will be the answer). It's not obvious which factor will "run out" first, so it might be necessary to find the number of times each prime factor in the base appears in n!. The algorithm shown above still works for each individual prime factor, though (provided that "5" is replaced with the appropriate number).

References

  1. Factorial
  2. Task 3: Zeros