Karatsuba algorithm

This is an old revision of this page, as edited by Jorge Stolfi (talk | contribs) at 07:26, 29 September 2008 (Generlal rewording; concrete analysis instead of just big-O;). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The Karatsuba multiplication algorithm, a technique for quickly multiplying large numbers, was discovered by Anatolii Alexeevich Karatsuba in 1960 and published in the joint paper with Yu. Ofman in 1962. It reduces the multiplication of two n-digit numbers to single-digit multiplications. This makes it faster than the classical algorithm, which requires n2 single-digit products. If n = 210 = 1024, for example, the counts are 310 = 59,049 and (210)2 = 1,048,576, respectively.

The Toom-Cook algorithm is a faster generalization of Karatsuba's. For sufficiently large n, Karatsuba's algorithm is beaten by the Schönhage-Strassen algorithm.

The Karatsuba algorithm is a notable example of the divide and conquer paradigm, specifically of binary splitting.

Algorithm

The basic step of Karatsuba's algorithm is a formula that allows us to compute the product of two large numbers x and y using three multiplications of smaller numbers, each with about half as many digits as x or y, plus some additions and digit shifts.

Let x and y be represented as n-digit strings in some base B. For any positive integer m less than n, one can split the two given numbers as follows

x = x1Bm + x0
y = y1Bm + y0

where x0 and y0 are less than Bm. The product is then

xy = (x1Bm + x0)(y1Bm + y0)
= z2 B2m + z1 Bm + z0

where

z2 = x1y1
z1 = x1y0 + x0y1
z0 = x0y0

These formulas require four multiplications. Karatsuba observed that we can compute xy in only three multiplications, at the cost of a few extra additions:

Let z2 = x1y1
Let z0 = x0y0
Let z1 = (x1 + x0)(y1 + y0) - z2 - z0

since

z1 = (x1y1 + x1y0 + x0y1 + x0y0) - x1y1 - x0y0 = x1y0 + x0y1

To compute these three products of m-digit numbers, we can recursively employ Karatsuba multiplication again, until the numbers become so small that their product can be computed directly. One typically chooses B so that this condition is verified when the numbers are 1 digit each. In a computer with a full 32-bit by 32-bit multiplier, for example, one could choose B = 232 or B = 109, and store each digit as a separate 32-bit binary word.

Example

Say we want to compute the product of 1234 and 5678. We choose B = 10 and m = 2. We have

12 34 = 12 × 102 + 34
56 78 = 56 × 102 + 78
z2 = 12 × 56 = 672
z0 = 34 × 78 = 2652
z1 = (12 + 34)(56 + 78) - z2 - z0 = 46 × 134 - 672 - 2652 = 2840
result = z2 × 102×2 + z1 × 102 + z0 = 672 0000 + 2840 00 + 2652 = 7006652

Efficiency analysis

Karatsuba's basic step works for any base B and any m, but the recursive algorithm is most efficient when, and m is equal to n/2, rounded up. In particular, if n is 2k, for some integer k, and the recursion stops only when n is 1, then the number of single-digit multiplications is 3k, which is nc where c = log23.

Since one can extend any inputs with zero digits until their length is a power of two, it follows that the number of elementary multiplications, for any n, is at most  .

Since the additions, subtractions, and digit shifts (multiplications by powers of B) in Karasuba's basic step take time proportional to n, their cost of all becomes negligible as n increases. More precisely, if t(n) denotes the total number of elementary operations that the algorithm performs when multiplying two n-digit numbers, then we can write

t(n) = 3 t( n/2 ) + cn + d

for some constants c and d. For this recurrence relation, the master theorem gives the asymptotic bound t(n) = Θ(nlog(3)/log(2)).

It follows that, for sufficiently large n, Karatsuba's algorithm will perform fewer shifts and single-digit additions than longhand multiplication, even though its basic step uses more additions and shifts than the straightforward formula. For small values of n, however, the extra shift and add operations may make it run slower than the longhand method. The point of positive return depends on the computer platform and context. As a rule of thumb, Karatsuba is usually faster when n is 10 or more [1][2]

References