Analysis of Error Propagation in Floating Point Computations: Round-off Errors | Lecture notes Mathematics for Computing

1.6 Round-off errors in floating point computations.

1.6.1 Round-off errors.

When people or computers do computations with floating point numbers, they usually round the result of each

arithmetic operation to a certain fixed number of digits of precision. This introduces additional errors into the

final result called round-off errors. Usually round-off errors are insignificant compared to errors in

measurement or truncation errors, but sometimes they will actually be larger. This is the case when the result of

an addition or subtraction is significantly smaller in magnitude than the numbers which one is adding or

subtracting. In some cases the round-off errors can be serious enough to cause the final result to be

meaningless.

Example 1. An object moves along a straight line so that its position x at time t is given by x = t3. Let

to = 10 and t1 = 10 + h be two times and xo = to3 = 103 = 1000 and x1 = t13 = (10+h)3 be the corresponding

positions. The displacement F044x is the change in position, i.e. F044x = x1 – xo = (10+h)3 - 1000. Suppose

h = 0.014.

a. Compute F044x exactly.

b. Compute F044x doing the calculations using four digit decimal floating point arithmetic. What is the error in

the result?

c. An alternative formula for F044x is F044x = 3to2h + 3toh2 + h3 = 300h + 30h2 + h3. Compute F044x using this

alternative formula again doing the calculations using four digit decimal floating point arithmetic. What is

the error in the result? How does this compare with part b?

Solution. a. Compute F044x exactly.

10 + h = 10 + 0.014 = 10.014

(10 + h)2 = (10.014)2 = 100.280196

(10 + h)3 = (100.280196)(10.014) = 1004.205882744

F044x = 1004.205882744 – 1000 = 4.205882744

b. Compute F044x rounding results to four digits after each operation. In the following F 0 A E indicates

rounding and a subscript a indicates an approximate value.

10 + h = 10.014 F 0 A E (10 + h)a = 10.01

[(10 + h)a]2 = (10.01)2 = 100.2001 F 0 A E [(10 + h)2]a = 100.2

[(10 + h)2]a (10 + h)a = (100.2)(10.01) = 1003.002 F 0 A E [(10 + h)3]a = 1003

[(10 + h)3]a - 1000 = 1003 – 1000 = 3 F0 A E [ F044x]a = 3

Absolute error = 4.205882744 - 3 = 1.205882744

Relative error = 1.205882744/3 F 0 B B 0.4 = 40%

c. Compute F044x using the alternative formula.

10 + h = 10.014 F 0 A E (10 + h)a = 10.01

300h = (300)(0.014) = 4.2

h2 = (0.014)2 = 0.000196

30h2 = (30)(0.000196) = 0.00588

h3 = 0.000002744

300h + 30h2 + h3 = 4.205882744 F 0 A E [ F044x]a = 4.206

Absolute error = 4.205882744 - 4.206 = 0.000117256

Relative error = 0.000117256/4.206 F 0 B B 0.00003 = 0.003%

.6.1 - 1

Analysis of Error Propagation in Floating Point Computations: Round-off Errors, Lecture notes of Mathematics for Computing