Chapter 5 Floating-Point Arithmetic

Ulrich Kulisch

Chapter

Requires Authentication

Chapter 5 Floating-Point Arithmetic

Ulrich Kulisch
Ulrich Kulisch

Search for this author in:

De Gruyter De Gruyter Brill | Google Scholar

Published by

Become an author with De Gruyter Brill

How to publish with us

This chapter is in the book Computer Arithmetic and Validity

Chapter 5Floating-Point ArithmeticThus far our considerations have proceededunder the general assumptionthat the setRin Figure 1 is a linearly ordered ringoid. We are now goingto be more specific and assume thatRis the linearly orderedfield of realnumbers and that the subsetsDandSare special screens, which are calledfloating-point systems. In Section 5.1 we brieflyreviewthedefinition ofthe real numbers and their representation byb-adic expansions. Section 5.2deals withfloating-point numbers and systems. We also discuss severalbasic roundings of the real numbers into afloating-point system, and wederive error bounds for these roundings.In Section 5.3 we considerfloating-point operations defined by semi-morphisms, and we derive error bounds for these operations. Then weconsider the operations defined in the product sets listed underSandDin Figure 1, and we derive error formulas and bounds for these operationsalso. All this is done under the assumption that the operations are definedby semimorphisms.For scalar product and matrix multiplication, we also derive error for-mulas and bounds for the conventional definition of the operations. Theerror formulas are simpler if the operations are defined by semimorphisms,and the error bounds are smaller by a factor of at leastncompared to thebounds obtained by the conventional definition of the operations. These twoproperties are reproduced in the error analysis of many algorithms.Finally we review the so-called IEEEfloating-point arithmetic standard.5.1 Definition and Properties of the Real NumbersTwo methods of defining the real numbers are most commonly used. These may becalled the constructive method and the axiomatic method. We have mentioned theseconcepts already in the introduction of this book. For clarity we briefly consider theaxiomatic method here.The real numbers{R,+,·,≤}can be defined as a conditionally complete linearlyorderedfield, i.e.,I{R,+,·}is afield.II{R,≤}is a conditionally complete, linearly ordered set.

You are currently not able to access this content.

Not sure if you should have access? Please log in using an institutional account to see if you have access to view or download this content.

Chapter 5Floating-Point ArithmeticThus far our considerations have proceededunder the general assumptionthat the setRin Figure 1 is a linearly ordered ringoid. We are now goingto be more specific and assume thatRis the linearly orderedfield of realnumbers and that the subsetsDandSare special screens, which are calledfloating-point systems. In Section 5.1 we brieflyreviewthedefinition ofthe real numbers and their representation byb-adic expansions. Section 5.2deals withfloating-point numbers and systems. We also discuss severalbasic roundings of the real numbers into afloating-point system, and wederive error bounds for these roundings.In Section 5.3 we considerfloating-point operations defined by semi-morphisms, and we derive error bounds for these operations. Then weconsider the operations defined in the product sets listed underSandDin Figure 1, and we derive error formulas and bounds for these operationsalso. All this is done under the assumption that the operations are definedby semimorphisms.For scalar product and matrix multiplication, we also derive error for-mulas and bounds for the conventional definition of the operations. Theerror formulas are simpler if the operations are defined by semimorphisms,and the error bounds are smaller by a factor of at leastncompared to thebounds obtained by the conventional definition of the operations. These twoproperties are reproduced in the error analysis of many algorithms.Finally we review the so-called IEEEfloating-point arithmetic standard.5.1 Definition and Properties of the Real NumbersTwo methods of defining the real numbers are most commonly used. These may becalled the constructive method and the axiomatic method. We have mentioned theseconcepts already in the introduction of this book. For clarity we briefly consider theaxiomatic method here.The real numbers{R,+,·,≤}can be defined as a conditionally complete linearlyorderedfield, i.e.,I{R,+,·}is afield.II{R,≤}is a conditionally complete, linearly ordered set.

You are currently not able to access this content.

Not sure if you should have access? Please log in using an institutional account to see if you have access to view or download this content.