Home Chapter 5 Floating-Point Arithmetic
Chapter
Licensed
Unlicensed Requires Authentication

Chapter 5 Floating-Point Arithmetic

Become an author with De Gruyter Brill
Computer Arithmetic and Validity
This chapter is in the book Computer Arithmetic and Validity
Chapter 5Floating-Point ArithmeticThus far our considerations have proceededunder the general assumptionthat the setRin Figure 1 is a linearly ordered ringoid. We are now goingto be more specific and assume thatRis the linearly orderedfield of realnumbers and that the subsetsDandSare special screens, which are calledfloating-point systems. In Section 5.1 we brieflyreviewthedefinition ofthe real numbers and their representation byb-adic expansions. Section 5.2deals withfloating-point numbers and systems. We also discuss severalbasic roundings of the real numbers into afloating-point system, and wederive error bounds for these roundings.In Section 5.3 we considerfloating-point operations defined by semi-morphisms, and we derive error bounds for these operations. Then weconsider the operations defined in the product sets listed underSandDin Figure 1, and we derive error formulas and bounds for these operationsalso. All this is done under the assumption that the operations are definedby semimorphisms.For scalar product and matrix multiplication, we also derive error for-mulas and bounds for the conventional definition of the operations. Theerror formulas are simpler if the operations are defined by semimorphisms,and the error bounds are smaller by a factor of at leastncompared to thebounds obtained by the conventional definition of the operations. These twoproperties are reproduced in the error analysis of many algorithms.Finally we review the so-called IEEEfloating-point arithmetic standard.5.1 Definition and Properties of the Real NumbersTwo methods of defining the real numbers are most commonly used. These may becalled the constructive method and the axiomatic method. We have mentioned theseconcepts already in the introduction of this book. For clarity we briefly consider theaxiomatic method here.The real numbers{R,+,·,≤}can be defined as a conditionally complete linearlyorderedfield, i.e.,I{R,+,·}is afield.II{R,≤}is a conditionally complete, linearly ordered set.

Chapter 5Floating-Point ArithmeticThus far our considerations have proceededunder the general assumptionthat the setRin Figure 1 is a linearly ordered ringoid. We are now goingto be more specific and assume thatRis the linearly orderedfield of realnumbers and that the subsetsDandSare special screens, which are calledfloating-point systems. In Section 5.1 we brieflyreviewthedefinition ofthe real numbers and their representation byb-adic expansions. Section 5.2deals withfloating-point numbers and systems. We also discuss severalbasic roundings of the real numbers into afloating-point system, and wederive error bounds for these roundings.In Section 5.3 we considerfloating-point operations defined by semi-morphisms, and we derive error bounds for these operations. Then weconsider the operations defined in the product sets listed underSandDin Figure 1, and we derive error formulas and bounds for these operationsalso. All this is done under the assumption that the operations are definedby semimorphisms.For scalar product and matrix multiplication, we also derive error for-mulas and bounds for the conventional definition of the operations. Theerror formulas are simpler if the operations are defined by semimorphisms,and the error bounds are smaller by a factor of at leastncompared to thebounds obtained by the conventional definition of the operations. These twoproperties are reproduced in the error analysis of many algorithms.Finally we review the so-called IEEEfloating-point arithmetic standard.5.1 Definition and Properties of the Real NumbersTwo methods of defining the real numbers are most commonly used. These may becalled the constructive method and the axiomatic method. We have mentioned theseconcepts already in the introduction of this book. For clarity we briefly consider theaxiomatic method here.The real numbers{R,+,·,≤}can be defined as a conditionally complete linearlyorderedfield, i.e.,I{R,+,·}is afield.II{R,≤}is a conditionally complete, linearly ordered set.
Downloaded on 23.9.2025 from https://www.degruyterbrill.com/document/doi/10.1515/9783110203196.2.154/html?srsltid=AfmBOop5zmfLCIWj9Wdmh3bggmn8YHkiEN9-aDzT6yCJwtyIBiq02u0l
Scroll to top button