About the Project

floating-point

AdvancedHelp

(0.002 seconds)

9 matching pages

1: 3.1 Arithmetics and Error Measures
A nonzero normalized binary floating-point machine number x is represented as … …
IEEE Standard
Figure 3.1.1: Floating-point arithmetic. …
2: 4.48 Software
All scientific programming languages, libraries, and systems support computation of at least some of the elementary functions in standard floating-point arithmetic (§3.1(i)). … A more complete list of available software for computing these functions is found in the Software Index; again, software that uses only standard floating-point arithmetic is excluded. …
3: Bibliography I
  • IEEE (2008) IEEE Standard for Floating-Point Arithmetic. The Institute of Electrical and Electronics Engineers, Inc..
  • IEEE (2019) IEEE International Standard for Information Technology—Microprocessor Systems—Floating-Point arithmetic: IEEE Std 754-2019. The Institute of Electrical and Electronics Engineers, Inc..
  • 4: Bibliography O
  • M. L. Overton (2001) Numerical Computing with IEEE Floating Point Arithmetic. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA.
  • 5: Bibliography M
  • Maxima (free interactive system)
  • MPFR (free C library)
  • mpmath (free python library)
  • 6: Bibliography S
  • D. M. Smith (2001) Algorithm 814: Fortran 90 software for floating-point multiple precision arithmetic, gamma and related functions. ACM Trans. Math. Software 27 (4), pp. 377–387.
  • D. M. Smith (1991) Algorithm 693: A FORTRAN package for floating-point multiple-precision arithmetic. ACM Trans. Math. Software 17 (2), pp. 273–283.
  • 7: Bibliography G
  • D. Goldberg (1991) What every computer scientist should know about floating-point arithmetic. ACM Computing Surveys 23 (1), pp. 5–48.
  • 8: Bibliography C
  • C. W. Clenshaw and F. W. J. Olver (1984) Beyond floating point. J. Assoc. Comput. Mach. 31 (2), pp. 319–328.
  • 9: Errata
  • Section 3.1

    In ¶IEEE Standard (in §3.1(i)), the description was modified to reflect the most recent IEEE 754-2019 Floating-Point Arithmetic Standard IEEE (2019). In the new standard, single, double and quad floating-point precisions are replaced with new standard names of binary32, binary64 and binary128. Figure 3.1.1 has been expanded to include the binary128 floating-point memory positions and the caption has been updated using the terminology of the 2019 standard. A sentence at the end of Subsection 3.1(ii) has been added referring readers to the IEEE Standards for Interval Arithmetic IEEE (2015, 2018).

    Suggested by Nicola Torracca.