float_params2

Version 1.0.1 (2.66 KB) by Marco Cococcioni

MATLAB Code for Parameters of Floating-Point Arithmetics

Downloads: 20

Updated: 10 Jun 2021

`float_params2` is a MATLAB function for obtaining the parameters of several floating-point arithmetics. The parameters are built into the code and are not computed at run time.

The parameters are

- the unit roundoff,

- the smallest positive (subnormal) floating-point number,

- the smallest positive normalized floating-point number,

- the largest floating-point number,

- the number of binary digits in the significand (including the implicit leading bit)

and the arithmetics supported are

- bfloat8,

- bfloat16,

- IEEE half precision (fp16),

- IEEE single precision (fp32),

- IEEE double precision (fp64),

- IEEE quadruple precision (fp128).
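The five parameters listed above are tied together by standard relations, so they can all be derived from two numbers per format: the significand width `p` and the largest exponent `emax`. The sketch below illustrates those relations in Python rather than MATLAB. It does not call `float_params2`, which stores the values as constants, and the names `FORMATS` and `derive_params` are hypothetical. bfloat8 is left out because its bit layout is not stated on this page.

```python
from fractions import Fraction

# (p, emax) per format. p counts the implicit leading bit; emax is the
# largest exponent. These are the commonly published values, not values
# read out of float_params2.
FORMATS = {
    "bfloat16": (8, 127),
    "fp16": (11, 15),
    "fp32": (24, 127),
    "fp64": (53, 1023),
    "fp128": (113, 16383),
}

def derive_params(p, emax):
    """Return the five parameters as exact rationals."""
    emin = 1 - emax          # smallest exponent of a normalized number
    emins = emin + 1 - p     # exponent of the smallest subnormal number
    two = Fraction(2)
    return {
        "u": two ** -p,                              # unit roundoff
        "xmins": two ** emins,                       # smallest positive subnormal
        "xmin": two ** emin,                         # smallest positive normalized
        "xmax": two ** emax * (2 - two ** (1 - p)),  # largest finite number
        "p": p,                                      # significand bits
    }

print(derive_params(*FORMATS["fp16"])["xmax"])  # 65504
```

Exact rationals are used because the fp128 values overflow a Python `float`. Convert with `float(...)` only for formats no wider than fp64.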

The code was developed in MATLAB R2020a and works with versions at least back to R2016b.

This is a small extension of Nick Higham's float_params, to which I added support for the 8-bit Brain Float (bfloat8), as proposed at Intel by Naveen K. Mellempudi.

More details can be found here: https://arxiv.org/abs/1905.12334

I also renamed NVIDIA's tf32 to tf19, to reflect that it is a 19-bit format.
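The 19-bit figure comes from adding up the fields of NVIDIA's format: 1 sign bit, 8 exponent bits, and 10 stored fraction bits. A minimal Python sketch of that count follows; the helper name `storage_bits` is hypothetical and is not part of `float_params2`.

```python
def storage_bits(exponent_bits, p):
    """Total width: 1 sign bit + exponent field + stored fraction bits.

    The stored fraction has p - 1 bits because the leading significand
    bit is implicit.
    """
    return 1 + exponent_bits + (p - 1)

print(storage_bits(8, 11))  # tf32 / tf19: 19
print(storage_bits(5, 11))  # fp16: 16
```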

Cite As

Marco Cococcioni (2024). float_params2 (https://www.mathworks.com/matlabcentral/fileexchange/93835-float_params2), MATLAB Central File Exchange. Retrieved July 26, 2024.

MATLAB Release Compatibility

Created with R2021a

Compatible with any release

Platform Compatibility

Windows macOS Linux

Acknowledgements

Inspired by: float_params

float_params2.m

Version	Published	Release Notes
1.0.1	10 Jun 2021	very small update
1.0.0	10 Jun 2021	