FMA is a very efficient basic arithmetic function to compute d=a*b+c. In practice a*b+c is internaly exactly computed and d receives the double precision approximation of this exact result. Our code FMA does this job using simd feature of any not too old processor. Our code is limited to double precision variable and AVX2 registers (256 bits i.e. 4 double). a, b, c and are scalars or 1D vectors of same length. Our C routine uses openmp (multithreadings) and simd (4 operations at the same time within 256 bits registers) and is faster than the built in matlab fma (less overhead because limited to 1D double).
This function can be extended to AVX3 (512 bits registers) for the most recent CPUs.
인용 양식
Alain Barraud (2025). double precision fma function for every body (https://www.mathworks.com/matlabcentral/fileexchange/155607-double-precision-fma-function-for-every-body), MATLAB Central File Exchange. 검색 날짜: .
MATLAB 릴리스 호환 정보
개발 환경:
R2023b
모든 릴리스와 호환
플랫폼 호환성
Windows macOS Linux태그
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!FMA
버전 | 게시됨 | 릴리스 정보 | |
---|---|---|---|
1.0.0 |