huge differences in single vs double precision math

Question

0 개 추천

I am calculating a sum of squares in 32-bit FP precision (for comparison with a GPU algorithm, which isn't relevant here).

Here is the code:

Y=single((0:499).^2);
sum(Y)
ans =
   41541684
sum(double(Y))
ans = 
   41541750

The (correct) double answer is off by 66! The largest value, 499^2 = 249001, is nowhere near any FP limits.

This is R2013A on OS X 10.9.

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Follow Question

Answer 1

John D'Errico 2014년 8월 7일

MATLAB Online에서 열기

4 개 추천

What you don't understand is that single precision has a 23 bit mantissa. While there are 32 total bits stored in a single, don't forget that one of those bits is a sign bit, which leaves 8 bits to store an exponent in a biased form. So you cannot store an INTEGER larger than 2^24-1 in a single, if you wish to do so without error.

The sum you formed was larger than that limit, so you should expect an error.

log2(41541750)
ans =
     25.308

It is time for you to start reading about floating point arithmetic.

Wiki floating point article

What Every Computer Scientist Should Know About Floating Point Arithmetic

Computers are not all powerful, except for those in the movies/tv.

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

huge differences in single vs double precision math

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

카테고리

제품

태그

Community Treasure Hunt

huge differences in single vs double precision math

댓글 수: 0 이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0 이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

카테고리

제품

태그

참고 항목

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기