can this loop be vectorized?

Question

3 개 추천

Here's a simplified example of some data I have. I'd like to eliminate the loop because my x and y arrays in reality have tens of thousands of elements, not just three.

X = repmat(0:5:1000,201,1);
Y = repmat((1000:-5:0)',1,201);
x = [50 326 800]; 
y = [900 429 600]; 
H = NaN(201,201,length(x)); 
for k = 1:length(x) 
    H(:,:,k) = hypot(X-x(k),Y-y(k)); 
end

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Follow Question

Answer 1

Matt J 2014년 5월 30일

편집: Matt J 2014년 5월 30일

MATLAB Online에서 열기

2 개 추천

Doing bsxfun(@minus,X,...) will be inefficient since X contains a lot of replicated data. So, I propose,

    X0=(0:5:1000).'; sX=length(X0);
    Y0=(1000:-5:0).'; sY=length(Y0);
    [I,J]=ndgrid(1:sY,1:sX);
    Xc=bsxfun(@minus,X0,x(:).').^2;
    Yc=bsxfun(@minus,Y0,y(:).').^2;
    H=reshape( sqrt( Yc(I,:)+Xc(J,:)) ,sY,sX,[]);

댓글 수: 5
이전 댓글 3개 표시 이전 댓글 3개 숨기기

Chad Greene 2014년 5월 30일

편집: Chad Greene 2014년 5월 30일

MATLAB Online에서 열기

This thread is fun to watch! I appreciate your clear examples; I'm learning quite a bit as I follow along.

To respond to a couple of points raised:

1. The data don't necessarily need to be repeated. If it's faster to not repeat the X and Y vectors, I'm happy to leave them as vectors.

2. I like the idea of taking out the square root to speed up the calculation because it's just as easy for me to compare the H.^2 values to a distance squared.

The reason I'm calculating H is because I have a surface Z and I want to identify all data in Z that are more than 80 meters from the points given by x and y. I'll end up setting

Z(min(H,[],3)>80) = NaN;

If it's faster, the sqrt function can be removed and I'll change the line above to

Z(min(H,[].3)>80^2) = NaN;

Chad Greene 2014년 6월 2일

I've adapted your solution and shared it as a function on File Exchange here . Thanks all for your help on this.

댓글을 달려면 로그인하십시오.

Answer 2

José-Luis 2014년 5월 30일

편집: José-Luis 2014년 5월 30일

MATLAB Online에서 열기

1 개 추천

bsxfun() is your friend:

X = repmat(0:5:1000,201,1);
Y = repmat((1000:-5:0)',1,201);
x(1,1,1:3) = [50 326 800]; 
y(1,1,1:3) = [900 429 600]; 
tic
H = NaN(201,201,length(x)); 
for k = 1:length(x) 
    H(:,:,k) = hypot(X-x(k),Y-y(k)); 
end
toc
tic
H_new = hypot(bsxfun(@minus,X,x),bsxfun(@minus,Y,y));
toc

Vectorized code is not necessarily faster. If performance is really an issue, you could try a mex file.

EDIT

Another option:

tic
H_nuevo = hypot(repmat(X,[1 1 3]) - repmat(x,[201 201 1]) , ...
    repmat(Y,[1 1 3]) - repmat(y,[201 201 1]) );
toc

An idea to increase the efficiency of your code is to try to compare squared distances instead, since the sqrt() is relatively expensive to perform.

댓글 수: 6
이전 댓글 4개 표시 이전 댓글 4개 숨기기

José-Luis 2014년 6월 2일

Thanks Sean. Good to know. I find it hard to know a-priori whether Matlab is going to perform decently or will be mired in overheads. Any good rule of thumb?

Sean de Wolski 2014년 6월 2일

What we say at seminars is:

You'll always see a speedup for signal processing, communications, or other applications where there is a persistent state that needs to be updated
Anything with fixed-point

You won't see a speedup for:

Linear Algebra, ffts, or functions that use IPP.

But, your milage will vary :)

댓글을 달려면 로그인하십시오.

Answer 3

Sean de Wolski 2014년 5월 30일

MATLAB Online에서 열기

1 개 추천

I'm seeing a slight (1.25ish times) speedup with the bsxfun approach

function speedyhypot()
X = repmat(0:5:1000,201,1);
Y = repmat((1000:-5:0)',1,201);
x = [50 326 800]; 
y = [900 429 600]; 
t1 = timeit(@FirstWay);
t2 = timeit(@SecondWay);
disp([t1 t2])
    function FirstWay
    H = NaN(201,201,length(x)); 
    for k = 1:length(x)
        H(:,:,k) = hypot(X-x(k),Y-y(k));
    end
    end
    function SecondWay
        H2 = hypot(bsxfun(@minus,X,reshape(x,1,1,3)),bsxfun(@minus,Y,reshape(y,1,1,3)));
    end
end

댓글 수: 1
이전 댓글 -1개 표시 이전 댓글 -1개 숨기기

José-Luis 2014년 5월 30일

I get about the same times.

댓글을 달려면 로그인하십시오.

Answer 4

Jan 2014년 5월 31일

편집: Jan 2014년 5월 31일

MATLAB Online에서 열기

1 개 추천

hypot is smart with avoiding overflows. But for limited values sqrt(a^2+b^2) is much faster:

for k = 1:length(x) 
   H(:,:,k) = sqrt((X-x(k)).^2 + (Y-y(k)).^2); 
end

I get a speedup of factor 3, much more than the bsxfun tricks.

The power operator is expensive, so it is worth to try to reduce the squaring by using the identity: (a-b)^2 == a^2 - 2*a*b + b^2:

X2 = X .^ 2;
Y2 = Y .^ 2;
for k = 1:length(x) 
  H(:,:,k) = sqrt(X2 - X*(2*x(k)) + x(k)^2 + Y2 - Y*(2*y(k)) + y(k)^2);
end

2*X*x(k) is less efficient than X*(2*x(k)), because in the first case you have two multiplications of a scalar with a matrix, but in the second one one matrix operation only.

But this is slower than the first method. It seems like Matlab smartly performs a multiplikation for .^2 and not a power operation.

댓글 수: 5
이전 댓글 3개 표시 이전 댓글 3개 숨기기

Matt J 2014년 6월 1일

편집: Matt J 2014년 6월 1일

MATLAB Online에서 열기

In your solution hypot is replaced by sqrt(a^2+b^2) and I think that this is the main reason for the speedup.

I don't think so. Below is a test script that includes all three methods, mine, yours, and Sean's re-implemented without hypot. The timings I get are as follows,

Elapsed time is 1.846706 seconds. %Sean
Elapsed time is 0.881087 seconds. %Jan
Elapsed time is 0.427472 seconds. %Matt

Note that I have dropped the sqrt operations because Chad says he doesn't need them. When I put them back in, the gap between yours and mine diminishes,

Elapsed time is 4.260694 seconds. %Sean
Elapsed time is 1.546904 seconds. %Jan
Elapsed time is 1.291443 seconds. %Matt

but there is still a considerable gap between mine and Sean's.

     N=4000; x=rand(1,N); y=x;
      X0=(0:5:1000).'; sX=length(X0);
      Y0=(1000:-5:0).'; sY=length(Y0);
     [Y,X]=ndgrid(Y0,X0);
      %%%Sean
      tic;
         H = (  bsxfun(@minus,X,reshape(x,1,1,N)).^2 - ...
                     bsxfun(@minus,Y,reshape(y,1,1,N)).^2  );
      toc
      %%%Jan
      tic;
      H=zeros(sY,sX,N);
      for k = 1:length(x) 
         H(:,:,k) = ((X-x(k)).^2 + (Y-y(k)).^2); 
      end
      toc
      %%%Matt
      tic;
       Xc=reshape( bsxfun(@minus,X0,x(:).') , 1,sX,N);
       Yc=reshape( bsxfun(@minus,Y0,y(:).'), sY,1,N);
       H=( bsxfun(@plus,Xc,Yc) );
      toc

Jan 2014년 6월 1일

편집: Jan 2014년 6월 1일

@Matt J: Which Matlab version is used for your timings?

Matt J 2014년 6월 1일

편집: Matt J 2014년 6월 1일

R2013b. I get similar timings in R2012a, though.

댓글을 달려면 로그인하십시오.

can this loop be vectorized?

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 5
이전 댓글 3개 표시 이전 댓글 3개 숨기기

추가 답변 (3개)

댓글 수: 6
이전 댓글 4개 표시 이전 댓글 4개 숨기기

댓글 수: 1
이전 댓글 -1개 표시 이전 댓글 -1개 숨기기

댓글 수: 5
이전 댓글 3개 표시 이전 댓글 3개 숨기기

카테고리

태그

Community Treasure Hunt

can this loop be vectorized?

댓글 수: 0 이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 5 이전 댓글 3개 표시 이전 댓글 3개 숨기기

추가 답변 (3개)

댓글 수: 6 이전 댓글 4개 표시 이전 댓글 4개 숨기기

댓글 수: 1 이전 댓글 -1개 표시 이전 댓글 -1개 숨기기

댓글 수: 5 이전 댓글 3개 표시 이전 댓글 3개 숨기기

카테고리

태그

참고 항목

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시 이전 댓글 -2개 숨기기

댓글 수: 5
이전 댓글 3개 표시 이전 댓글 3개 숨기기

댓글 수: 6
이전 댓글 4개 표시 이전 댓글 4개 숨기기

댓글 수: 1
이전 댓글 -1개 표시 이전 댓글 -1개 숨기기

댓글 수: 5
이전 댓글 3개 표시 이전 댓글 3개 숨기기