IFFT slow down with using gpuArray

조회 수: 3 (최근 30일)

Michael 2013년 5월 3일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/74536-ifft-slow-down-with-using-gpuarray

Two sets of data A (4096 x 1024) matrix and B (32768 x 1024) matrix have been transferred to the GPU using gpuArray. A is passed into the FFT function and has shown a significant speed increase in comparison to the CPU A data. B is passed into the IFFT function and has shown approximately a 50% decrease in efficiency in comparison to the CPU B data. Is there a reason why the IFFT function does not have the speed increase proportional to the FFT function? I understand the sizes differ but I do no understand why the GPU implemented IFFT is slower then the CPU implemented IFFT. Also, the tic toc function and the run and time function were used to time the results. Thank you for your help.

댓글 수: 4
이전 댓글 2개 표시이전 댓글 2개 숨기기

James Lebak 2013년 5월 3일

편집: James Lebak 2013년 5월 3일

MATLAB Online에서 열기

When I time this on MATLAB R2013a, 3.5 GHz Xeon, with a Tesla C2075 GPU, I see 0.36 s for the IFFT of a 32768x1024 matrix on the CPU and 0.051s on the GPU. Here is the code I used:

x=gpuArray.ones(32768,1024);
gd=gpuDevice;
tic;y=ifft(x);wait(gd);toc
xc=gather(x);
tic;y=ifft(xc);toc

And the output:

Elapsed time is 0.050705 seconds.
Elapsed time is 0.364836 seconds.

I would be interested to know what this code shows you, and also whether having the other array that you mentioned in memory changes the performance. I didn't see a change, but I don't have access to this specific card that you have.

Michael 2013년 5월 3일

MATLAB Online에서 열기

Thank you for the test case. When I run this same program the output is:

Elapsed time is 0.466822 seconds.
Elapsed time is 0.863542 seconds.

I believe the Tesla C2075 has a faster processing time than the GeForce GT 630M. However, your efficiency is terrific with a speed up of approximately 600% and mine was 85%. Why would there be such a difference? Thank you

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

채택된 답변

Matt J 2013년 5월 3일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/74536-ifft-slow-down-with-using-gpuarray#answer_84331

What graphics card do you have? How much RAM does it have? It could be that the larger array is just having a harder time because of memory constraints.

댓글 수: 10
이전 댓글 8개 표시이전 댓글 8개 숨기기

Matt J 2013년 5월 3일

MATLAB Online에서 열기

Also, Matt J, my apologizes about the program. The code I attached is part of a project. Is there a way to attach .m files in this forum?

Just give values for

    NumberofAlines = pdHeader(1);
    nAlineLength = pdHeader(2);
    nPaddingFactor = pdInit(4);

I assume that pdBuffer is the 32768x1024 array?

Michael 2013년 5월 3일

MATLAB Online에서 열기

Of course. Thank you:

NumberofAlines = 1024
nAlineLength = 4096
nPaddingFactor = 8

댓글을 달려면 로그인하십시오.

추가 답변 (1개)

James Lebak 2013년 5월 3일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/74536-ifft-slow-down-with-using-gpuarray#answer_84340

편집: James Lebak 2013년 5월 4일

The GeForce GT630M is a mobile graphics card. Frequently, these cards don't perform as well in double-precision as they do in single-precision. If your application can handle single-precision, you can try the IFFT in single and see if that gives you better performance. If you need double precision performance, you might want to try a different card.

This especially applies if the card in question is compute capability 3.0. You can find out the compute capability of the card in MATLAB from the structure returned by 'gpuDevice'.

Edit: removed incorrect identification of the 630M.

댓글 수: 5
이전 댓글 3개 표시이전 댓글 3개 숨기기

Michael 2013년 5월 5일

James Lebak you were correct. Single-precision is performing significantly more efficient than the double-precision data. Matt J and James Lebak thank you for all your help.

Matt J 2013년 5월 6일

편집: Matt J 2013년 5월 6일

If James was right, then why didn't you accept his Answer instead of mine???

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

카테고리

Parallel Computing Parallel Computing Toolbox GPU Computing GPU Computing in MATLAB

Help Center 및 File Exchange에서 GPU Computing in MATLAB에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by

IFFT slow down with using gpuArray

댓글 수: 4
이전 댓글 2개 표시이전 댓글 2개 숨기기

채택된 답변

댓글 수: 10
이전 댓글 8개 표시이전 댓글 8개 숨기기

추가 답변 (1개)

댓글 수: 5
이전 댓글 3개 표시이전 댓글 3개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

IFFT slow down with using gpuArray

댓글 수: 4 이전 댓글 2개 표시이전 댓글 2개 숨기기

채택된 답변

댓글 수: 10 이전 댓글 8개 표시이전 댓글 8개 숨기기

추가 답변 (1개)

댓글 수: 5 이전 댓글 3개 표시이전 댓글 3개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 4
이전 댓글 2개 표시이전 댓글 2개 숨기기

댓글 수: 10
이전 댓글 8개 표시이전 댓글 8개 숨기기

댓글 수: 5
이전 댓글 3개 표시이전 댓글 3개 숨기기