Arrayfun GPU in "Game of Life" works slower than CPU

조회 수: 2 (최근 30일)
Uladzislau
Uladzislau 2020년 3월 18일
편집: Joss Knight 2020년 3월 28일
Hello! I've run the demo "paralleldemo_gpu_stencil" and have such a result:
CPU: 2.815ms per generation.
Simple GPU: 2.650ms per generation (1.1x faster).
Arrayfun GPU: 13.253ms per generation (0.2x faster).
I've used ThinkPad P50 with Quadro M1000M and Matlab R2019b with appropriate drivers. Why Arrayfun works such slow ??
Uladzislau.

채택된 답변

Joss Knight
Joss Knight 2020년 3월 28일
편집: Joss Knight 2020년 3월 28일
Check out this Answer. The arrayfun version is rather dependent on good memory performance since the kernel is accessing global GPU memory in a non-coalesced way (multiple threads accessing overlapping regions of memory that aren't contiguous). For your chip, the version that runs multiple kernels on multiple shifted copies of the grid is actually more efficient, despite the kernel launch overhead and the extra memory allocation needed.

추가 답변 (0개)

카테고리

Help CenterFile Exchange에서 GPU Computing in MATLAB에 대해 자세히 알아보기

태그

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by