CUDA_ERROR_LAUNCH_FAILED problem
Hi Forum,
I'm very new to GPU programming, and I'm trying to write a simple CUDA program to speed up calculations of the digamma function for large 2D matrices. I've adapted a C implementation of the digamma function (from the "lightspeed" Matlab toolbox) to run on the GPU. I then compile the .cu code to a .ptx file and instantiate the kernel in Matlab by calling
k = parallel.gpu.CUDAKernel('digamma_test.ptx','digamma_test.cu');
and then use
o = feval(k,matrix);
The function works great, providing huge speed-ups on large matrices. The problem is that after a bunch of feval calls I get an error when I try to instantiate the kernel or call feval: "CUDA_ERROR_LAUNCH_FAILED", with no more information than that. If I restart Matlab, I can get it to work again for about the same period of time.
My thoughts are that somehow I'm not managing my memory very well. I've never worked in a language where memory has to be explicitly managed like CUDA or C, so I'm sure I'm leaving something out. The problem is, I just don't know what.
Here's my kernel:
#include <math.h>
#include <stdlib.h>
#include <float.h>

/* This is directly from Minka's lightspeed toolbox. */
__global__ void digamma(double *y)
{
    int idx_x  = blockIdx.x;
    int idx_y  = blockIdx.y;
    int offset = idx_x + idx_y * gridDim.x;
    double x = y[offset];
    double neginf = -INFINITY;
    const double c = 12,
        digamma1  = -0.57721566490153286,
        trigamma1 = 1.6449340668482264365,  /* pi^2/6 */
        s   = 1e-6,
        s3  = 1./12,
        s4  = 1./120,
        s5  = 1./252,
        s6  = 1./240,
        s7  = 1./132,
        s8  = 691./32760,
        s9  = 1./12,
        s10 = 3617./8160;
    double result;

    /* Illegal arguments */
    if((x == neginf) || isnan(x)) {
        y[offset] = NAN;
        return;
    }

    /* Singularities */
    if((x <= 0) && (floor(x) == x)) {
        y[offset] = neginf;
        return;
    }

    /* Negative values */
    /* Use the reflection formula (Jeffrey 11.1.6):
     *   digamma(-x) = digamma(x+1) + pi*cot(pi*x)
     *
     * This is related to the identity
     *   digamma(-x) = digamma(x+1) - digamma(z) + digamma(1-z)
     * where z is the fractional part of x
     * For example:
     *   digamma(-3.1) = 1/3.1 + 1/2.1 + 1/1.1 + 1/0.1 + digamma(1-0.1)
     *                 = digamma(4.1) - digamma(0.1) + digamma(1-0.1)
     * Then we use
     *   digamma(1-z) - digamma(z) = pi*cot(pi*z)
     *
    if(x < 0) {
        *p = digamma(p,1-x) + M_PI/tan(-M_PI*x);
        return;
    }
    */

    /* Use Taylor series if argument <= S */
    if(x <= s) {
        y[offset] = digamma1 - 1/x + trigamma1*x;
        return;
    }

    /* Reduce to digamma(X + N) where (X + N) >= C */
    result = 0;
    while(x < c) {
        result -= 1/x;
        x++;
    }

    /* Use de Moivre's expansion if argument >= C */
    /* This expansion can be computed in Maple via asympt(Psi(x),x) */
    if(x >= c) {
        double r = 1/x, t;
        result += log(x) - 0.5*r;
        r *= r;
#if 0
        result -= r * (s3 - r * (s4 - r * (s5 - r * (s6 - r * s7))));
#else
        /* this version for lame compilers */
        t = (s5 - r * (s6 - r * s7));
        result -= r * (s3 - r * (s4 - r * t));
#endif
    }

    /* Assign the result back through the pointer. */
    y[offset] = result;
    return;
}
1 Comment
Joss Knight
17 January 2014
How are you setting your launch configuration (ThreadBlockSize, GridSize)? You are accessing your array in an unusual way, using blockIdx instead of threadIdx. If your array is just a vector, it would be more normal to have:
offset = threadIdx.x + blockDim.x*blockIdx.x;
You're also going to need some sort of guard to prevent threads from accessing memory beyond the end of the array. The normal way to do this would be to pass in the length of your array, or to hard-code it into your kernel.
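For illustration, a minimal sketch of such a guarded, thread-indexed version might look like the following (the kernel name digamma_guarded and the extra length argument n are assumptions for this sketch, not the poster's code; n would be supplied as an additional input at feval time):
__global__ void digamma_guarded(double *y, const int n)
{
    /* One thread per element: combine the thread index within the block
     * with the block index to get a unique offset into the array. */
    int offset = threadIdx.x + blockDim.x * blockIdx.x;

    /* Guard: threads past the end of the array return without touching memory. */
    if (offset >= n) {
        return;
    }

    double x = y[offset];
    /* ... the digamma computation from the kernel above goes here,
     *     leaving its value in x ... */
    y[offset] = x;
}
On the MATLAB side one would then call something like feval(k, matrix, numel(matrix)), after setting k.ThreadBlockSize and k.GridSize so that the total number of threads launched is at least numel(matrix).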
Note that if your kernel fails to launch, it puts the device into an error state, and no further kernels can be launched until that state is cleared. In MATLAB this can be done by resetting the device:
reset(gpuDevice);
or in C code using
cudaGetLastError();
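For completeness, a hedged sketch of what that looks like in host-side C/CUDA code, assuming the guarded kernel from the sketch above (the function name run_digamma and the block size of 256 are arbitrary choices for illustration):
#include <stdio.h>
#include <cuda_runtime.h>

/* Launch the kernel, then query the launch status. cudaGetLastError()
 * returns the most recent error and resets the runtime's error variable. */
void run_digamma(double *d_y, int n)
{
    const int threadsPerBlock = 256;
    const int blocks = (n + threadsPerBlock - 1) / threadsPerBlock;

    digamma_guarded<<<blocks, threadsPerBlock>>>(d_y, n);

    cudaError_t err = cudaGetLastError();
    if (err != cudaSuccess) {
        fprintf(stderr, "launch failed: %s\n", cudaGetErrorString(err));
    }

    /* Errors that occur while the kernel is running only surface at the
     * next synchronizing call. */
    err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        fprintf(stderr, "execution failed: %s\n", cudaGetErrorString(err));
    }
}
If the error is a sticky one (for example an illegal memory access), a full cudaDeviceReset(), the C-side counterpart of reset(gpuDevice) in MATLAB, may still be needed before new kernels will launch.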
Answers (2)
Edward
14 January 2014
I am having a similar problem as well. My kernel "runs," but then all GPU functionality is lost. If I try to do anything involving the GPU, I am met with:
An unexpected error occurred during CUDA execution. The CUDA error was:
CUDA_ERROR_LAUNCH_FAILED
or:
An unexpected error occurred during CUDA execution. The CUDA error was:
unspecified launch failure
Anyone have any ideas? Thanks!
4 Comments
Suvidha Tripathi
10 June 2020
Thank you, this worked for me. Simple and effective. Thanks a lot again!
Matteo Piccio
29 January 2021
@Marius Bledea I changed the settings as suggested, but I'm still having the same issues. Do you have any other suggestions?
I'm using an NVIDIA GTX 770.
Thanks in advance
fausto
28 June 2013
Edited: fausto, 28 June 2013
I have a similar problem, except my MATLAB won't even work for a short period of time: it gives this error immediately.
Did you solve it? Does somebody have the answer to this?
1 Comment
Tim DeWolf
11 August 2017
For me, this happened when a kernel hit the execution time limit on Windows. The solution for me was to break the problem up further so that each kernel launch completes faster. I had to completely restart MATLAB, as reset(gpuDevice) didn't work.