how to use printf inside a CUDA kernel?

조회 수: 63 (최근 30일)
Daniel Castaño Díez
Daniel Castaño Díez 2024년 6월 28일
댓글: Daniel Castaño Díez 2024년 7월 11일
Hi,
I wonder why I cannot use printf in cuda kernels. The code inside my file test.cu (adapted from the Mathworks help)
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <iostream>
__global__ void add2( double * v1, const double * v2)
{
int idx = threadIdx.x;
v1[idx] += v2[idx];
printf("identity: %d \n",idx);
}
compiles nicely with mexcuda with
mexcuda -ptx test.cu
but trying to runt it from the command line as
k = parallel.gpu.CUDAKernel("test.ptx","test.cu");
N = 8;
k.ThreadBlockSize = N;
in1 = ones(N,1,"gpuArray");
in2 = ones(N,1,"gpuArray");
result = feval(k,in1,in2);
gather(result);
does not put any result on screen.
this link suggests some operations with the header, as #undef printf to avoid conflicts with mex.h... but it didn't work for me.
  댓글 수: 5
Daniel Castaño Díez
Daniel Castaño Díez 2024년 7월 4일
Hi Umar,
thanks, but why do you think snprintf is allowed in device code? mexcuda does protest:
/mnt/stor0hdd/castano/workplace/gpu/kernel/test.cu(21): error:
calling a __host__ function("snprintf") from a __global__
function("add2") is not allowed
Should I do something special to specify that it is some special version of a host device?
cheers,
D.
Umar
Umar 2024년 7월 4일
Hi Daniel,
In CUDA C/C++, snprintf is allowed in device code because it is a host function that can be used in device code without any special modifications and facilitating easier code development. Also, when you use snprintf in device code, CUDA automatically handles the necessary translations and optimizations for device execution. Therefore, you do not need to specify snprintf as a special version for host or device; it can be used directly in device code as you would in host code.

댓글을 달려면 로그인하십시오.

채택된 답변

Joss Knight
Joss Knight 2024년 7월 6일
편집: Joss Knight 2024년 7월 6일

Just use it, and launch MATLAB from a terminal. On Linux, the output will appear in the terminal window. On Windows you will need to launch MATLAB with the undocumented options -wait -log.

  댓글 수: 3
Joss Knight
Joss Knight 2024년 7월 8일
If you want to redirect the output into the MATLAB command window you are going to need to use one of the tricks off the internet or mentioned here to redirect the kernel stdout into a buffer so you can invoke mexPrintf with it on the host. For debugging purposes, this is more trouble than it's worth.
Daniel Castaño Díez
Daniel Castaño Díez 2024년 7월 11일
Thanks, Joss. As you say, for debugging purposes I don't need it... I'm already happy with the output in the terminal :-)

댓글을 달려면 로그인하십시오.

추가 답변 (1개)

Udit06
Udit06 2024년 7월 1일
Hi Daniel,
One more suggestion that I found in the following discussion is to use "cudaDeviceSynchronize" to ensure that the kernel finishes and the driver flushes the output buffer.
If the issue still persists, you can refer to the solution given in the following discussion:
I hope this helps.

카테고리

Help CenterFile Exchange에서 GPU Computing에 대해 자세히 알아보기

제품


릴리스

R2024a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by