Two tesla c2050 + quadro 600 - GPU error. Ideas ?

조회 수: 1 (최근 30일)
Denis Zhegalin
Denis Zhegalin 2011년 2월 17일
Hello, I've got a remote cluster with 2 teslas c2050 inside (compute capability 2.0) + quadro 600 for OS purposes. All drivers are the newest from NVIDIA. CUDA toolkit - the latest one 3.2. MATLAB R2010b x64 + PCT + MDCS. Parallel computing test works fine. win7 x64 + 16Gb of RAM. This machine is public one, so some other software (CUDA compatible) is being tested on it. MATLAB is being run under system administrator. Some time ago it worked, now I have an error:
>> gpuDevice ??? Error using ==> feval An error occurred during: setting up the CUDA device. The CUDA error was: (Unknown code: 10100).
Error in ==> currentDeviceIndex at 7 idx = feval( '_gpu_selectedDeviceIdx' );
Error in ==> deviceProperties at 8 idx = parallel.internal.gpu.currentDeviceIndex;
Error in ==> GPUDevice.GPUDevice>GPUDevice.current at 34 [props, E] = parallel.internal.gpu.deviceProperties();
Error in ==> gpuDevice at 18 dev = parallel.gpu.GPUDevice.current();
Actually, I don't have an idea what to test or to check. Please help.

채택된 답변

Edric Ellis
Edric Ellis 2011년 2월 17일
That error code usually indicates that CUDA is having a problem connecting to the device. Are you connecting to the remote machines using remote desktop? (If so, you'll need the latest Tesla-specific "TCC" drivers).
  댓글 수: 12
Jason Ross
Jason Ross 2011년 3월 3일
You might want to try downloading the developer driver from the CUDA downloads page. That's what we have and we definitely have the utility.
http://developer.nvidia.com/object/cuda_3_2_downloads.html
Najeeb Ahmad
Najeeb Ahmad 2011년 3월 7일
Thanks a lot for your reply.
As per your advice, I downloaded and installed developer driver for Vista and Windows 7 from the link you provided. The file name was devdriver_3.2_winvista-win7_32_263.06_general.exe. However, I still have the folders I mentioned in my previous reply and no NVSMI folder. Tried running 'gpuDevice' command and still get the same error:
??? Error using ==> feval
An error occurred during: setting up the CUDA
device. The CUDA error was: (Unknown code: 10100).
Error in ==> currentDeviceIndex at 7
idx = feval( '_gpu_selectedDeviceIdx' );
Error in ==> deviceProperties at 8
idx = parallel.internal.gpu.currentDeviceIndex;
Error in ==> GPUDevice.GPUDevice>GPUDevice.current
at 34
[props, E] =
parallel.internal.gpu.deviceProperties();
Error in ==> gpuDevice at 18
dev = parallel.gpu.GPUDevice.current();
Am I doing something wrong? Thank you for your co-operation.

댓글을 달려면 로그인하십시오.

추가 답변 (1개)

Najeeb Ahmad
Najeeb Ahmad 2011년 3월 8일
The problem was with Windows 7 32-bit as it doesn't support TCC driver. I installed Windows 7 64-bit and associated TCC drivers for Tesla C1060 and don't get the error anymore. Moroever, now I do have NVSMI folder in its rightful place but it never required me to run nvidia-smi.exe utility to switch driver into TCC mode which indicates that driver is in TCC mode by default.

카테고리

Help CenterFile Exchange에서 GPU Computing에 대해 자세히 알아보기

태그

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by