GPU temperature limit control

조회 수: 10 (최근 30일)
rokP
rokP 2018년 2월 18일
답변: Jason Ross 2018년 2월 21일
I'm running extensive CUDA code on 3 GPUs and eventually found out that one GPU stops working due to reaching the temperature limit (95C). Is there a way that I can adapt/decrease the GPU temperature limit programmatically?

채택된 답변

Jason Ross
Jason Ross 2018년 2월 21일
You can change some GPU settings via the nvidia-smi utility. One of them is the power cap. Perhaps this can act as a proxy to reducing the temperature? There are a number of ways to monitor the temperature using the utility, but on a quick scan I don't see a way to set a lower temperature limit directly. You might be able to do this via the desktop settings app they ship. On Windows it's in Control Panel and on Linux, it's nvidia-settings.
I'd also suggest seeing if you can get more cooling in your machine. There might be some free/open fan headers, and fans are pretty inexpensive to buy.
The relevant nvidia-smi entry is here:
SW Power Cap SW Power Scaling algorithm is reducing the clocks below
requested clocks because the GPU is consuming too much
power. E.g. SW power cap limit can be changed with
nvidia-smi --power-limit=

추가 답변 (0개)

카테고리

Help CenterFile Exchange에서 GPU Computing에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by