Deep learning with a GPU that supports fp16

Question

0 votes

Hi.

NVIDIA has released the new RTX 2XXX and 3XXX series, which support fp16 and accelerate the training process.

Does MATLAB support this?

Thank you

Comments: 4

Walter Roberson on 1 Sep 2019

An interesting article came through recently: https://www.linkedin.com/pulse/deep-learning-cant-progress-ieee-754-floating-point-heres-omtzigt/

Krishna Bindumadhavan on 14 Sep 2019

MATLAB supports half precision through the half-precision object, available in Fixed-Point Designer: https://www.mathworks.com/help/fixedpoint/ref/half.html
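For illustration, here is a minimal sketch of what working with the half object could look like. It assumes Fixed-Point Designer is installed; the variable names and the value 0.1 are arbitrary choices for the example and do not come from the documentation page linked above.

```matlab
% Minimal sketch; assumes Fixed-Point Designer is installed.
x = single(0.1);        % arbitrary single-precision value (illustrative)
h = half(x);            % convert to a half-precision (16-bit) value

% Converting back to single exposes the rounding error of the 16-bit format.
roundingError = abs(single(h) - x);

disp(class(h))          % class of the converted value
disp(roundingError)     % size of the error introduced by the conversion
```

This only shows storage and conversion of half values. It does not make network training run in half precision.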

General code generation support for the half-precision data type via MATLAB Coder and GPU Coder is under active development and is expected in an upcoming release.

As mentioned below, there is currently no support for using half precision to train a deep learning network in MATLAB. This is expected in a future release.


Answer 1

Joss Knight on 29 Aug 2019

1 vote

You can take advantage of FP16 when generating code for prediction on a deep neural network. Follow the pattern of the Deep Learning Prediction with NVIDIA TensorRT example but set the DataType property of the DeepLearningConfig to 'fp16'. This will use the Tensor cores on a Volta or Turing card such as the RTX series.
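As a rough sketch of that configuration, the code generation call might look like the following. It assumes GPU Coder, the TensorRT support package, and a supported NVIDIA GPU are set up. The entry-point function name `myPredict` and the 224-by-224-by-3 input size are hypothetical placeholders, so substitute your own prediction function and input dimensions.

```matlab
% Sketch only: myPredict is a hypothetical entry-point function that
% loads a trained network and calls predict on its input.
cfg = coder.gpuConfig('mex');          % generate a CUDA MEX function
cfg.TargetLang = 'C++';                % deep learning codegen targets C++

% Target TensorRT and request half-precision inference.
cfg.DeepLearningConfig = coder.DeepLearningConfig('tensorrt');
cfg.DeepLearningConfig.DataType = 'fp16';

% The input size below is an assumption; match it to your network.
codegen -config cfg myPredict -args {ones(224,224,3,'single')} -report
```

Only the generated prediction code runs in FP16 here. The network itself is still trained in single precision.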

There is no way yet to use half precision or Tensor cores for training a deep neural network in MATLAB. This is expected in an upcoming release.

Comments: 4

Juuso Korhonen on 24 Feb 2021

What about now? Or do we have to wait for the 2021 release?

Joss Knight on 24 Feb 2021

You can use the Deep Network Quantizer to calibrate a trained network for 8-bit reduced precision types. For now, fp16 is not supported, and quantization-aware training is not supported.

With an Ampere card, using the latest R2021a release of MATLAB (soon to be released), you will be able to take advantage of the Tensor cores using single precision because of the new TF32 datatype that cuDNN leverages when performing convolutions on an Ampere card.
