How to export INT8 quantized weight of deep neural network?

Question

Jisu Kwon, May 29, 2024

Direct link to this question:

https://kr.mathworks.com/matlabcentral/answers/2123701-how-to-export-int8-quantized-weight-of-deep-neural-network

Commented: Angelo Yeo, May 30, 2024

Accepted Answer: Angelo Yeo

I trained a neural network using Deep Learning Toolbox and then quantized it.

Below is the code I used to quantize the network model to INT8.

% Create a dlquantizer object for quantization
quantObj = dlquantizer(net);
% quantOpts = dlquantizationOptions(target='host');
calibrate(quantObj,imdsTrain);
% valResults = validate(quantObj, imdsValidation, quantOpts);
% valResults.Statistics
% Perform quantization
quantObj = quantize(quantObj);
qDetailsQuantized = quantizationDetails(quantObj)
% Save the quantized network
save('quantizedNet.mat', 'quantObj');
exportONNXNetwork(quantObj,'quantizedNet.onnx')

After quantization, I got the quantized network quantObj.

However, I cannot access the weights and biases that were converted to INT8 format.

When I display the quantized network's weights and biases using the code below,

>> disp(quantObj.Layers(2).Bias(:,:,1))
-6.9011793e-12

It still shows a floating-point value.

Even when I try to export the network to ONNX, MATLAB shows the following warning:

>> exportONNXNetwork(quantObj,'quantizedNet.onnx')
Warning: Exported weights are not quantized when exporting quantized networks. 

How can I access the INT8 quantized weight and bias values?


Answer 1

Angelo Yeo, May 30, 2024

Direct link to this answer:

https://kr.mathworks.com/matlabcentral/answers/2123701-how-to-export-int8-quantized-weight-of-deep-neural-network#answer_1465151

Use the quantizationDetails function to extract quantization details.

Inspect the qDetailsQuantized variable that you already extracted with quantizationDetails. Have you looked at qDetailsQuantized.QuantizedLearnables?

The following example may be helpful:

Display quantization details for a neural network - MATLAB quantizationDetails (mathworks.com)
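
As a minimal sketch of that lookup, the snippet below selects one entry of the learnables table by layer and parameter name. To stay runnable without a trained network, it builds a small stand-in table with random values; the variable names learnables, isRow, and w are illustrative assumptions, and the column names Layer, Parameter, and Value follow the table shown in this thread.

```matlab
% Stand-in for qDetailsQuantized.QuantizedLearnables (hypothetical random data).
% With a real quantized network, one would instead use:
%   learnables = qDetailsQuantized.QuantizedLearnables;
Layer     = ["conv_1"; "conv_1"];
Parameter = ["Weights"; "Bias"];
Value     = {int8(randi([-127 127], 3, 3, 1, 60)); ...
             int32(randi([-1000 1000], 1, 1, 60))};
learnables = table(Layer, Parameter, Value);

% Select the row for a given layer and parameter by name.
isRow = learnables.Layer == "conv_1" & learnables.Parameter == "Weights";

% Brace indexing unwraps the cell and returns the integer array itself.
% (This assumes exactly one row matches; zero matches would raise an error.)
w = learnables.Value{isRow};

disp(class(w))   % class of the extracted array
disp(size(w))    % dimensions of the extracted array
```

Selecting by name rather than by row number keeps the lookup valid if the order of the rows changes.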

Comments: 3

Jisu Kwon, May 30, 2024

Hello Angelo Yeo,

Thank you for your answer.

Following your answer, I can see the quantization details of the layers.

>> qDetailsQuantized.QuantizedLearnables
ans =
8×3 table
Layer       Parameter    Value
________    _________    _________________
"conv_1"    "Weights"    {3×3×1×60  int8 }
"conv_1"    "Bias"       {1×1×60    int32}
"conv_2"    "Weights"    {3×3×60×60 int8 }
"conv_2"    "Bias"       {1×1×60    int32}
"conv_3"    "Weights"    {3×3×60×56 int8 }
"conv_3"    "Bias"       {1×1×56    int32}
"conv_4"    "Weights"    {3×3×56×12 int8 }
"conv_4"    "Bias"       {1×1×12    int32}

But this still leaves me stuck. I know the network is quantized:

>> qDetailsQuantized.IsQuantized
ans =
logical
1

But what I want is the exact weight and bias values, which have type INT8 (or INT32).

If I display a bias value from a layer of the quantized network, it is still shown as a floating-point value.

>> disp(quantObj.Layers(2).Bias(:,:,1))
-6.9011793e-12

Thank you in advance!

Jisu Kwon, May 30, 2024

I found it: qDetailsQuantized.QuantizedLearnables was what I wanted.

The values were already there, in the Value column of the table.

>> qDetailsQuantized.QuantizedLearnables
ans =
8×3 table
Layer       Parameter    Value
________    _________    _________________
"conv_1"    "Weights"    {3×3×1×60  int8 }
"conv_1"    "Bias"       {1×1×60    int32}
"conv_2"    "Weights"    {3×3×60×60 int8 }
"conv_2"    "Bias"       {1×1×60    int32}
"conv_3"    "Weights"    {3×3×60×56 int8 }
"conv_3"    "Bias"       {1×1×56    int32}
"conv_4"    "Weights"    {3×3×56×12 int8 }
"conv_4"    "Bias"       {1×1×12    int32}

I can access the values like this:

>> conv_1_weight = qDetailsQuantized.QuantizedLearnables.Value(1)
conv_1_weight =
1×1 cell array
{3×3×1×60 int8}
>> conv_1_weight{:,:,:,1}
3×3×1×60 int8 array
ans(:,:,1,1) =
   18   -16   -50
   -6   -54   -10
  -37   -49   -18

Thanks again for your response!

Angelo Yeo, May 30, 2024

Yes, exactly. Thanks for the feedback. It's great to know it worked for you.
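
For the export step that the question title asks about, one possible approach is to write the extracted integer array to disk once it has been pulled out of QuantizedLearnables. The sketch below is only an illustration under stated assumptions: the file names are hypothetical, and w is random stand-in data so that the snippet runs without a trained network. It shows a MAT-file export and a raw binary export, then reads the binary file back to check the round trip.

```matlab
% Stand-in for one entry of QuantizedLearnables.Value (hypothetical random data).
% With a real quantized network, one would instead use:
%   w = qDetailsQuantized.QuantizedLearnables.Value{1};
w = int8(randi([-127 127], 3, 3, 1, 60));

% Option 1: MAT-file, which preserves the int8 class and the array shape.
matFile = fullfile(tempdir, 'conv_1_weights.mat');   % hypothetical file name
save(matFile, 'w');

% Option 2: raw binary, one byte per weight, in MATLAB's column-major order.
binFile = fullfile(tempdir, 'conv_1_weights.bin');   % hypothetical file name
fid = fopen(binFile, 'w');
fwrite(fid, w, 'int8');
fclose(fid);

% Read the binary file back and compare it with the original array.
fid = fopen(binFile, 'r');
wBack = reshape(fread(fid, Inf, 'int8=>int8'), size(w));
fclose(fid);
disp(isequal(w, wBack))
```

The raw binary file carries no shape information, so whatever reads it needs to know the array dimensions and the column-major element order.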
