Deep Learning Toolbox Model Compression Library

Optimize deep learning models with efficient compression techniques

MathWorks Fixed Point Team

다운로드 수: 3.3K

(10)

2026/6/17

다운로드

팔로우

다운로드

팔로우

Deep Learning Toolbox Model Compression Library enables compression of your deep learning models with pruning, projection, and quantization to reduce their memory footprint and computational requirements.

Pruning and projection are structural compression techniques that reduce the size of deep neural networks by removing learnables and filters that have the smallest impact on inference accuracy.

Quantization to 8-bit integers (INT8) is supported for CPUs, FPGAs, and NVIDIA GPUs, for supported layers. The library enables you to collect layer-level data on the weights, activations, and intermediate computations. Using this data, the library quantizes your model and provides metrics to validate the accuracy of the quantized network against the single precision baseline. The iterative workflow allows you to optimize the quantization strategy.

As of R2024b, you can export quantized networks to Simulink deep learning layer blocks for simulation and deployment to embedded systems.

Please refer to the documentation here: https://www.mathworks.com/help/deeplearning/quantization.html

Quantization Workflow Prerequisites can be found here:

https://www.mathworks.com/help/deeplearning/ug/quantization-workflow-prerequisites.html

If you have download or installation problems, please contact Technical Support - www.mathworks.com/contact_ts

Additional Resources

Learn more about MATLAB and Simulink for tinyML
Quantization Aware Training (QAT) with MobileNet-v2 (Example, GitHub Repo)
Overview Video - https://www.youtube.com/watch?v=jufOpBeSvHM

카테고리

Help Center 및 MATLAB Answers에서 Deep Learning Toolbox에 대해 자세히 알아보기

MATLAB 릴리스 호환 정보

R2020a에서 R2026b까지의 릴리스와 호환

플랫폼 호환성

Windows
macOS (Apple Silicon)
macOS (Intel)
Linux

Deep Learning Toolbox Model Compression Library

카테고리

태그

필수 제품:

MATLAB 릴리스 호환 정보

플랫폼 호환성