신경망 압축 시작하기

Deep Learning Toolbox™ Model Compression Library의 기본 사항 알아보기

Deep Learning Toolbox를 Deep Learning Toolbox Model Compression Library 지원 패키지와 함께 사용하여 심층 신경망의 메모리 사용량과 계산 요구 사항을 줄입니다.

1차 테일러 근사를 사용하여 컨벌루션 계층에서 필터를 가지치기합니다.
계층 활성화 부분에 주성분 분석(PCA)을 수행하여 계층을 사영합니다.
계층의 가중치, 편향 및 활성화를 정수 데이터형으로 스케일링한 낮은 정밀도로 양자화합니다.

그런 다음 압축된 신경망에서 코드를 생성하여 원하는 하드웨어에 배포할 수 있습니다.

Diagram of suggested compression workflow: first pruning, then projection, then quantization, then code generation.

도움말 항목

Reduce Memory Footprint of Deep Neural Networks
Learn about neural network compression techniques, including pruning, projection, and quantization.

추천 예제

Analyze and Compress 1-D Convolutional Neural Network

Analyze and Compress 1-D Convolutional Neural Network

Analyze 1-D convolutional network for compression and compress it using Taylor pruning and projection.

R2024b 이후
라이브 스크립트 열기

Export Quantized Networks to Simulink and Generate Code

Export Quantized Networks to Simulink and Generate Code

Export a quantized neural network to Simulink and generate code from the exported model.

라이브 스크립트 열기

Parameter Pruning and Quantization of Image Classification Network

Parameter Pruning and Quantization of Image Classification Network

Use parameter pruning and quantization to reduce network size.

라이브 스크립트 열기