이 제출물을 팔로우합니다
- 팔로우하는 게시물 피드에서 업데이트를 확인할 수 있습니다
- 정보 수신 기본 설정에 따라 이메일을 받을 수 있습니다
The Vision Transformer (ViT) model is a pretrained transformer model for image classification. It is also used as a backbone for other computer vision tasks such as object detection. The support package consists of three variants of the ViT model:
- Base-16 model
- Small-16 model
- Tiny-16 model
Here, “base”, “small” and “tiny” represent the model architecture and size, and 16 represents the patch size hyper-parameter. Each variant has been pretrained on ImageNet data set with input resolution of 384 and is stored as a .MAT file.
MATLAB 릴리스 호환 정보
- R2023b에서 R2026b까지의 릴리스와 호환
플랫폼 호환성
- Windows
- macOS (Apple Silicon)
- macOS (Intel)
- Linux
