Instance Segmentation

Label ground truth, perform instance segmentation using pretrained AI models such as SOLOv2, Mask R-CNN, and SAM, and train custom networks using transfer learning

The instance segmentation tools in Computer Vision Toolbox™ enable you to detect, classify, and segment individual objects in an image, even when multiple objects overlap. You can start by creating labeled ground truth using the Image Labeler and Video Labeler apps. Both apps support interactive and AI-assisted annotation of object instances using polygon ROIs or rectangle ROIs. For more information, see Label Objects Using Polygons for Instance Segmentation.

The toolbox provides pretrained instance segmentation networks such as SOLOv2 and Mask R-CNN. You can use these models for inference out of the box, or adapt them to a specific application using transfer learning. For more information, see Get Started with Instance Segmentation Using Deep Learning and Get Started with SOLOv2 for Instance Segmentation. For class-agnostic instance segmentation, the toolbox supports the Segment Anything Model (SAM) through the imsegsam function and the segmentAnythingModel object.
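
As a minimal sketch of out-of-the-box inference, the following code loads a pretrained SOLOv2 network, segments a sample image, and overlays the predicted masks. It assumes that the SOLOv2 support package for Computer Vision Toolbox is installed; the "resnet50-coco" network name and the visionteam.jpg sample image are illustrative choices, not requirements.

```matlab
% Load a pretrained SOLOv2 network (assumes the SOLOv2 support package is installed).
model = solov2("resnet50-coco");

% Read a sample image. Any RGB image can be used here.
I = imread("visionteam.jpg");

% Predict one binary mask, one class label, and one confidence score per detected object.
[masks,labels,scores] = segmentObjects(model,I);

% Overlay the predicted masks on the image and display the result.
overlaid = insertObjectMask(I,masks);
imshow(overlaid)
```

The masks output is a logical array with one page per detected object, so size(masks,3) gives the number of predicted instances.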

To prepare training data, the toolbox provides data augmentation and preprocessing functions, along with utilities for managing and organizing data sets. For more information, see Postprocess Exported Labels for Instance Segmentation Training.
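
One common preprocessing step is converting a polygon label into a binary mask and a bounding box. The sketch below illustrates this with poly2mask; the image size and polygon vertices are hypothetical values chosen only for illustration, and the actual format of exported labels may differ.

```matlab
% Hypothetical image size as [rows cols] and polygon vertices as [x y] pairs.
imageSize = [100 120];
polygon = [20 30; 80 30; 80 70; 20 70];

% Rasterize the polygon into a logical mask of the same size as the image.
mask = poly2mask(polygon(:,1),polygon(:,2),imageSize(1),imageSize(2));

% Derive the bounding box of the mask region in [x y width height] format.
stats = regionprops(mask,"BoundingBox");
bbox = stats(1).BoundingBox;

% Resize the mask with nearest-neighbor interpolation so that it stays binary.
maskSmall = imresize(mask,0.5,"nearest");
```

Nearest-neighbor interpolation is used when resizing because interpolating between mask pixels would otherwise produce values that are neither object nor background.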

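After generating predictions using a pretrained or custom model, you can evaluate instance segmentation performance and generate detailed information on segmentation accuracy, object-level precision, and performance across different object sizes. These metrics help you assess the quality of mask predictions and bounding box localization. For more information, see evaluateInstanceSegmentation.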
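
Object-level metrics for instance segmentation are typically built on the intersection over union (IoU) between a predicted mask and a ground truth mask. The sketch below computes the IoU for one pair of synthetic masks to illustrate the idea. It is not a description of how evaluateInstanceSegmentation is implemented, and the 0.5 overlap threshold is an assumed value.

```matlab
% Synthetic ground truth and predicted masks (hypothetical data).
gtMask = false(100,100);
gtMask(20:60,20:60) = true;
predMask = false(100,100);
predMask(30:70,30:70) = true;

% Count the pixels in the overlap and in the combined region of the two masks.
intersectionArea = nnz(gtMask & predMask);
unionArea = nnz(gtMask | predMask);

% IoU is the ratio of overlap to combined area, in the range [0,1].
iou = intersectionArea/unionArea;

% Treat the prediction as a match when the IoU meets an assumed threshold.
overlapThreshold = 0.5;
isMatch = iou >= overlapThreshold;
```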

The toolbox also supports pose estimation of 3-D objects using instance segmentation through the Pose Mask R-CNN framework, which enables precise analysis of object orientation and structure. For more information, see Perform 6-DoF Pose Estimation for Bin Picking Using Deep Learning.

Instance segmentation using SOLOv2: Left — A segmented and labeled road scenario using a sample modified RGB image from the CamVid data set, Right — A segmented image of PVC pipe connectors

Apps

Image Labeler	Label images for computer vision applications
Video Labeler	Label video for computer vision applications

Functions


Pretrained Instance Segmentation Networks

SOLOv2

`solov2`	Segment objects using SOLOv2 instance segmentation network (Since R2023b)
`segmentObjects`	Segment objects using SOLOv2 instance segmentation (Since R2023b)

Mask R-CNN

`maskrcnn`	Detect objects using Mask R-CNN instance segmentation (Since R2021b)
`segmentObjects`	Segment objects using Mask R-CNN instance segmentation (Since R2021b)

Segment Anything Model (SAM)

`imsegsam`	Perform automatic full image segmentation using Segment Anything Model 2 (SAM 2) (Since R2024b)
`segmentAnythingModel`	Pretrained Segment Anything Model (SAM) for semantic segmentation (Since R2024a)

Train Custom Instance Segmentation Networks

Load Training Data

`boxLabelDatastore`	Datastore for bounding box label data
`groundTruth`	Ground truth label data
`imageDatastore`	Datastore for image data
`combine`	Combine data from multiple datastores

Train Instance Segmentation Networks

`trainSOLOV2`	Train SOLOv2 network to perform instance segmentation (Since R2023b)
`trainMaskRCNN`	Train Mask R-CNN network to perform instance segmentation (Since R2022a)

Augment and Preprocess Training Data

`poly2mask`	Convert region of interest (ROI) polygon to region mask
`bwboundaries`	Trace object boundaries in binary image
`balanceBoxLabels`	Balance bounding box labels for object detection
`bboxcrop`	Crop bounding boxes
`bboxerase`	Remove bounding boxes
`bboxresize`	Resize bounding boxes
`bboxwarp`	Apply geometric transformation to bounding boxes
`bbox2points`	Convert rectangle to corner points list
`imwarp`	Apply geometric transformation to image
`imcrop`	Crop image
`imresize`	Resize image
`randomAffine2d`	Create randomized 2-D affine transformation
`centerCropWindow2d`	Create rectangular center cropping window
`randomWindow2d`	Randomly select rectangular region in image

Evaluate Prediction Results

`evaluateInstanceSegmentation`	Evaluate instance segmentation data set against ground truth (Since R2022b)
`instanceSegmentationMetrics`	Instance segmentation quality metrics (Since R2022b)
`metricsByArea`	Evaluate instance segmentation across object mask size ranges (Since R2023b)

Visualize Results

`insertObjectMask`	Insert masks in image or video stream
`insertObjectAnnotation`	Annotate truecolor or grayscale image or video
`insertShape`	Insert shapes in image or video
`insertText`	Insert text in image or video
`showShape`	Display shapes on image, video, or point cloud

Perform Pose Estimation Using Instance Segmentation

`posemaskrcnn`	Predict object pose using Pose Mask R-CNN pose estimation (Since R2024a)
`predictPose`	Estimate object pose using Pose Mask R-CNN deep learning network (Since R2024a)
`trainPoseMaskRCNN`	Train Pose Mask R-CNN network to perform pose estimation (Since R2024a)

Topics

Get Started

Get Started with Instance Segmentation Using Deep Learning
Segment objects using an instance segmentation model such as SOLOv2 or Mask R-CNN.
Get Started with SOLOv2 for Instance Segmentation
Perform multiclass instance segmentation using SOLOv2 and deep learning.
Getting Started with Mask R-CNN for Instance Segmentation
Perform multiclass instance segmentation using Mask R-CNN and deep learning.
Get Started with Segment Anything Model for Image Segmentation
Perform interactive image segmentation using Segment Anything Model 2 (SAM 2) and deep learning.

Create Ground Truth for Instance Segmentation

Label Objects Using Polygons for Instance Segmentation
Label ground truth objects using polygons for instance segmentation.
Postprocess Exported Labels for Instance Segmentation Training
Postprocess exported ground truth labels and create training datastore for training instance segmentation networks such as SOLOv2 or Mask R-CNN.

Prepare Training Data for Instance Segmentation

Create Instance Segmentation Training Data From Ground Truth
This example shows how to create instance segmentation training data from a groundTruth object.
Get Started with Image Preprocessing and Augmentation for Deep Learning
Preprocess data for deep learning applications with deterministic operations such as resizing, or augment training data with randomized operations such as random cropping.
Datastores for Deep Learning (Deep Learning Toolbox)
Learn how to use datastores in deep learning applications.

Featured Examples


Automate Ground Truth Polygon Labeling Using Grounded SAM Model

Combine Grounding DINO and the Segment Anything Model 2 (SAM 2) to automatically produce polygon labels using the Video Labeler app.

Since R2026a


Automate Ground Truth Labeling for Instance Segmentation

Create an automation algorithm to automatically label data for instance segmentation using a pretrained SOLOv2 network in the Video Labeler app.

Since R2026a


Automatically Search and Label Video Frames Using VLMs

Automatically search and detect objects based on natural language text queries using vision-language models (VLMs).

Since R2026a

Perform Instance Segmentation Using SOLOv2

Segment object instances of randomly rotated machine parts in a bin using a deep learning SOLOv2 network.


Perform Instance Segmentation Using Mask R-CNN

Segment individual instances of people and cars using a multiclass mask region-based convolutional neural network (R-CNN).


Automatically Label Ground Truth Using Segment Anything Model

Produce pixel labels for semantic segmentation using the Segment Anything Model (SAM) in the Image Labeler app. The SAM is an automatic segmentation technique that you can use to segment object regions to label with just a few clicks, or to automatically segment the entire image and instantly create labels for selected regions. In this example, you interactively label pixels for semantic segmentation in two ways.

Since R2024b

Segment Objects in Interactive ROIs Using Segment Anything Model

This example shows how to interactively segment objects in a selected region of interest (ROI) in an image using the Segment Anything Model (SAM).

Perform 6-DoF Pose Estimation for Bin Picking Using Deep Learning

Perform six degrees-of-freedom (6-DoF) pose estimation by estimating the 3-D position and orientation of machine parts in a bin using RGB-D images and a deep learning network.
