이 페이지의 내용은 이전 릴리스에 관한 것입니다. 해당 영문 페이지는 최신 릴리스에서 제거되었습니다.

객체 검출

컨벌루션 신경망(CNN 또는 ConvNet)을 사용하여 분류, 객체 검출, 전이 학습 수행, 커스터마이즈된 검출기 만들기

객체 검출은 영상 또는 비디오에서 객체 인스턴스를 찾기 위한 컴퓨터 비전 기법입니다. 객체 검출 알고리즘은 의미 있는 결과를 생성하기 위해 일반적으로 머신러닝 또는 딥러닝을 활용합니다. 사람은 영상이나 비디오를 보고 바로 관심 객체를 인식하고 찾을 수 있습니다. 객체 검출의 목표는 컴퓨터를 사용하여 이러한 사람의 지능을 재현하는 것입니다. 어떤 객체 검출 방법이 가장 적합한지는 사용자가 해결하려는 문제와 해당 응용 분야에 따라 다릅니다.

딥러닝 기법은 레이블이 지정된 훈련 영상이 아주 많이 필요하며, 따라서 모델 훈련에 걸리는 시간을 줄이기 위해 GPU를 사용할 것을 권장합니다. 딥러닝 기반의 객체 검출은 R-CNN, YOLO 같은 컨벌루션 신경망(CNN 또는 ConvNet)을 사용하거나 SSD(single-shot detection)를 사용합니다. 커스텀 객체 검출기를 훈련시킬 수도 있고, 사전 훈련된 신경망으로 시작해서 응용 사례에 맞게 미세 조정하는 전이 학습을 활용하는 방식으로 사전 훈련된 객체 검출기를 사용할 수도 있습니다. 컨벌루션 신경망을 사용하려면 Deep Learning Toolbox™가 필요합니다. 훈련과 예측은 CUDA^®가 사용 가능한 GPU에서 지원됩니다. GPU를 사용하는 것이 권장되며, 이를 위해서는 Parallel Computing Toolbox™가 필요합니다. 자세한 내용은 Computer Vision Toolbox 기본 설정 및 Parallel Computing Support in MathWorks Products (Parallel Computing Toolbox) 항목을 참조하십시오.

객체 검출을 위한 머신러닝 기법으로는 ACF(Aggregate Channel Features), HOG(Histograms of Oriented Gradient) 특징을 사용하는 SVM(서포트 벡터 머신) 분류, 사람의 얼굴이나 상반신 검출을 위한 Viola-Jones 알고리즘 등이 있습니다. 사전 훈련된 객체 검출기로 시작하거나 응용 사례에 적합한 커스텀 객체 검출기를 만들 수 있습니다.

Labeled boats, neural network, and person detector

앱

영상 레이블 지정기	컴퓨터 비전 응용 분야에서 영상에 레이블 지정
비디오 레이블 지정기	Label video for computer vision applications

함수

모두 확장

객체 검출

딥러닝 검출기

`rcnnObjectDetector`	Detect objects using R-CNN deep learning detector
`fastRCNNObjectDetector`	Detect objects using Fast R-CNN deep learning detector
`fasterRCNNObjectDetector`	Detect objects using Faster R-CNN deep learning detector
`ssdObjectDetector`	Detect objects using SSD deep learning detector (R2020a 이후)
`yolov2ObjectDetector`	Detect objects using YOLO v2 object detector
`yolov3ObjectDetector`	Detect objects using YOLO v3 object detector (R2021a 이후)
`yolov4ObjectDetector`	Detect objects using YOLO v4 object detector (R2022a 이후)
`solov2`	Segment objects using SOLOv2 instance segmentation network (R2023b 이후)
`maskrcnn`	Detect objects using Mask R-CNN instance segmentation (R2021b 이후)
`ocr`	Recognize text using optical character recognition

특징 기반 검출기

`readAprilTag`	Detect and estimate pose for AprilTag in image (R2020b 이후)
`readBarcode`	Detect and decode 1-D or 2-D barcode in image (R2020a 이후)
`acfObjectDetector`	Detect objects using aggregate channel features
`peopleDetectorACF`	Detect people using aggregate channel features
`vision.CascadeObjectDetector`	Detect objects using the Viola-Jones algorithm
`vision.ForegroundDetector`	Foreground detection using Gaussian mixture models
`vision.PeopleDetector`	(To be removed) Detect upright people using HOG features
`vision.BlobAnalysis`	Properties of connected regions

특징점을 사용한 객체 검출

`detectBRISKFeatures`	BRISK 특징 검출
`detectFASTFeatures`	FAST 알고리즘을 사용하여 코너 검출
`detectHarrisFeatures`	Harris–Stephens 알고리즘을 사용하여 코너 검출
`detectKAZEFeatures`	Detect KAZE features
`detectMinEigenFeatures`	최소 고유값 알고리즘을 사용하여 코너 검출
`detectMSERFeatures`	Detect MSER features
`detectORBFeatures`	Detect ORB keypoints
`detectSIFTFeatures`	SIFT(Scale-Invariant Feature Transform) 특징 검출 (R2021b 이후)
`detectSURFFeatures`	SURF 특징 검출
`extractFeatures`	Extract interest point descriptors
`matchFeatures`	매칭되는 특징 찾기

검출된 객체 선택

`selectStrongestBbox`	Select strongest bounding boxes from overlapping clusters using nonmaximal suppression (NMS)
`selectStrongestBboxMulticlass`	Select strongest multiclass bounding boxes from overlapping clusters using nonmaximal suppression (NMS)

커스텀 객체 검출기 훈련

훈련 데이터 불러오기

`boxLabelDatastore`	Datastore for bounding box label data (R2019b 이후)
`groundTruth`	Ground truth label data
`imageDatastore`	이미지 데이터의 데이터저장소
`objectDetectorTrainingData`	Create training data for an object detector
`ocrTrainingOptions`	Options for training OCR model (R2023a 이후)
`combine`	여러 데이터저장소의 데이터 결합

특징 기반 객체 검출기 훈련

`trainACFObjectDetector`	Train ACF object detector
`trainCascadeObjectDetector`	Train cascade object detector model
`trainImageCategoryClassifier`	Train an image category classifier

딥러닝 기반 객체 검출기 훈련

`trainRCNNObjectDetector`	Train R-CNN deep learning object detector
`trainFastRCNNObjectDetector`	Train Fast R-CNN deep learning object detector
`trainFasterRCNNObjectDetector`	Train Faster R-CNN deep learning object detector
`trainSSDObjectDetector`	Train an SSD deep learning object detector (R2020a 이후)
`trainYOLOv2ObjectDetector`	Train YOLO v2 object detector
`trainYOLOv4ObjectDetector`	Train YOLO v4 object detector (R2022a 이후)
`trainSOLOV2`	Train SOLOv2 network to perform instance segmentation (R2023b 이후)
`trainMaskRCNN`	Train Mask R-CNN network to perform instance segmentation (R2022a 이후)
`ocrTrainingOptions`	Options for training OCR model (R2023a 이후)
`trainOCR`	Train OCR model to recognize text in image (R2023a 이후)
`quantizeOCR`	Quantize OCR model (R2023a 이후)

딥러닝을 위한 훈련 데이터 증대 및 전처리

`balanceBoxLabels`	Balance bounding box labels for object detection (R2020a 이후)
`bboxcrop`	Crop bounding boxes (R2019b 이후)
`bboxerase`	Remove bounding boxes (R2021a 이후)
`bboxresize`	Resize bounding boxes (R2019b 이후)
`bboxwarp`	Apply geometric transformation to bounding boxes (R2019b 이후)
`bbox2points`	Convert rectangle to corner points list
`imwarp`	영상에 기하 변환 적용
`imcrop`	영상 자르기
`imresize`	이미지 크기 조정
`randomAffine2d`	Create randomized 2-D affine transformation (R2019b 이후)
`centerCropWindow2d`	사각 형태의 가운데 자르기 창 만들기 (R2019b 이후)
`randomWindow2d`	Randomly select rectangular region in image (R2021a 이후)
`integralImage`	Calculate 2-D integral image

객체 검출 심층 신경망 설계

R-CNN(Regions With Convolutional Neural Networks)

`rcnnBoxRegressionLayer`	Box regression layer for Fast and Faster R-CNN
`fasterRCNNLayers`	Create a faster R-CNN object detection network (R2019b 이후)
`rpnSoftmaxLayer`	Softmax layer for region proposal network (RPN)
`rpnClassificationLayer`	Classification layer for region proposal networks (RPNs)
`regionProposalLayer`	Region proposal layer for Faster R-CNN
`roiAlignLayer`	Non-quantized ROI pooling layer for Mask-CNN (R2020b 이후)
`roiInputLayer`	ROI input layer for Fast R-CNN
`roiMaxPooling2dLayer`	Neural network layer used to output fixed-size feature maps for rectangular ROIs
`roialign`	Non-quantized ROI pooling of `dlarray` data (R2021b 이후)

YOLO v2(You Only Look Once 버전 2)

`yolov2Layers`	Create YOLO v2 object detection network
`yolov2TransformLayer`	Create transform layer for YOLO v2 object detection network
`yolov2OutputLayer`	Create output layer for YOLO v2 object detection network
`spaceToDepthLayer`	Space to depth layer (R2020b 이후)

중점 손실 계층

`focalLossLayer`	(To be removed) Create focal loss layer using focal loss function (R2020a 이후)
`focalCrossEntropy`	Compute focal cross-entropy loss (R2020b 이후)

SSD(Single Shot Detector)

ssdMergeLayer Create SSD merge layer for object detection (R2020a 이후)

앵커 상자

estimateAnchorBoxes Estimate anchor boxes for deep learning object detectors (R2019b 이후)

검출 결과 시각화

`cuboid2img`	Project cuboids from 3-D world coordinates to 2-D image coordinates (R2022b 이후)
`insertObjectAnnotation`	Annotate truecolor or grayscale image or video
`insertObjectMask`	Insert masks in image or video stream (R2020b 이후)
`insertShape`	영상 또는 비디오에 형태 삽입
`showShape`	Display shapes on image, video, or point cloud (R2020b 이후)

예측 결과 평가

`evaluateObjectDetection`	Evaluate object detection data set against ground truth (R2023b 이후)
`objectDetectionMetrics`	Object detection quality metrics (R2023b 이후)
`evaluateInstanceSegmentation`	Evaluate instance segmentation data set against ground truth (R2022b 이후)
`instanceSegmentationMetrics`	Instance segmentation quality metrics (R2022b 이후)
`bboxOverlapRatio`	Compute bounding box overlap ratio
`bboxPrecisionRecall`	Compute bounding box precision and recall against ground truth
`evaluateOCR`	Evaluate OCR results against ground truth (R2023a 이후)
`evaluateDetectionMissRate`	(To be removed) Evaluate miss rate metric for object detection
`evaluateDetectionPrecision`	(To be removed) Evaluate precision metric for object detection
`evaluateDetectionAOS`	(To be removed) Evaluate average orientation similarity metric for object detection (R2020a 이후)

블록

Deep Learning Object Detector

훈련된 딥러닝 객체 검출기를 사용하여 객체 검출 (R2021b 이후)

도움말 항목

시작하기

Getting Started with Object Detection Using Deep Learning
Perform object detection using deep learning neural networks.
Choose an Object Detector
Compare object detection deep learning models, such as YOLOX and YOLOv4.
Local Feature Detection and Extraction
Learn the benefits and applications of local feature detection and extraction.
Get Started with Cascade Object Detector
Train a custom classifier.
Point Feature Types
Choose functions that return and accept points objects for several types of features.
Getting Started with OCR
Detect and recognize text in multiple languages, train OCR models to recognize custom text.
Image Classification with Bag of Visual Words
Use the Computer Vision Toolbox™ functions for image category classification by creating a bag of visual words.
Coordinate Systems
Specify pixel Indices, spatial coordinates, and 3-D coordinate systems.

객체 검출 및 인스턴스 분할을 위해 데이터 훈련시키기

Get Started with the Image Labeler
Interactively label rectangular ROIs for object detection, pixels for semantic segmentation, polygons for instance segmentation, and scenes for image classification.
Get Started with the Video Labeler
Interactively label rectangular ROIs for object detection, pixels for semantic segmentation, polygons for instance segmentation, and scenes for image classification in a video or image sequence.
Datastores for Deep Learning (Deep Learning Toolbox)
Learn how to use datastores in deep learning applications.
Get Started with SOLOv2 for Instance Segmentation
Perform multiclass instance segmentation using SOLOv2 and deep learning.
Getting Started with Mask R-CNN for Instance Segmentation
Perform multiclass instance segmentation using Mask R-CNN and deep learning.
Training Data for Object Detection and Semantic Segmentation
Create training data for object detection or semantic segmentation using the Image Labeler or Video Labeler.
Get Started with Image Preprocessing and Augmentation for Deep Learning
Preprocess data for deep learning applications with deterministic operations such as resizing, or augment training data with randomized operations such as random cropping.

딥러닝 시작하기

심층 신경망 디자이너 (Deep Learning Toolbox)
딥러닝 계층 목록 (Deep Learning Toolbox)
MATLAB^®에서 제공하는 딥러닝 계층에 대해 알아봅니다.
MATLAB의 딥러닝 (Deep Learning Toolbox)
사전 훈련된 신경망 및 전이 학습, 그리고 GPU, CPU, 클러스터 및 클라우드에서의 훈련 등 분류 및 회귀에 컨벌루션 신경망을 사용하여 MATLAB의 딥러닝 기능을 알아봅니다.
사전 훈련된 심층 신경망 (Deep Learning Toolbox)
분류, 전이 학습 및 특징 추출을 위해 사전 훈련된 컨벌루션 신경망을 다운로드하고 사용하는 방법을 알아봅니다.