객체 검출

컨벌루션 신경망(CNN 또는 ConvNet)을 사용하여 분류, 객체 검출, 전이 학습 수행, 커스터마이즈된 검출기 만들기

객체 검출은 영상 또는 비디오에서 객체 인스턴스를 찾기 위한 컴퓨터 비전 기법입니다. 객체 검출 알고리즘은 의미 있는 결과를 생성하기 위해 일반적으로 머신러닝 또는 딥러닝을 활용합니다. 사람은 영상이나 비디오를 보고 바로 관심 객체를 인식하고 찾을 수 있습니다. 객체 검출의 목표는 컴퓨터를 사용하여 이러한 사람의 지능을 재현하는 것입니다. 어떤 객체 검출 방법이 가장 적합한지는 사용자가 해결하려는 문제와 해당 응용 분야에 따라 다릅니다.

딥러닝 기법은 레이블이 지정된 훈련 영상이 아주 많이 필요하며, 따라서 모델 훈련에 걸리는 시간을 줄이기 위해 GPU를 사용할 것을 권장합니다. 딥러닝 기반의 객체 검출은 YOLO 같은 컨벌루션 신경망(CNN 또는 ConvNet)을 사용하거나 SSD(single-shot detection)를 사용합니다. 커스텀 객체 검출기를 훈련시킬 수도 있고, 사전 훈련된 신경망으로 시작해서 응용 사례에 맞게 미세 조정하는 전이 학습을 활용하는 방식으로 사전 훈련된 객체 검출기를 사용할 수도 있습니다. 컨벌루션 신경망을 사용하려면 Deep Learning Toolbox™가 필요합니다. 훈련과 예측은 CUDA^®가 사용 가능한 GPU에서 지원됩니다. GPU를 사용하는 것이 권장되며, 이를 위해서는 Parallel Computing Toolbox™가 필요합니다. 자세한 내용은 Computer Vision Toolbox 기본 설정 및 Parallel Computing Support in MathWorks Products (Parallel Computing Toolbox) 항목을 참조하십시오.

객체 검출을 위한 머신러닝 기법으로는 ACF(Aggregate Channel Features), HOG(Histograms of Oriented Gradient) 특징을 사용하는 SVM(서포트 벡터 머신) 분류, 사람의 얼굴이나 상반신 검출을 위한 Viola-Jones 알고리즘 등이 있습니다. 사전 훈련된 객체 검출기로 시작하거나 응용 사례에 적합한 커스텀 객체 검출기를 만들 수 있습니다.

Labeled boats, neural network, and person detector

앱

영상 레이블 지정기	컴퓨터 비전 응용 분야에서 영상에 레이블 지정
비디오 레이블 지정기	Label video for computer vision applications

함수

모두 확장

객체 검출

딥러닝 검출기

`rtmdetObjectDetector`	Detect objects using RTMDet object detector (R2024b 이후)
`ssdObjectDetector`	Detect objects using SSD deep learning detector
`yolov2ObjectDetector`	Detect objects using YOLO v2 object detector
`yolov3ObjectDetector`	Detect objects using YOLO v3 object detector (R2021a 이후)
`yolov4ObjectDetector`	Detect objects using YOLO v4 object detector (R2022a 이후)
`yoloxObjectDetector`	Detect objects using YOLOX object detector (R2023b 이후)
`peopleDetector`	Detect people using pretrained deep learning object detector (R2024b 이후)
`faceDetector`	Detect faces using pretrained RetinaFace face detector (R2025a 이후)

특징 기반 검출기

`readAprilTag`	영상에서 AprilTag 검출 및 자세 추정
`readArucoMarker`	Detect and estimate pose for ArUco marker in image (R2024a 이후)
`generateArucoMarker`	Generate ArUco marker images (R2024a 이후)
`readBarcode`	Detect and decode 1-D or 2-D barcode in image
`acfObjectDetector`	Detect objects using aggregate channel features
`peopleDetectorACF`	ACF(Aggregate Channel Features)를 사용하여 사람 검출
`vision.CascadeObjectDetector`	Detect objects using the Viola-Jones algorithm
`vision.ForegroundDetector`	Foreground detection using Gaussian mixture models
`vision.BlobAnalysis`	Properties of connected regions

특징점을 사용한 객체 검출

`detectBRISKFeatures`	BRISK 특징 검출
`detectFASTFeatures`	FAST 알고리즘을 사용하여 코너 검출
`detectHarrisFeatures`	Harris–Stephens 알고리즘을 사용하여 코너 검출
`detectKAZEFeatures`	Detect KAZE features
`detectMinEigenFeatures`	최소 고유값 알고리즘을 사용하여 코너 검출
`detectMSERFeatures`	Detect MSER features
`detectORBFeatures`	Detect ORB keypoints
`detectSIFTFeatures`	SIFT(Scale-Invariant Feature Transform) 특징 검출 (R2021b 이후)
`detectSURFFeatures`	SURF 특징 검출
`extractFeatures`	Extract interest point descriptors
`matchFeatures`	매칭되는 특징 찾기

검출된 객체 선택

`selectStrongestBbox`	Select strongest bounding boxes from overlapping clusters using nonmaximal suppression (NMS)
`selectStrongestBboxMulticlass`	Select strongest multiclass bounding boxes from overlapping clusters using nonmaximal suppression (NMS)

커스텀 객체 검출기 훈련

훈련 데이터 불러오기

`boxLabelDatastore`	Datastore for bounding box label data
`groundTruth`	Ground truth label data
`imageDatastore`	이미지 데이터의 데이터저장소
`objectDetectorTrainingData`	Create training data for an object detector
`combine`	여러 데이터저장소의 데이터 결합

특징 기반 객체 검출기 훈련

`trainACFObjectDetector`	Train ACF object detector
`trainCascadeObjectDetector`	Train cascade object detector model
`trainImageCategoryClassifier`	Train an image category classifier

딥러닝 기반 객체 검출기 훈련

`trainSSDObjectDetector`	Train SSD deep learning object detector
`trainYOLOv2ObjectDetector`	Train YOLO v2 object detector
`trainYOLOv3ObjectDetector`	Train YOLO v3 object detector (R2024a 이후)
`trainYOLOv4ObjectDetector`	Train YOLO v4 object detector (R2022a 이후)
`trainYOLOXObjectDetector`	Train YOLOX object detector (R2023b 이후)

딥러닝을 위한 훈련 데이터 증강 및 전처리

`balanceBoxLabels`	Balance bounding box labels for object detection
`bboxcrop`	Crop bounding boxes
`bboxerase`	Remove bounding boxes (R2021a 이후)
`bboxresize`	Resize bounding boxes
`bboxwarp`	Apply geometric transformation to bounding boxes
`bbox2points`	Convert rectangle to corner points list
`blockLocationsWithROI`	Select image block locations that contain bounding box ROIs (R2025a 이후)
`imwarp`	영상에 기하 변환 적용
`imcrop`	영상 자르기
`imresize`	이미지 크기 조정
`randomAffine2d`	Create randomized 2-D affine transformation
`centerCropWindow2d`	사각 형태의 가운데 자르기 윈도우 만들기
`randomWindow2d`	Randomly select rectangular region in image (R2021a 이후)
`integralImage`	Calculate 2-D integral image

객체 검출 심층 신경망 설계

R-CNN(Regions With Convolutional Neural Networks)

`roiAlignLayer`	Non-quantized ROI pooling layer for Mask-CNN
`roiMaxPooling2dLayer`	Neural network layer used to output fixed-size feature maps for rectangular ROIs
`roialign`	Non-quantized ROI pooling of `dlarray` data (R2021b 이후)

YOLO v2(You Only Look Once 버전 2)

`yolov2TransformLayer`	Create transform layer for YOLO v2 object detection network
`spaceToDepthLayer`	Space to depth layer

중점 손실

focalCrossEntropy Compute focal cross-entropy loss

SSD(Single Shot Detector)

ssdMergeLayer Create SSD merge layer for object detection

앵커 상자

estimateAnchorBoxes Estimate anchor boxes for deep learning object detectors

검출 결과 시각화

`cuboid2img`	Project cuboids from 3-D world coordinates to 2-D image coordinates (R2022b 이후)
`insertObjectAnnotation`	트루컬러 또는 회색조 영상 또는 비디오에 주석 추가
`insertObjectMask`	Insert masks in image or video stream
`insertShape`	영상 또는 비디오에 형태 삽입
`showShape`	Display shapes on image, video, or point cloud

예측 결과 평가

`evaluateObjectDetection`	Evaluate object detection data set against ground truth (R2023b 이후)
`objectDetectionMetrics`	Object detection quality metrics (R2023b 이후)
`mAPObjectDetectionMetric`	Mean average precision (mAP) metric for object detection (R2024a 이후)
`bboxOverlapRatio`	Compute bounding box overlap ratio
`bboxPrecisionRecall`	Compute bounding box precision and recall against ground truth

블록

Deep Learning Object Detector

훈련된 딥러닝 객체 검출기를 사용하여 객체 검출 (R2021b 이후)

도움말 항목

시작하기

Get Started with Object Detection Using Deep Learning
Perform object detection using deep learning neural networks such as YOLOX, YOLO v4, and SSD.
Choose an Object Detector
Compare object detection deep learning models, such as YOLOX, YOLO v4, RTMDet, and SSD.
Local Feature Detection and Extraction
Learn the benefits and applications of local feature detection and extraction.
Get Started with Cascade Object Detector
Train a custom classifier.
Point Feature Types
Choose functions that return and accept points objects for several types of features.
Getting Started with OCR
Detect and recognize text in multiple languages, train OCR models to recognize custom text.
Image Classification with Bag of Visual Words
Use the Computer Vision Toolbox™ functions for image category classification by creating a bag of visual words.

객체 검출 및 인스턴스 분할을 위해 데이터 훈련시키기

Get Started with the Image Labeler
Interactively label rectangular ROIs for object detection, pixels for semantic segmentation, polygons for instance segmentation, and scenes for image classification.
Get Started with the Video Labeler
Interactively label rectangular ROIs for object detection, pixels for semantic segmentation, polygons for instance segmentation, and scenes for image classification in a video or image sequence.
Datastores for Deep Learning (Deep Learning Toolbox)
Learn how to use datastores in deep learning applications.
Training Data for Object Detection and Semantic Segmentation
Create training data for object detection or semantic segmentation using the Image Labeler or Video Labeler.
Get Started with Image Preprocessing and Augmentation for Deep Learning
Preprocess data for deep learning applications with deterministic operations such as resizing, or augment training data with randomized operations such as random cropping.

딥러닝 시작하기

MATLAB의 딥러닝 (Deep Learning Toolbox)
사전 훈련된 신경망 및 전이 학습, 그리고 GPU, CPU, 클러스터 및 클라우드에서의 훈련 등 분류 및 회귀에 컨벌루션 신경망을 사용하여 MATLAB^®의 딥러닝 기능을 알아봅니다.
사전 훈련된 심층 신경망 (Deep Learning Toolbox)
분류, 전이 학습 및 특징 추출을 위해 사전 훈련된 컨벌루션 신경망을 다운로드하고 사용하는 방법을 알아봅니다.

추천 예제

Detect Small Objects Using Tiled Training of YOLOX Network

Detect small objects in full-resolution images using tiled training of a you only look once version X (YOLOX) deep learning network.

R2024b 이후
라이브 스크립트 열기

Object Detection in Large Satellite Imagery Using Deep Learning

Perform object detection on large satellite imagery using deep learning.

라이브 스크립트 열기

YOLO v4 딥러닝을 사용한 객체 검출

이 예제에서는 YOLO v4(You Only Look Once Version 4) 딥러닝 신경망을 사용하여 영상에서 객체를 검출하는 방법을 보여줍니다. 이 예제에서는 다음을 수행합니다

라이브 스크립트 열기

Multiclass Object Detection Using YOLO v2 Deep Learning

Train a YOLO v2 multiclass object detector and evaluate object detector performance across selected classes and overlap thresholds.

R2024b 이후
라이브 스크립트 열기

Perform 6-DoF Pose Estimation for Bin Picking Using Deep Learning

Perform six degrees-of-freedom (6-DoF) pose estimation by estimating the 3-D position and orientation of machine parts in a bin using RGB-D images and a deep learning network.

라이브 스크립트 열기

Train Object Detectors in Experiment Manager

Use the Experiment Manager app to find optimal training options for object detectors.

스크립트 열기

영상 특징점을 사용하여 복잡한 장면에서 객체 찾기

이 예제에서는 객체의 참조 영상이 주어졌을 때 복잡한 장면에서 특정 객체를 검출하는 방법을 보여줍니다.

스크립트 열기

Read Barcodes in Image

Detect, decode, and localize 1-D and 2-D barcodes in an image.

라이브 스크립트 열기

Detect Cars Using Gaussian Mixture Models

Detect and count cars in a video sequence using foreground detector based on Gaussian mixture models (GMMs).

스크립트 열기

Perform Instance Segmentation Using Mask R-CNN

Segment individual instances of people and cars using a multiclass mask region-based convolutional neural network (R-CNN).

라이브 스크립트 열기

Import Pretrained ONNX YOLO v2 Object Detector

Import pretrained YOLO v2 object detector from ONNX deep learning framework.

라이브 스크립트 열기

Export YOLO v2 Object Detector to ONNX

Export pretrained YOLO v2 object detector to ONNX deep learning framework.

라이브 스크립트 열기

Generate Code for Detecting Objects in Images by Using ACF Object Detector

Generate code from a MATLAB® function that detects objects in images by using an acfObjectDetector object. When you intend to generate code from your MATLAB function that uses an acfObjectDetector object, you must create the object outside of the MATLAB function. The example explains how to modify the MATLAB code in Train Stop Sign Detector Using ACF Object Detector to support code generation.

라이브 스크립트 열기

YOLO v2를 사용하여 객체 검출을 위한 코드 생성하기

YOLO v2를 사용하여 객체 검출을 위한 CUDA® 코드를 생성합니다.

라이브 스크립트 열기

Code Generation for Object Detection by Using Single Shot Multibox Detector

Generate CUDA code for an SSD network.

라이브 스크립트 열기

Code Generation for People Detection Using Deep Learning

Generate CUDA code for people detection

R2025a 이후
라이브 스크립트 열기