Detect objects using YOLO v2 object detector
bboxes = detect(detector,I)
[bboxes,scores] = detect(detector,I)
[___,labels] = detect(detector,I)
[___] = detect(___,roi)
[___] = detect(___,Name,Value)
When using this function, use of a CUDA®-enabled NVIDIA® GPU with a compute capability of 3.0 or higher is highly recommended. The GPU reduces computation time significantly. Usage of the GPU requires Parallel Computing Toolbox™.
[___] = detect(___,
detects objects within the rectangular search region specified by
roi. Use output arguments from any of the previous syntaxes. Specify
input arguments from any of the previous syntaxes.
[___] = detect(___,
specifies options using one or more
Name,Value pair arguments in
addition to the input arguments in any of the preceding syntaxes.
Load a YOLO v2 object detector pretrained to detect vehicles.
vehicleDetector = load('yolov2VehicleDetector.mat','detector'); detector = vehicleDetector.detector;
Read a test image into the workspace.
I = imread('highway.png');
Display the input test image.
Run the pretrained YOLO v2 object detector on the test image. Inspect the results for vehicle detection. The labels are derived from the
ClassNames property of the detector.
[bboxes,scores,labels] = detect(detector,I)
bboxes = 1×4 78 81 64 63
scores = single 0.6224
labels = categorical vehicle
Annotate the image with the bounding boxes for the detections.
if ~isempty(bboxes) detectedI = insertObjectAnnotation(I,'rectangle',bboxes,cellstr(labels)); end figure imshow(detectedI)
I— Test image
Test image, specified as a real, nonsparse, grayscale, or RGB image.
The range of the test image must be same as the range of the images used to train
the YOLO v2 object detector. For example, if the detector was trained on
uint8 images, the test image must also have pixel values in the
range [0, 255]. Otherwise, use the
function to rescale the pixel values in the test image.
roi— Search region of interest
Search region of interest, specified as a four-element vector of form [x y width height]. The vector specifies the upper left corner and size of a region of interest in pixels.
comma-separated pairs of
the argument name and
Value is the corresponding value.
Name must appear inside quotes. You can specify several name and value
pair arguments in any order as
'Threshold'— Detection threshold
0.5(default) | scalar in the range [0, 1]
Detection threshold, specified as a comma-separated pair consisting of
'Threshold' and a scalar in the range [0, 1]. Detections that
have scores less than this threshold value are removed. To reduce false positives,
increase this value.
'SelectStrongest'— Select strongest bounding box
Select the strongest bounding box for each detected object, specified as the
comma-separated pair consisting of
'SelectStrongest' and either
true — Returns the strongest bounding box per object. The
method calls the
selectStrongestBboxMulticlass function, which uses nonmaximal
suppression to eliminate overlapping bounding boxes based on their confidence
By default, the
selectStrongestBboxMulticlass function is called as
selectStrongestBboxMulticlass(bbox,scores,... 'RatioType','Min',... 'OverlapThreshold',0.5);
false — Return all the detected bounding boxes. You can
then write your own custom method to eliminate overlapping bounding boxes.
'MinSize'— Minimum region size
[1 1](default) | vector of the form [height width]
Minimum region size, specified as the comma-separated pair consisting of
'MinSize' and a vector of the form [height
width]. Units are in pixels. The minimum region size defines the
size of the smallest region containing the object.
MinSize is 1-by-1.
'MaxSize'— Maximum region size
I) (default) | vector of the form [height width]
Maximum region size, specified as the comma-separated pair consisting of
'MaxSize' and a vector of the form [height
width]. Units are in pixels. The maximum region size defines the
size of the largest region containing the object.
'MaxSize' is set to the height and width of the
I. To reduce computation time, set this value to the
known maximum region size for the objects that can be detected in the input test
'ExecutionEnvironment'— Hardware resource
Hardware resource on which to run the detector, specified as the comma-separated
pair consisting of
'auto' — Use a GPU if it is available. Otherwise, use the
'gpu' — Use the GPU. To use a GPU, you must have
Computing Toolbox and a CUDA-enabled NVIDIA GPU with a compute capability of 3.0 or higher. If a suitable GPU
is not available, the function returns an error.
'cpu' — Use the CPU.
bboxes— Location of objects detected within image
Location of objects detected within the input image, returned as an
M-by-4 matrix, where M is the number
of bounding boxes. Each row of
bboxes contains a
four-element vector of the form [x
height]. This vector specifies the upper left corner and size
of that corresponding bounding box in pixels.
scores— Detection scores
Detection confidence scores, returned as an M-by-1 vector, where M is the number of bounding boxes. A higher score indicates higher confidence in the detection.
labels— Labels for bounding boxes
Labels for bounding boxes, returned as an M-by-1 categorical array of M labels. You define the class names used to label the objects when you train the input