텍스트 검출 및 인식

영상 특징 검출 및 설명, 딥러닝, OCR을 사용하여 텍스트 검출 및 인식

영상에서 텍스트를 검출하고 인식하는 일은 컴퓨터 비전 응용 분야에서 일반적으로 수행되는 작업입니다. 예를 들어, 이동하는 차량에서 도로 장면의 비디오를 캡처하고 캡처된 장면에서 표지판을 인식한 후 표지판 정보를 운전자에게 알릴 수 있습니다.

검출과 인식을 2단계 절차로 결합할 수 있습니다. 첫 번째 단계에서는 텍스트가 포함된 영역을 찾고 두 번째 단계에서는 해당 영역 내의 텍스트를 인식합니다.

Input image showing an accessible parking sign, connected to a detector, which outputs an image with predicted bounding boxes overlaid on the sign text, connected to a recognizer that outputs a list of the words recognized on the sign.

텍스트 검출 알고리즘은 영상 내 텍스트의 위치를 찾거나 분할하기 위해 국소 영상 특징, 머신러닝 또는 딥러닝을 사용합니다. Computer Vision Toolbox™의 예제에서는 텍스트 검출에 블롭 분석, MSER(maximally stable extremal regions) 특징 검출기 및 CRAFT(character region awareness for text detection) 딥러닝 모델을 사용하는 방법을 보여줍니다.

텍스트를 검출한 후에는 머신러닝 또는 딥러닝에 기반한 텍스트 인식 모델이 텍스트 영역을 처리해 예측 텍스트를 반환합니다. ocr 함수는 사전 훈련된 언어 모델을 사용하여 여러 언어로 작성된 텍스트를 인식합니다. trainOCR 함수를 사용하여 커스텀 언어 모델을 훈련시킬 수도 있습니다. 자세한 내용은 Getting Started with OCR 항목을 참조하십시오.

앱

영상 레이블 지정기

컴퓨터 비전 응용 분야에서 영상에 레이블 지정

함수

모두 확장

레이블 지정

Image Labeler 컴퓨터 비전 응용 분야에서 영상에 레이블 지정

훈련 및 평가

`trainOCR`	Train OCR model to recognize text in image (R2023a 이후)
`evaluateOCR`	Evaluate OCR results against ground truth (R2023a 이후)
`ocrMetrics`	Store OCR quality metrics (R2023a 이후)
`ocrTrainingOptions`	Options for training OCR model (R2023a 이후)
`ocrTrainingData`	Create training data for OCR from ground truth (R2023a 이후)

양자화

quantizeOCR Quantize OCR model (R2023a 이후)

텍스트 검출

`detectTextCRAFT`	Detect texts in images by using CRAFT deep learning model (R2022a 이후)
`detectMSERFeatures`	Detect MSER features
`vision.BlobAnalysis`	Properties of connected regions
`extractHOGFeatures`	Extract histogram of oriented gradients (HOG) features

텍스트 인식

`ocr`	Recognize text using optical character recognition
`ocrText`	Store OCR results
`visionSupportPackages`	인스톨러를 시작하여 Computer Vision Toolbox 데이터 다운로드, 설치 또는 제거

도움말 항목

시작하기

Getting Started with OCR
Detect and recognize text in multiple languages, train OCR models to recognize custom text.
Train Custom OCR Model
Train an optical character recognition (OCR) model to recognize custom text.
Install OCR Language Data Files
Support files for optical character recognition (OCR) languages.
Local Feature Detection and Extraction
Learn the benefits and applications of local feature detection and extraction.
Point Feature Types
Choose functions that return and accept points objects for several types of features.

추천 예제

Recognize Text Using Optical Character Recognition (OCR)

Recognize text in images using optical character recognition.

라이브 스크립트 열기

Recognize Seven-Segment Digits Using OCR

Use OCR to recognize seven-segmented digits in text detected by CRAFT and region properties.

라이브 스크립트 열기

Automatically Detect and Recognize Text Using MSER and OCR

Automatically detect and recognize text in images using MSER and OCR.

라이브 스크립트 열기

Automatically Detect and Recognize Text Using Pretrained CRAFT Network and OCR

Perform text recognition by using a deep learning based text detector and OCR. In the example, you use a pretrained CRAFT (character region awareness for text) deep learning network to detect the text regions in the input image. You can modify the region threshold and the affinity threshold values of the CRAFT model to localise an entire paragraph, a sentence, or a word. Then, you use OCR to recognize the characters in the detected text regions.

라이브 스크립트 열기

Automate Ground Truth Labeling for OCR

Automate the labeling of text for OCR training and evaluation.

라이브 스크립트 열기

Train an OCR Model to Recognize Seven-Segment Digits

Train an OCR model that can recognize seven-segment numerals.

라이브 스크립트 열기

Digit Classification Using HOG Features

Classify digits using HOG features and a multiclass SVM classifier.

라이브 스크립트 열기