Guide for OCR Modes Available with Private AI DEID Container

info

This guide is relevant for container users only

This guide outlines the different Optical Character Recognition (OCR) modes available for processing documents and images. Each mode has its unique application, depending on the document's nature and the specific requirements for text extraction.

Table of contents:

Standard OCR

Attribute Rating
Accuracy Medium
Speed High
Cost Low

Standard OCR is built into the container and is the default OCR engine. It's designed for high-performance text recognition in images and digital documents, focusing on efficiency and accuracy for standard printed text.

The control flow for Standard OCR mode is depicted in the following flow chart:

OCR
PDF
Office Docs
Image
Standard / Azure OCR
Process File
Document Type?
Convert to Image
Extract Image
Text Deid
Create Redacted Image
Create Redacted File

When to choose Standard OCR?

  • Most cost efficient OCR solution is needed.

Azure Computer Vision OCR

Attribute Rating
Accuracy High
Speed High
Cost Medium

Azure Computer Vision OCR uses Microsoft's Computer Vision API to analyze and extract text from a wide range of document types. It's particularly effective for complex documents, including those with handwriting, images, and various fonts, layouts, rotations, and Japanese characters.

Set PAI_OCR_SYSTEM=azure_computer_vision environment variable at container startup to use this OCR mode. Check out our Guide for Integrating Private AI DEID container with Azure OCR for setup information.

The control flow is similar to the Standard OCR mode.

When to choose Azure Computer Vision OCR?

  • Handwriting support is needed.
  • Rotated document support is needed.
  • Enhanced language support is needed.
  • Wide range of supported languages, such as Japanese. See Azure OCR Language support for more details.

Azure Document Intelligence

Attribute Rating
Accuracy High
Speed High
Cost High

Azure Document Intelligence uses Microsoft's Document Intelligence API. On top of what Azure Computer Vision OCR already offers, Document Intelligence provide advanced features for complex layouts like tables and forms.

Set PAI_OCR_SYSTEM=azure_doc_intelligence environment variable at container startup to use this OCR mode. Check out our Guide for Integrating Private AI DEID container with Azure OCR for setup information.

The control flow is similar to the Standard OCR mode.

When to choose Azure Document Intelligence?

  • Need advanced features for complex layouts like tables and forms.
  • Handwriting support is needed.
  • Rotated document support is needed.
  • Enhanced language support is needed.
  • Wide range of supported languages. See Azure OCR Language support for more details.

Hybrid OCR

Attribute Rating
Accuracy Medium
Speed High
Cost Medium

The Hybrid OCR mode combines Azure Computer Vision OCR and Standard OCR to optimize text extraction and the cost for PDF documents. This mode uses Azure Computer Vision OCR for pages containing graphics and switches to Standard OCR for other pages. The choice between OCR modes is made on a page by page basis.

The setup of OCR in hybrid mode is the same as Azure Computer Vision OCR mode, just follow our Guide for Integrating Private AI DEID container with Azure OCR & use PAI_OCR_SYSTEM=hybrid environment variable instead of PAI_OCR_SYSTEM=azure_computer_vision.

The control flow for Hybrid OCR mode is depicted in the following flow chart:

OCR
Yes
No
Yes
No
Standard OCR Flow
Azure Computer Vision OCR Flow
Process File
File is a PDF
Render Pages
Image present in the page
Text Deid
Create Redacted Image
Create Redacted File

When to choose Hybrid OCR?

  • Need a mix of performance and cost effectiveness.
  • Need to process mainly PDFs. For other document types the Hybrid OCR will work just like the Standard OCR.
© Copyright 2024 Private AI.