Optical Character Recognition

Introduction

Optical Character Recognition (OCR) is a Catalyst Zia Services component that electronically detects handwritten or printed characters in images or digital documents and converts them to machine-encoded text. Zia detects text in photos and scanned documents, breaks it down into individual characters, and identifies the language it is written in.

The recognized text is presented as a JSON response, along with a confidence score that indicates its accuracy. You can code the Catalyst application to store the recognized data or process it further in any way you require. Zia OCR can automatically detect and recognize text in 10 major languages.
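As an illustration of processing the response further, the following minimal Python sketch parses a sample JSON string and keeps the text only when the confidence score meets a threshold. The field names "text" and "confidence", the sample values, the 0 to 100 score scale, and the helper extract_text are assumptions made for this example; check the SDK and API documentation for the actual response format.

```python
import json

# Hypothetical response shape: the field names "text" and "confidence" and the
# 0-100 score scale are assumptions for illustration, not taken from this page.
SAMPLE_RESPONSE = '{"text": "Grand Opening\\nSaturday 10 AM", "confidence": 87.5}'


def extract_text(raw_response, min_confidence=80.0):
    """Return the recognized text, or None if the confidence is below the threshold."""
    data = json.loads(raw_response)
    confidence = float(data.get("confidence", 0.0))
    if confidence < min_confidence:
        # Low-confidence results could instead be queued for manual review.
        return None
    return data.get("text", "")


recognized = extract_text(SAMPLE_RESPONSE)
if recognized is not None:
    # Split into lines so each one can be stored or processed separately.
    for line in recognized.splitlines():
        print(line)
```

The threshold of 80 is arbitrary; a suitable value depends on how your application tolerates recognition errors.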

OCR is widely used in web and mobile applications that are created to read content from scanned or photographed documents, flyers, menus, posters, signs, and other files containing text. The identified text can be stored digitally or used for further data processing.

Catalyst provides Zia OCR in the Java, Node.js and Python SDK packages, and you can integrate it in your Catalyst web or Android application. The Catalyst console provides easy access to code templates for these environments that you can implement in your application’s code.

You can also test Zia OCR in the console by uploading sample images or documents that contain text and viewing the recognized text. This gives you a better idea of Zia’s accuracy and the OCR response format.

You can refer to the Java SDK documentation, Node.js SDK documentation and Python SDK documentation for code samples of Zia OCR. Refer to the API documentation to learn about the API available for OCR.

You can learn more about the other components of Catalyst Zia Services from this page.

Last Updated 2023-08-18 18:27:19 +0530
