The OCR (Optical Character Recognition) includes activities for extracting text from images or scanned documents. It includes functionalities like Create Tesseract OCR, OCR – Find Element, OCR – Get Text, and OCR Click Text, enabling automation to recognize and interact with text-based content in images and UI elements.
The OCR category has the following sub activities:
Create Tesseract OCR: This activity creates an instance of Tesseract OCR Engine for use in an OCR-based activity.
Options Properties:
Language: The language to be used for text recognition.
Whitelist: List of characters to be whitelisted for recognition.
Engine Properties:
OCREngine: Creates Instance of OCR Engine. This field supports only OCREngine variable.
OCR- Find Element: This activity searches for a given text within the bounds of a specified element and returns the corresponding element.
Engine Properties:
OCREngine: Instance of an OCR engine to be used. By default, OCR Engine is Tesseract.
Options Properties:
Element: This field supports only Ul element variable returned by another activity.
Text: Text to be searched.
Output Properties:
Bounding Element: Element variable where the corresponding UI Element is found.
OCR: Get Text- This activity processes an image or an element with OCR Engine and returns all the identified text.
Engine Properties:
OCREngine: Instance of an OCR engine to be used. By default, the OCR Engine is Tesseract.
Target Properties:
Element: This field supports only Ul element variable returned by another activity.
BoundingBox: Bounding rectangle in pixels relative to the UIElement.
Image: An existing image to extract text from. This field supports only Image variables.
Output Properties:
Text: String extracted from an indicated Ul element.
OCR- Click Text: This activity searches and clicks on the element with the given text in a window.
Engine Properties:
OCREngine: Instance of an OCR engine to be used. By default, the OCR Engine is Tesseract.
Element Properties:
Element: This field supports only the Ul element variable returned by another activity.
Options Properties:
Mouse Button: Specifies the mouse button to be used for the click action (left or right).
Text: Unique text that needs to be clicked from the specified element.
CursorPosition: Specifies the displacement of the cursor position relative to the option selected in the position field.