Activity – OCR

The OCR (Optical Character Recognition) category includes activities for extracting text from images or scanned documents. It provides the Create Tesseract OCR, OCR – Find Element, OCR – Get Text, and OCR – Click Text activities, which let an automation recognize and interact with text in images and UI elements.

The OCR category has the following sub-activities:

Create Tesseract OCR: This activity creates an instance of the Tesseract OCR engine for use in OCR-based activities.

Options Properties:

  • Language: The language to be used for text recognition.
  • Whitelist: The set of characters that the engine is allowed to recognize.

Engine Properties:

  • OCREngine: Stores the created OCR engine instance. This field supports only OCREngine variables.
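
As a rough illustration of what the Language and Whitelist options correspond to in Tesseract itself, the sketch below builds the language string and the `tessedit_char_whitelist` setting in the form the Tesseract command line accepts. The helper name `build_tesseract_options` is hypothetical, and how this activity passes the values to the engine internally is an assumption, not something this page documents.

```python
def build_tesseract_options(languages, whitelist=""):
    """Return (lang, config) strings in the form Tesseract's CLI expects.

    Hypothetical helper for illustration only; not part of the product.
    """
    # Tesseract joins several language codes with '+', e.g. "eng+deu".
    lang = "+".join(languages)
    # An empty whitelist leaves recognition unrestricted.
    config = f"-c tessedit_char_whitelist={whitelist}" if whitelist else ""
    return lang, config


lang, config = build_tesseract_options(["eng"], whitelist="0123456789")
print(lang)    # eng
print(config)  # -c tessedit_char_whitelist=0123456789
```

With the pytesseract wrapper installed, the two strings could be passed as `pytesseract.image_to_string(image, lang=lang, config=config)`; restricting the whitelist to digits, as here, is a common choice for fields such as invoice numbers.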

OCR – Find Element: This activity searches for a given text within the bounds of a specified element and returns the corresponding element.

Engine Properties:

  • OCREngine: Instance of an OCR engine to be used. By default, the OCR Engine is Tesseract.

Options Properties:

  • Element: This field supports only a UI element variable returned by another activity.
  • Text: Text to be searched.

Output Properties:

  • Bounding Element: Element variable that receives the UI element in which the text is found.
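
The search described above can be pictured as matching the requested text against word-level OCR results and returning the rectangle that encloses the match. The following is a minimal sketch under stated assumptions: the `Box` type, the `find_text_bounds` helper, and the case-insensitive matching are all hypothetical, and the word list is hand-written sample data standing in for engine output.

```python
from typing import NamedTuple, Optional, Sequence, Tuple


class Box(NamedTuple):
    left: int
    top: int
    width: int
    height: int


def find_text_bounds(words: Sequence[Tuple[str, Box]], text: str) -> Optional[Box]:
    """Return the box enclosing the first run of words that matches `text`."""
    target = text.lower().split()
    if not target:
        return None
    # Slide a window of len(target) words across the OCR results.
    for start in range(len(words) - len(target) + 1):
        window = words[start:start + len(target)]
        # Assumption: matching ignores case.
        if [word.lower() for word, _ in window] == target:
            boxes = [box for _, box in window]
            left = min(b.left for b in boxes)
            top = min(b.top for b in boxes)
            right = max(b.left + b.width for b in boxes)
            bottom = max(b.top + b.height for b in boxes)
            return Box(left, top, right - left, bottom - top)
    return None  # text not found in the element


# Sample data: (recognized word, box relative to the element).
words = [
    ("File", Box(10, 5, 30, 12)),
    ("Save", Box(50, 5, 34, 12)),
    ("As", Box(88, 5, 16, 12)),
]
print(find_text_bounds(words, "save as"))
# Box(left=50, top=5, width=54, height=12)
```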

OCR – Get Text: This activity processes an image or an element with the OCR engine and returns all the identified text.

Engine Properties:

  • OCREngine: Instance of an OCR engine to be used. By default, the OCR Engine is Tesseract.

Target Properties:

  • Element: This field supports only a UI element variable returned by another activity.
  • BoundingBox: Bounding rectangle, in pixels, relative to the UI element.
  • Image: An existing image to extract text from. This field supports only Image variables.

Output Properties:

  • Text: String extracted from the indicated UI element.
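
Because BoundingBox is expressed in pixels relative to the UI element, it has to be translated into screen coordinates before a region can be captured for OCR. The sketch below shows one way that translation might work. The `Rect` type and `to_screen_rect` helper are hypothetical, and clipping the box to the element's bounds is an assumption rather than documented behavior.

```python
from typing import NamedTuple


class Rect(NamedTuple):
    left: int
    top: int
    width: int
    height: int


def to_screen_rect(element: Rect, bounding_box: Rect) -> Rect:
    """Translate an element-relative box to screen pixels, clipped to the element."""
    # Negative offsets are clamped to the element's top-left corner.
    left = element.left + max(bounding_box.left, 0)
    top = element.top + max(bounding_box.top, 0)
    # The far edges may not extend past the element's own far edges.
    right = min(element.left + bounding_box.left + bounding_box.width,
                element.left + element.width)
    bottom = min(element.top + bounding_box.top + bounding_box.height,
                 element.top + element.height)
    return Rect(left, top, max(right - left, 0), max(bottom - top, 0))


element = Rect(100, 200, 400, 300)     # element position on screen
bounding_box = Rect(20, 30, 500, 50)   # wider than the element, so it is clipped
print(to_screen_rect(element, bounding_box))
# Rect(left=120, top=230, width=380, height=50)
```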

OCR – Click Text: This activity searches a window for the element with the given text and clicks it.

Engine Properties:

  • OCREngine: Instance of an OCR engine to be used. By default, the OCR Engine is Tesseract.

Element Properties:

  • Element: This field supports only a UI element variable returned by another activity.

Options Properties:

  • Mouse Button: Specifies the mouse button to be used for the click action (left or right).
  • Text: Unique text to be clicked within the specified element.
  • CursorPosition: Specifies the displacement of the cursor relative to the option selected in the Position field.
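
To make the CursorPosition description concrete, the sketch below computes a click point from the rectangle of the matched text, an anchor position, and a pixel offset. The `click_point` helper, the anchor names (`Center`, `TopLeft`, and so on), and the separate x and y offsets are assumptions for illustration; the actual option names in the activity may differ.

```python
def click_point(box, position="Center", offset_x=0, offset_y=0):
    """Return the (x, y) pixel to click for a (left, top, width, height) box."""
    left, top, width, height = box
    # Hypothetical anchor names; the product's Position options may differ.
    anchors = {
        "TopLeft": (left, top),
        "TopRight": (left + width, top),
        "BottomLeft": (left, top + height),
        "BottomRight": (left + width, top + height),
        "Center": (left + width // 2, top + height // 2),
    }
    x, y = anchors[position]
    # The offset displaces the cursor from the chosen anchor.
    return x + offset_x, y + offset_y


text_box = (50, 5, 54, 12)  # sample rectangle of the matched text
print(click_point(text_box))                    # (77, 11)
print(click_point(text_box, "TopLeft", 5, 5))   # (55, 10)
```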