pdftools_sdk.ocr.text_processing_mode

Classes

TextProcessingMode(*values)

The mode for processing text in OCR

class pdftools_sdk.ocr.text_processing_mode.TextProcessingMode(*values)[source]

Bases: IntEnum

The mode for processing text in OCR

Variables:
  • NONE (int) – Do not process text.

  • UPDATE (int) – Only process text that is not extractable. For all characters that have no meaningful Unicode, OCR processing is used to determine the Unicode. This is the recommended mode to make text extractable.

  • REPLACE (int) – Process all text. OCR is used to determine the Unicode of all characters, even if they seemingly have Unicode information. This is useful for documents that possibly contain wrong Unicode information.

NONE = 0
UPDATE = 1
REPLACE = 2