pdftools_sdk.ocr.image_processing_mode

Classes

ImageProcessingMode(*values)

The mode for processing images in OCR

class pdftools_sdk.ocr.image_processing_mode.ImageProcessingMode(*values)[source]

Bases: IntEnum

The mode for processing images in OCR

Variables:
  • NONE (int) – Do not process images.

  • UPDATE_TEXT (int) – Only process images that have no OCR text. This is the recommended mode to make scanned pages searchable while preserving existing OCR text.

  • REPLACE_TEXT (int) – Process all images and remove existing OCR text. Use this mode to re-OCR all images, replacing any previously generated text.

  • REMOVE_TEXT (int) – Remove existing OCR text without performing new OCR. This mode does not require an OCR engine.

  • IF_NO_TEXT (int) – Process images only if the document contains no text at all.

NONE = 0
UPDATE_TEXT = 1
REPLACE_TEXT = 2
REMOVE_TEXT = 3
IF_NO_TEXT = 4