pdftools_sdk.extraction.text_extraction_format

Classes

TextExtractionFormat(value)

class pdftools_sdk.extraction.text_extraction_format.TextExtractionFormat(value)

Bases: IntEnum

Variables:
  • DOCUMENT_ORDER (int) – Text is extracted in the order in which it is embedded in the PDF.

  • MONOSPACE (int) – Text is extracted as monospaced text that mimics the layout of the page by using whitespace.

DOCUMENT_ORDER = 1
MONOSPACE = 2
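
As a minimal sketch of how the members above behave as an IntEnum: the class below is a local stand-in that mirrors only the two documented values, so the snippet runs without the SDK installed. In real code you would presumably import TextExtractionFormat from pdftools_sdk.extraction.text_extraction_format instead. The parse_format helper is hypothetical and not part of the SDK.

```python
from enum import IntEnum


class TextExtractionFormat(IntEnum):
    """Local stand-in mirroring the documented members (assumption: not the SDK class)."""

    DOCUMENT_ORDER = 1  # text in the order in which it is embedded in the PDF
    MONOSPACE = 2       # monospaced text that mimics the page layout with whitespace


def parse_format(name: str) -> TextExtractionFormat:
    """Hypothetical helper: map a user-supplied name to an enum member."""
    try:
        # Lookup by member name is case-sensitive, so normalize first.
        return TextExtractionFormat[name.strip().upper()]
    except KeyError:
        valid = ", ".join(member.name for member in TextExtractionFormat)
        raise ValueError(f"unknown format {name!r}; expected one of: {valid}") from None


# Lookup by name (e.g. from a config file or CLI argument).
fmt = parse_format("monospace")

# Lookup by value returns the same singleton member.
same = TextExtractionFormat(2)

# IntEnum members compare equal to plain integers.
is_two = fmt == 2
```

Because the class derives from IntEnum, a member can be passed wherever an integer is expected, while lookups by name or by value still yield the named member.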