Processor Class
Process PDF documents with OCR
The processor applies Optical Character Recognition (OCR) to PDF documents. It can make scanned documents searchable, fix text extraction issues and generate PDF tagging/structure.
The processor is decoupled from the document: it takes a Document as input and produces a new Document as output.
public class Processor : NativeObject
The Processor type exposes the following members.
Methods
| Name | Description |
|---|---|
| Equals | (Inherited from NativeObject) |
| GetHashCode | (Inherited from NativeObject) |
| Process | Apply OCR to a PDF document. Processes the input PDF document with OCR according to the specified options and writes the processed document to the output stream. Non-critical processing issues raise the Warning event. It is recommended to review the WarningCategory of each warning and handle it if necessary for the application. |

Events
| Name | Description |
|---|---|
| Warning | Event for warnings occurring during OCR processing. Non-critical issues during processing are reported via this event. It is recommended to review the WarningCategory of each warning and handle it if necessary for the application. |