This adapter reads a PDF document (including password-protected documents), extracts the text content of a specific page range or of the entire document, and outputs the extracted strings as an XML or text document with a freely selectable character encoding.
Properties
|
Operation |
Describes which operation the adapter performs. Possible values: |
Parameters
|
password |
Password for opening a protected PDF document. Possible values: Any string |
|
startPage |
Number of the first page from which text is extracted. Possible values:
|
|
endPage |
Number of the last page up to which text is extracted. Possible values:
|
|
encoding |
Character encoding of the result document. Possible values: Any valid encoding name (e.g. |
|
force |
Also attempt to extract text from invalid PDF pages. Possible values:
|
|
toXML |
Output the text content as an XML document. Possible values:
|
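To illustrate how startPage, endPage, encoding and toXML might interact, here is a minimal Python sketch. It is not the adapter's implementation: the function name `render_extracted_text`, the XML element names `document` and `page`, the default values, and the 1-based, inclusive page numbering are all assumptions made for this example. The sketch starts from page texts that have already been extracted, so it does not cover PDF parsing, the password parameter, or the force parameter.

```python
from xml.etree import ElementTree as ET


def render_extracted_text(pages, start_page=1, end_page=None,
                          encoding="UTF-8", to_xml=False):
    """Serialize already-extracted page texts to bytes.

    pages is a list of strings, one per PDF page (page 1 is pages[0]).
    All names and defaults here are hypothetical, not the adapter's API.
    """
    if end_page is None:
        end_page = len(pages)  # assumption: no end page means "to the end"
    if not 1 <= start_page <= end_page <= len(pages):
        raise ValueError("page range outside the document")

    # Assumption: page numbers are 1-based and the range is inclusive.
    selected = pages[start_page - 1:end_page]

    if to_xml:
        # Hypothetical XML layout: one <page> element per extracted page.
        root = ET.Element("document")
        for number, text in enumerate(selected, start=start_page):
            page = ET.SubElement(root, "page", number=str(number))
            page.text = text
        return ET.tostring(root, encoding=encoding)

    # Plain-text output: pages joined by newlines, then encoded.
    return "\n".join(selected).encode(encoding)
```

For example, `render_extracted_text(pages, 2, 3, encoding="ISO-8859-1")` would return the text of pages 2 and 3 as ISO-8859-1 bytes, and adding `to_xml=True` would wrap each page in its own element instead.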
Status values
|
|
The operation was successful. |
|
|
The operation failed due to a technical error. |