PdfToText Class |
Namespace: SelectPdf
The PdfToText type exposes the following members.
Name | Description | |
---|---|---|
ClipText |
Do not return hidden text from the PDF document.
| |
DocumentInformation |
PDF document information. This property is populated after the requested operation (GetText, GetHtml, etc) is finished.
(Inherited from PdfTool.) | |
EndPageNumber |
The page number where the current operation will end on the PDF file.
The default value is 0 which means that all the PDF document is processed starting from the StartPageNumber page.
(Inherited from PdfTool.) | |
HtmlCharset |
The charset meta tag added to the generated HTML document when the GetHtml() method is used. The default value is UTF-8.
| |
Layout |
Gets or sets the layout of the output text. The default value is Original.
| |
MarkPageBreaks |
Insert a special character after the text extracted from each PDF page.
The special character defined by the PageBreakMark property.
| |
PageBreakMark |
Gets the page break mark character used when the MarkPageBreaks property is true.
| |
StartPageNumber |
The page number from where the current operation will start on the PDF file.
The default value is 1 which means that the operation will start from the first page.
(Inherited from PdfTool.) | |
Timeout |
Timeout in seconds for the current operation. Default value is 600 seconds.
(Inherited from PdfTool.) | |
UserPassword |
The user password to be used to open the PDF document for reading. The default value is null, which means that
no password will be used to open the PDF document.
(Inherited from PdfTool.) |
Name | Description | |
---|---|---|
ExtractText |
Extracts the text from the specified page and coordinates.
| |
GetHtml |
Gets the text from a PDF document and wraps it with HTML tags.
| |
GetInfo |
Gets the information of the loaded PDF document.
(Inherited from PdfTool.) | |
GetPageCount |
Gets the number of pages in the loaded PDF document.
(Inherited from PdfTool.) | |
GetText |
Gets the text from a range of pages from a PDF document.
| |
Load(Byte) |
Loads a pdf document from a byte array.
(Inherited from PdfTool.) | |
Load(Stream) |
Loads a pdf document from the specified stream.
(Inherited from PdfTool.) | |
Load(String) |
Loads an existing pdf file.
(Inherited from PdfTool.) | |
Load(Byte, String) |
Loads a password protected pdf document from a byte array.
(Inherited from PdfTool.) | |
Load(Stream, String) |
Loads a pdf document from a stream containing a password protected pdf document.
(Inherited from PdfTool.) | |
Load(String, String) |
Loads an existing password protected pdf file.
(Inherited from PdfTool.) | |
SaveHtml(String) |
Gets the text from a PDF document wrapped in HTML tags and saves it into a file.
| |
SaveHtml(String, Encoding) |
Gets the text from a PDF document wrapped in HTML tags and saves it into a file.
| |
SaveText(String) |
Gets the text from a PDF document and saves it into a file.
| |
SaveText(String, Encoding) |
Gets the text from a PDF document and saves it into a file.
| |
Search(String) |
Search for a specific text in a PDF document. The search is case insensitive and returns partial words also.
| |
Search(String, Boolean, Boolean) |
Search for a specific text in a PDF document.
|