v26.1

New Feature Extract Properties for PDF Extractor

Extract PDF Properties: Title, Author, Subject, Keywords, Number of Pages.
Class Extractor: added method Extract for extracting PDF Properties.
Class ExtractPropertiesOptions: Represents PDF Properties Extraction Options for the PdfExtractor plugin.
Class PdfProperties: Represents Properties and meta information of PDF document.
Interface IHaveInput: Used for Options with single input data.
Class OptionsWithInput: Used for Options with single input data.
Full Free functional.

Example Usage:

The example demonstrates how to Extract Properties (Title, Author, Subject, Keywords, Number of Pages) from PDF file.

// Create ExtractPropertiesOptions object to set input file
var options = new ExtractPropertiesOptions("path_to_your_pdf_file.pdf");
// Perform the process and get Properties
var pdfProperties = PdfExtractor.Extract(options);
var title = pdfProperties.Title;
var author = pdfProperties.Author;
var subject = pdfProperties.Subject;
var keywords = pdfProperties.Keywords;
var numberOfPages = pdfProperties.NumberOfPages;

Example Usage:

The example demonstrates how to Extract Properties (Title, Author, Subject, Keywords, Number of Pages) from PDF stream.

// Create ExtractPropertiesOptions object to set input stream
var stream = File.OpenRead("path_to_your_pdf_file.pdf");
var options = new ExtractPropertiesOptions(stream);
// Perform the process and get Properties
var pdfProperties = PdfExtractor.Extract(options);
var title = pdfProperties.Title;
var author = pdfProperties.Author;
var subject = pdfProperties.Subject;
var keywords = pdfProperties.Keywords;
var numberOfPages = pdfProperties.NumberOfPages;

Example Usage:

The example demonstrates how to Extract Properties from PDF file in the shortest possible style.

// Perform the process and get Properties
var pdfProperties = PdfExtractor.Extract(new ExtractPropertiesOptions("path_to_your_pdf_file.pdf"));

Enhancements

HTML to PDF enhancement supports alternative text for images
PDF to PDFA1b compliance

Fixed Bugs

Error occurs when attempting to optimize PDF file
Form Field vertical alignment problem
Issue fixed with displaying Japanese text in FormField
HTML to PDF – Table formatting issue
Optimizing PDF size prior to PDF to PDF/A conversion results in corrupted output
PDF to DOC – Table rendered improperly
PDF to Excel – Output file formatting problems
PDF to HTML – ArgumentException during conversion
PDF to HTML conversion missing some links
Text missing when converting PDF to HTML