v26.1

New Feature Extract Properties for PDF Extractor

  • Extract PDF Properties: Title, Author, Subject, Keywords, Number of Pages.
  • Class Extractor: added method Extract for extracting PDF Properties.
  • Class ExtractPropertiesOptions: Represents PDF Properties Extraction Options for the PdfExtractor plugin.
  • Class PdfProperties: Represents Properties and meta information of PDF document.
  • Interface IHaveInput: Used for Options with single input data.
  • Class OptionsWithInput: Used for Options with single input data.
  • Full Free functional.

Example Usage:

The example demonstrates how to Extract Properties (Title, Author, Subject, Keywords, Number of Pages) from PDF file.

// Create ExtractPropertiesOptions object to set input file
var options = new ExtractPropertiesOptions("path_to_your_pdf_file.pdf");
// Perform the process and get Properties
var pdfProperties = PdfExtractor.Extract(options);
var title = pdfProperties.Title;
var author = pdfProperties.Author;
var subject = pdfProperties.Subject;
var keywords = pdfProperties.Keywords;
var numberOfPages = pdfProperties.NumberOfPages;

Example Usage:

The example demonstrates how to Extract Properties (Title, Author, Subject, Keywords, Number of Pages) from PDF stream.

// Create ExtractPropertiesOptions object to set input stream
var stream = File.OpenRead("path_to_your_pdf_file.pdf");
var options = new ExtractPropertiesOptions(stream);
// Perform the process and get Properties
var pdfProperties = PdfExtractor.Extract(options);
var title = pdfProperties.Title;
var author = pdfProperties.Author;
var subject = pdfProperties.Subject;
var keywords = pdfProperties.Keywords;
var numberOfPages = pdfProperties.NumberOfPages;

Example Usage:

The example demonstrates how to Extract Properties from PDF file in the shortest possible style.

// Perform the process and get Properties
var pdfProperties = PdfExtractor.Extract(new ExtractPropertiesOptions("path_to_your_pdf_file.pdf"));

Enhancements

  • HTML to PDF enhancement supports alternative text for images
  • PDF to PDFA1b compliance

Fixed Bugs

  • Error occurs when attempting to optimize PDF file
  • Form Field vertical alignment problem
  • Issue fixed with displaying Japanese text in FormField
  • HTML to PDF – Table formatting issue
  • Optimizing PDF size prior to PDF to PDF/A conversion results in corrupted output
  • PDF to DOC – Table rendered improperly
  • PDF to Excel – Output file formatting problems
  • PDF to HTML – ArgumentException during conversion
  • PDF to HTML conversion missing some links
  • Text missing when converting PDF to HTML
 English