v26.1

PDF抽出機能の新機能: プロパティ抽出

PDFプロパティの抽出: タイトル、作者、サブジェクト、キーワード、ページ数。
Class Extractor: PDFプロパティ抽出用メソッド Extract を追加。
Class ExtractPropertiesOptions: PdfExtractor プラグイン用の PDFプロパティ抽出オプションを表します。
Class PdfProperties: PDFドキュメントのプロパティとメタ情報を表します。
Interface IHaveInput: 単一入力データを持つオプションで使用されます。
Class OptionsWithInput: 単一入力データを持つオプションで使用されます。
Full Free functional.

使用例:

この例は、PDFファイルからプロパティ（タイトル、作者、サブジェクト、キーワード、ページ数）を抽出する方法を示しています。

// Create ExtractPropertiesOptions object to set input file
var options = new ExtractPropertiesOptions("path_to_your_pdf_file.pdf");
// Perform the process and get Properties
var pdfProperties = PdfExtractor.Extract(options);
var title = pdfProperties.Title;
var author = pdfProperties.Author;
var subject = pdfProperties.Subject;
var keywords = pdfProperties.Keywords;
var numberOfPages = pdfProperties.NumberOfPages;

使用例:

この例は、PDFストリームからプロパティ（タイトル、作者、サブジェクト、キーワード、ページ数）を抽出する方法を示しています。

// Create ExtractPropertiesOptions object to set input stream
var stream = File.OpenRead("path_to_your_pdf_file.pdf");
var options = new ExtractPropertiesOptions(stream);
// Perform the process and get Properties
var pdfProperties = PdfExtractor.Extract(options);
var title = pdfProperties.Title;
var author = pdfProperties.Author;
var subject = pdfProperties.Subject;
var keywords = pdfProperties.Keywords;
var numberOfPages = pdfProperties.NumberOfPages;

使用例:

この例は、最も簡潔な形でPDFファイルからプロパティを抽出する方法を示しています。

// Perform the process and get Properties
var pdfProperties = PdfExtractor.Extract(new ExtractPropertiesOptions("path_to_your_pdf_file.pdf"));

改良点

HTMLからPDFへの変換で画像の代替テキストをサポート
PDFからPDFA1bへの準拠機能

修正済みバグ

PDFファイルの最適化を試みた際にエラーが発生する問題
フォームフィールドの垂直配置問題
フォームフィールドで日本語テキストが正しく表示されない問題の修正
HTMLからPDFへの変換でテーブルの書式設定問題
PDF→PDF/A変換前にサイズ最適化すると出力が破損する問題
PDF→DOC変換でテーブルが正しくレンダリングされない問題
PDF→Excel変換で出力ファイルの書式問題
PDF→HTML変換中の ArgumentException
PDF→HTML変換で一部リンクが欠落している問題
PDF→HTML変換時にテキストが欠落する問題