dump_data_utf8

Metadata, page and bookmark info (in UTF-8)

Usage

pdftl <input> dump_data_utf8 [json] [output <output>]

Details

Extracts document-level metadata and structural information from the input PDF, identical to the dump_data operation, except all string values in the output are written as raw UTF-8. No XML-style escaping is applied.

This format is designed to be read by the update_info_utf8 operation. Use this if you need to inspect or process the data with tools that do not understand XML escaping.

For a complete description of the output format and all possible fields, see the help for dump_data.

Examples

Save raw metadata for in.pdf to data.txt

pdftl in.pdf dump_data_utf8 output data.txt

Tags: info, metadata

Source: pdftl.operations.dump_data

Read online: https://pdftl.readthedocs.io/en/stable/operations/dump_data_utf8.html

Type: Operation