dump_data_utf8
Metadata, page and bookmark info (in UTF-8)
Usage
pdftl
<input>dump_data_utf8[json][output<output>]
Details
Extracts document-level metadata and structural information
from the input PDF, identical to the dump_data operation,
except all string values in the output are written as raw
UTF-8. No XML-style escaping is applied.
This format is designed to be read by the update_info_utf8
operation. Use this if you need to inspect or process the
data with tools that do not understand XML escaping.
For a complete description of the output format and all
possible fields, see the help for dump_data.
Examples
Save raw metadata for in.pdf to data.txt
pdftl in.pdf dump_data_utf8 output data.txt
Tags: info, metadata
Source: pdftl.operations.dump_data
Read online: https://pdftl.readthedocs.io/en/stable/operations/dump_data_utf8.html
Type: Operation