# `dump_data_utf8` Metadata, page and bookmark info (in UTF-8) ## Usage > pdftl `` `dump_data_utf8` `[json]` `[output` `]` ## Details Extracts document-level metadata and structural information from the input PDF, identical to the `dump_data` operation, except all string values in the output are written as raw UTF-8. No XML-style escaping is applied. This format is designed to be read by the `update_info_utf8` operation. Use this if you need to inspect or process the data with tools that do not understand XML escaping. For a complete description of the output format and all possible fields, see the help for `dump_data`. ## Examples > Save raw metadata for in.pdf to data.txt ``` pdftl in.pdf dump_data_utf8 output data.txt ``` **Tags**: info, metadata *Source: pdftl.operations.dump_data* *Read online: [https://pdftl.readthedocs.io/en/stable/operations/dump_data_utf8.html](https://pdftl.readthedocs.io/en/stable/operations/dump_data_utf8.html)* *Type: Operation*