`dump_data_fields`

Print PDF form field data with XML-style escaping

Usage

pdftl <input> dump_data_fields [output <output>]

Details

Extracts data from all interactive form fields (AcroForm fields) within the input PDF.

The output uses a stanza-based format compatible with pdftk. All string values (such as the field’s content) are processed with XML-style escaping.
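A script that consumes this output will usually want the original strings back. The following is a minimal sketch, assuming the escaping uses the standard XML entities (such as &amp;amp; and &amp;lt;) and numeric character references; the helper name unescape_field_value is hypothetical and not part of pdftl. Python's html.unescape also accepts HTML named entities, which is a superset of what XML-style escaping should produce.

```python
import html


def unescape_field_value(raw: str) -> str:
    """Decode XML-style entities and numeric character references.

    Assumption: values are escaped with standard XML entities and
    numeric references, as "XML-style escaping" suggests.
    """
    return html.unescape(raw)


print(unescape_field_value("Smith &amp; Sons &lt;UK&gt;"))
print(unescape_field_value("Caf&#233;"))
```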

This output is for informational purposes or for use in external scripts. It is not designed to be read by the update_info operation. To fill form fields, use the fill_form operation.

Field Stanza Format

Each field is represented by a single stanza. Each stanza begins with a separator line (---), followed by some or all of these lines:

FieldName: <full_field_name>

The unique identifying name of the field (e.g., form1.name).

FieldNameAlt: <tooltip_text>

The alternate name (tooltip) of the field, if present.

FieldType: <Text|Button|Choice|Signature|...>

The readable type of the field.

FieldValue: <current_value>

The current value of the field.

FieldValueDefault: <default_value>

The default value of the field, if defined.

FieldFlags: <integer>

An integer representing a bitmask of field properties.

FieldJustification: <Left|Center|Right>

Text alignment for text fields.

FieldStateOption: <option_value>

Available options for Choice (dropdown/list) fields or Button (checkbox/radio) fields. Multiple lines may appear.

FieldStateOptionDisplay: <display_text>

The display text corresponding to the preceding FieldStateOption, used when the export value differs from the display value.
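For use in external scripts, the stanza format described above can be read with a few lines of Python. This is a rough sketch, not pdftl's own code: the function names parse_field_stanzas and decode_common_flags and the sample text are hypothetical, and the parser assumes each key and value sits on a single line with values escaped using standard XML entities. The flag names are the three bits that the PDF specification defines for all field types; type-specific bits are left out.

```python
import html
from typing import Any, Dict, List

# Keys that may appear more than once per stanza, per the format description.
REPEATABLE_KEYS = {"FieldStateOption", "FieldStateOptionDisplay"}

# Flag bits common to all field types in the PDF specification.
COMMON_FLAGS = {1 << 0: "ReadOnly", 1 << 1: "Required", 1 << 2: "NoExport"}


def parse_field_stanzas(text: str) -> List[Dict[str, Any]]:
    """Split stanza output into one dict per field (illustrative only)."""
    fields: List[Dict[str, Any]] = []
    current = None
    for line in text.splitlines():
        if line.strip() == "---":
            # A separator line starts a new stanza.
            current = {}
            fields.append(current)
            continue
        key, sep, value = line.partition(":")
        if current is None or not sep:
            continue  # skip lines outside a stanza or without a key
        # Assumption: surrounding whitespace is not significant.
        value = html.unescape(value.strip())
        if key in REPEATABLE_KEYS:
            current.setdefault(key, []).append(value)
        else:
            current[key] = value
    return fields


def decode_common_flags(flags: int) -> List[str]:
    """Name the common flag bits that are set in a FieldFlags integer."""
    return [name for bit, name in COMMON_FLAGS.items() if flags & bit]


# Made-up sample in the documented format, not real pdftl output.
sample = """\
---
FieldName: form1.name
FieldNameAlt: Your full name
FieldType: Text
FieldValue: Smith &amp; Sons
FieldFlags: 2
FieldJustification: Left
---
FieldName: form1.country
FieldType: Choice
FieldValue: DE
FieldFlags: 0
FieldStateOption: DE
FieldStateOptionDisplay: Germany
FieldStateOption: FR
FieldStateOptionDisplay: France
"""

for field in parse_field_stanzas(sample):
    print(field["FieldName"], decode_common_flags(int(field["FieldFlags"])))
```

Repeatable keys are collected into lists so that each FieldStateOptionDisplay keeps the same position as its FieldStateOption when every option has a display line; if some options lack one, the two lists will not line up and a script would need to pair them while parsing instead.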

Examples

Print XML-escaped form field data for in.pdf

pdftl in.pdf dump_data_fields

Save XML-escaped form field data for in.pdf to data.txt

pdftl in.pdf dump_data_fields output data.txt
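The same commands can be driven from a script. The sketch below only builds on the usage line shown earlier; it assumes a pdftl executable is on PATH and that the field data goes to standard output when no output file is given. The helper names dump_data_fields_command and run_dump_data_fields are hypothetical.

```python
import subprocess
from typing import List, Optional


def dump_data_fields_command(input_pdf: str, output: Optional[str] = None) -> List[str]:
    """Build the argument list following: pdftl <input> dump_data_fields [output <output>]."""
    cmd = ["pdftl", input_pdf, "dump_data_fields"]
    if output is not None:
        cmd += ["output", output]
    return cmd


def run_dump_data_fields(input_pdf: str, output: Optional[str] = None) -> str:
    """Run the command and return whatever it printed (assumes pdftl is installed)."""
    result = subprocess.run(
        dump_data_fields_command(input_pdf, output),
        check=True,           # raise CalledProcessError on a non-zero exit
        capture_output=True,  # collect stdout and stderr instead of printing
        text=True,            # decode output as str
    )
    return result.stdout


# Only the command is printed here; nothing is executed.
print(dump_data_fields_command("in.pdf", "data.txt"))
```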

Tags: info, forms

Source: pdftl.operations.dump_data_fields

Read online: https://pdftl.readthedocs.io/en/stable/operations/dump_data_fields.html

Type: Operation
