pdfinfo (1) Linux Manual Page
NAME
pdfinfo – Portable Document Format (PDF) document information extractor (version 3.03)
SYNOPSIS
pdfinfo [options] [PDF-file]
DESCRIPTION
Pdfinfo prints the contents of the ‘Info’ dictionary (plus some other useful information) from a Portable Document Format (PDF) file.
If PDF-file is ‘-’, it reads the PDF file from stdin.
The ‘Info’ dictionary contains the following values:
- title
- subject
- keywords
- author
- creator
- producer
- creation date
- modification date
In addition, the following information is printed:
- custom metadata (yes/no)
- metadata stream (yes/no)
- tagged (yes/no)
- userproperties (yes/no)
- suspects (yes/no)
- form (AcroForm / XFA / none)
- javascript (yes/no)
- page count
- encrypted flag (yes/no)
- print and copy permissions (if encrypted)
- page size
- file size
- linearized (yes/no)
- PDF version
- metadata (only if requested)
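The fields listed above can be collected into a dictionary for use in scripts. This is a minimal sketch, assuming the default output is one "Key: value" pair per line; `parse_pdfinfo_output` is a hypothetical helper, not part of pdfinfo, and the sample text in the comment is illustrative rather than captured from a real run.

```python
def parse_pdfinfo_output(text):
    """Parse 'Key: value' lines (assumed output layout) into a dict of strings."""
    info = {}
    for line in text.splitlines():
        # Split on the first colon only, so values such as dates
        # ("10:00:00") keep their own colons.
        key, sep, value = line.partition(":")
        if not sep:
            continue  # skip lines that are not key/value pairs
        info[key.strip()] = value.strip()
    return info


# Illustrative input (assumed layout, not real output):
#   Title:          Example
#   Pages:          3
#   Encrypted:      no
```

All values are returned as strings; converting a page count or a date to a richer type is left to the caller.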
The options -listenc, -meta, -js, -struct, and -struct-text print only the requested information; the ‘Info’ dictionary and related data listed above are not printed. At most one of these five options may be used.
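A wrapper script can enforce the one-of-five rule before invoking pdfinfo. The sketch below is an illustration only: `check_exclusive` is a hypothetical helper, and it assumes the arguments are already split into a list.

```python
# The five options described above as mutually exclusive.
EXCLUSIVE_OPTIONS = ("-listenc", "-meta", "-js", "-struct", "-struct-text")


def check_exclusive(args):
    """Return the exclusive options present in args; raise if more than one."""
    used = [a for a in args if a in EXCLUSIVE_OPTIONS]
    if len(used) > 1:
        raise ValueError("at most one of these may be used: " + ", ".join(used))
    return used
```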
OPTIONS
-f number- Specifies the first page to examine. If multiple pages are requested using the "-f" and "-l" options, the size of each requested page (and, optionally, the bounding boxes for each requested page) is printed. Otherwise, only page one is examined.
-l number- Specifies the last page to examine.
-box- Prints the page box bounding boxes: MediaBox, CropBox, BleedBox, TrimBox, and ArtBox.
-meta- Prints document-level metadata. (This is the "Metadata" stream from the PDF file’s Catalog object.)
-custom- Prints custom and standard metadata.
-js- Prints all JavaScript in the PDF.
-struct- Prints the logical document structure of a Tagged-PDF file.
-struct-text- Print the textual content along with the document structure of a Tagged-PDF file. Note that extracting text this way might be slow for big PDF files. (Implies -struct.)
-url- Print all URLs in the PDF. Only the URL types supported by Poppler are listed. Currently, this is limited to Annotations. Note: only URLs referenced by PDF objects such as Link Annotations are listed. pdfinfo does not attempt to extract strings matching http://… from the text content.
-isodates- Prints dates in ISO-8601 format (including the time zone).
-rawdates- Prints the raw (undecoded) date strings, directly from the PDF file.
-dests- Print a list of all named destinations. If a page range is specified using "-f" and "-l", only destinations in the page range are listed.
-enc encoding-name- Sets the encoding to use for text output. This defaults to "UTF-8".
-listenc- Lists the available encodings.
-opw password- Specify the owner password for the PDF file. Providing this will bypass all security restrictions.
-upw password- Specify the user password for the PDF file.
-v- Print copyright and version information.
-h- Print usage information. (-help and --help are equivalent.)
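The page-range and formatting options above can be combined into a single command line from a script. This is a rough sketch, not an official interface: `build_pdfinfo_args` and `run_pdfinfo` are hypothetical helpers, and `run_pdfinfo` assumes that pdfinfo is installed and on the PATH.

```python
import subprocess


def build_pdfinfo_args(pdf_path, first=None, last=None, boxes=False, iso_dates=False):
    """Build an argument list using the -f, -l, -box and -isodates options."""
    args = ["pdfinfo"]
    if first is not None:
        args += ["-f", str(first)]
    if last is not None:
        args += ["-l", str(last)]
    if boxes:
        args.append("-box")
    if iso_dates:
        args.append("-isodates")
    args.append(pdf_path)  # pass "-" to read the PDF from stdin
    return args


def run_pdfinfo(pdf_path, **options):
    """Run pdfinfo and return its standard output as text.

    Raises subprocess.CalledProcessError on a non-zero exit code.
    """
    result = subprocess.run(
        build_pdfinfo_args(pdf_path, **options),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Passing the arguments as a list rather than a shell string avoids quoting problems with file names that contain spaces.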
EXIT CODES
The Xpdf tools use the following exit codes:
- 0: No error.
- 1: Error opening a PDF file.
- 2: Error opening an output file.
- 3: Error related to PDF permissions.
- 99: Other error.
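A calling script can turn these exit codes into readable messages. The sketch below copies the table above; `describe_exit_code` is a hypothetical helper, and the fallback text for unlisted codes is an assumption.

```python
# Exit codes and meanings as listed in this manual page.
EXIT_CODES = {
    0: "No error.",
    1: "Error opening a PDF file.",
    2: "Error opening an output file.",
    3: "Error related to PDF permissions.",
    99: "Other error.",
}


def describe_exit_code(code):
    """Return the documented meaning of an exit code, or a generic fallback."""
    return EXIT_CODES.get(code, "Unknown exit code %d." % code)
```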
AUTHOR
The pdfinfo software and documentation are copyright 1996-2011 Glyph & Cog, LLC.
SEE ALSO
pdfdetach(1), pdffonts(1), pdfimages(1), pdftocairo(1), pdftohtml(1), pdftoppm(1), pdftops(1), pdftotext(1), pdfseparate(1), pdfsig(1), pdfunite(1)
