View PDF metadata with ExifTool and pdfinfo

From Parallel Library Services
Revision as of 19:29, 20 November 2021 by Simon (talk | contribs) (Created page with "A couple of tools you can use to look into the metadata of a PDF are Exiftool and <code>pdfinfo</code>, a tool that is part of Poppler's utilites. == Using exiftool =...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

A couple of tools you can use to look into the metadata of a PDF are Exiftool and pdfinfo, a tool that is part of Poppler's utilites.

Using exiftool

Exiftool can be used to view the metadata on digital files, including PDF, JPEG and many more. It mostly displays information about the file's creation, and with PDf sometimes the

  • Title
  • Author
  • Keywords
  • Creation date

By running

exiftool filename.pdf
<syntaxhighlight>

it will print metadata to the terminal, which may look like this:

<syntaxhighlight lang="bash">


==Using pdfinfo==

First, install Poppler.

On Mac:

<syntaxhighlight lang="bash">
brew install poppler

On Linux:

apt install poppler

For help, see the manual by running

pdfinfo --help

pdfinfo allows you to see metadata of the file, by running

pdfinfo filename.pdf

It will then print something like this to the terminal:

Producer:        Adobe Acrobat 9.0 Paper Capture Plug-in with ClearScan
CreationDate:    Fri Sep  4 16:29:41 2009 CEST
ModDate:         Sat Feb 23 16:41:07 2019 CET
Custom Metadata: no
Metadata Stream: yes
Tagged:          no
UserProperties:  no
Suspects:        no
Form:            AcroForm
JavaScript:      no
Pages:           11
Encrypted:       no
Page size:       468 x 640.8 pts
Page rot:        0
File size:       246950 bytes
Optimized:       yes
PDF version:     1.6

pdfinfo also has ability to print document level metadata with the -meta option.

pdfinfo filename.pdf -meta
<?xpacket begin="" id="W5M0MpCehiHzreSzNTczkc9d"?>
<x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="Adobe XMP Core 4.2.1-c043 52.372728, 2009/01/18-15:08:04        ">
   <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
      <rdf:Description rdf:about=""
            xmlns:pdf="http://ns.adobe.com/pdf/1.3/">
         <pdf:Producer>Acrobat Distiller 7.0 (Windows)</pdf:Producer>
      </rdf:Description>
      <rdf:Description rdf:about=""
            xmlns:pdfx="http://ns.adobe.com/pdfx/1.3/">
         <pdfx:GTS_PDFXVersion>PDF/X-4</pdfx:GTS_PDFXVersion>
      </rdf:Description>
      <rdf:Description rdf:about=""
            xmlns:xmp="http://ns.adobe.com/xap/1.0/">
         <xmp:CreateDate>2016-09-24T20:10:06+05:30</xmp:CreateDate>
         <xmp:ModifyDate>2016-10-11T16:54:29-05:00</xmp:ModifyDate>
         <xmp:MetadataDate>2016-10-11T16:54:29-05:00</xmp:MetadataDate>
         <xmp:CreatorTool>Adobe Acrobat 7.0</xmp:CreatorTool>
      </rdf:Description>
      <rdf:Description rdf:about=""
            xmlns:xmpMM="http://ns.adobe.com/xap/1.0/mm/"
            xmlns:stRef="http://ns.adobe.com/xap/1.0/sType/ResourceRef#">
         <xmpMM:DocumentID>uuid:3B0E29BD6482E611B55AC7698D1090EA</xmpMM:DocumentID>
         <xmpMM:InstanceID>uuid:07134e91-ad54-41ed-8ab9-8b527a9cf01e</xmpMM:InstanceID>
         <xmpMM:DerivedFrom rdf:parseType="Resource">
            <stRef:instanceID>uuid:4a759112-a125-48bd-afb5-266418c75c7f</stRef:instanceID>
            <stRef:documentID>xmp.id:CF2F1D650479E61193ABF2273C4D4837</stRef:documentID>
            <stRef:versionID>1</stRef:versionID>
            <stRef:renditionClass>proof:pdf</stRef:renditionClass>
         </xmpMM:DerivedFrom>
      </rdf:Description>
      <rdf:Description rdf:about=""
            xmlns:dc="http://purl.org/dc/elements/1.1/">
         <dc:format>application/pdf</dc:format>
         <dc:title>
            <rdf:Alt>
               <rdf:li xml:lang="x-default">In the Wake: On Blackness and Being</rdf:li>
            </rdf:Alt>
         </dc:title>
         <dc:creator>
            <rdf:Bag>
               <rdf:li>Christina Sharpe</rdf:li>
            </rdf:Bag>
         </dc:creator>
      </rdf:Description>
      <rdf:Description rdf:about=""
            xmlns:pdfxid="http://www.npes.org/pdfx/ns/id/">
         <pdfxid:GTS_PDFXVersion>PDF/X-4</pdfxid:GTS_PDFXVersion>
      </rdf:Description>
   </rdf:RDF>
</x:xmpmeta>