| Feature | Standard PDF | PDF/A (Archival) | | :--- | :--- | :--- | | | May use system fonts | Must embed all fonts | | Transparency | Supported | Must be flattened | | Color Space | RGB (screen) | RGB, CMYK, or Grayscale only | | JavaScript | Allowed | Forbidden | | Metadata | Optional | Required (XMP) | Why You Cannot Use HTML's <!DOCTYPE> Logic A common mistake is assuming a PDF has a plain-text doctype string. It does not. If you open a .pdf in a text editor, you will see binary data mixed with obj and stream tags. To find the "doctype," look for the XMP metadata :
<xmpMM:History> <stEvt:action>converted</stEvt:action> <stEvt:parameters>pdfa1b</stEvt:parameters> </xmpMM:History> If that string is absent, you are looking at a standard PDF, not an archival one. When someone asks for a "doctype pdf," they are really asking: "How do I guarantee this file will look the same in 50 years?" doctype pdf
The answer is . It strips out JavaScript, video, audio, and external dependencies. It forces every font and image into the file. It is the legal standard for court filings (US federal courts), library archives (Internet Archive), and engineering blueprints (ISO standards). | Feature | Standard PDF | PDF/A (Archival)