PDF Archival Standard

PDF/A: The ISO Standard for Permanent PDF Archiving

PDF/A (ISO 19005) is the international standard for creating PDF files designed to be readable and perfectly reproducible decades into the future. It embeds all fonts, locks out encryption and JavaScript, and ensures every rendering dependency is contained within the file — guaranteed to open identically on any conformant viewer, now or in 50 years.

Quick Answer

A regular PDF can depend on system fonts, link to external websites, run JavaScript, or require a specific password — all things that may not work in 20 years. PDF/A eliminates every dependency: fonts are embedded inside the file itself, colours are described in absolute terms, external links are prohibited, encryption is banned. The result is a single, completely self-contained file that renders identically on any ISO-conformant PDF viewer — from today's software to a viewer written in 2075. It's the format required by government archives, courts, healthcare systems, and any organisation with legal document retention obligations.

What Is PDF/A?

PDF/A is a family of ISO standards derived from the base PDF specification, each designed to ensure electronic documents remain permanently readable and reproducible. The name stands for PDF for Archiving. The first version, PDF/A-1, was published as ISO 19005-1 in 2005 following years of work by libraries, national archives, courts, and document management organisations that needed a reliable long-term file format.

The core principle: every resource needed to render a PDF/A document is embedded inside it. The document must be self-contained, device-independent, and technology-agnostic — rendering must not depend on the specific software, operating system, font library, or network environment of any particular era.

To enforce this, PDF/A mandates and prohibits a specific set of features:

  • Required: All fonts fully embedded, device-independent colour spaces (with PDF/A output intent), XMP metadata, document information dictionary.
  • Prohibited: Encryption, JavaScript, external content references, audio/video, LZW compression (PDF/A-1), unembedded fonts, and actions that depend on the viewer environment.
📌

PDF/A is a subset, not a superset. A PDF/A file is a valid PDF that conforms to additional restrictions. Not every valid PDF is a valid PDF/A file — most are not.

PDF/A Versions Compared

StandardYearKey AdditionsEmbedded Files
PDF/A-1 (ISO 19005-1)2005Baseline archival rules — strict, no frillsNone
PDF/A-2 (ISO 19005-2)2011Transparency, JPEG 2000, layers (PDF 1.7)PDF/A files only
PDF/A-3 (ISO 19005-3)2012Same as PDF/A-2 plus any embedded file type✓ Any format
PDF/A-4 (ISO 19005-4)2020Based on PDF 2.0 — encrypted metadata, improved accessibility, associated files✓ Any format

Real-World Examples

🏛️ Government Scenario

National Archive: Millions of Digitised Records

A national archive digitises 50 million historical documents — deeds, census records, court filings — and stores them as PDF/A-1. The archivists know that no matter which operating system or PDF software exists in 2075, any ISO-conformant viewer will render the documents identically to how they look today. The fonts are embedded in the files themselves, and the colour values are absolute. No system font is needed. No external server is contacted. The documents are as self-sufficient as the original paper.

⚕️ Healthcare Scenario

Hospital: Patient Records Compliant With Retention Laws

A hospital is required by regulation to retain patient records for 30 years. They export each patient encounter as PDF/A-2a — with full accessibility tagging, ensuring records are readable by screen reader software used by staff with visual impairments. In 2045, when the records reach their retention deadline, the hospital can verify the files are exactly as created because the PDF/A standard guarantees bitwise reproducibility. The files are also submitted to the national health authority, which requires PDF/A for all electronic record submissions.

⚖️ Legal Scenario

Court Filing System: Electronic Legal Documents

A federal court system accepts only PDF/A-1b for all electronic filings. Lawyers and litigants submit pleadings, exhibits, and judgments in PDF/A format. The court system validates conformance using veraPDF at the point of submission — non-conformant files are automatically rejected. Twenty years later, every filing from today is accessible in the same court archives system on different hardware and software, rendering identically to how it appeared on the day of filing.

Why PDF/A Matters

♾️

Future-Proof Documents

All rendering resources embedded in the file — fonts, colour profiles, images. No external dependencies that can break, expire, or disappear over decades.

⚖️

Legal & Regulatory Compliance

Required by government archives, courts, healthcare systems, and financial regulators in jurisdictions with electronic document retention laws — including the EU, US Federal Government, and many national archives worldwide.

🔍

Consistent Rendering

A PDF/A document looks identical regardless of viewer, operating system, or decade. No font substitution, no colour shift, no layout reflow — guaranteed by ISO standard.

🛡️

Tamper Evidence

While PDF/A itself doesn't digitally sign documents, it is the preferred archival wrapper for PAdES-signed PDFs — making the combination of format integrity and signature validity permanent.

🌍

Universal Accessibility

PDF/A-a (Level A) conformance requires full structural tagging — heading hierarchy, alt text, reading order — ensuring the document is readable by assistive technology now and in the future.

📦

Self-Contained Archives

PDF/A-3 and PDF/A-4 allow embedding any file type — source data, CAD files, XML invoices — making one PDF/A file serve as a complete, self-contained document and data package.

What PDF/A Requires and Prohibits

FeatureRegular PDFPDF/A (all levels)
Font embeddingOptional✓ Required — all fonts fully embedded
Device-independent colourOptional✓ Required — with PDF/A output intent
Encryption / password✓ Allowed✗ Prohibited
JavaScript✓ Allowed✗ Prohibited
External content / hyperlinks✓ Allowed✗ External references prohibited
Audio / video✓ Allowed✗ Prohibited (PDF/A-1/2/3)
XMP metadataOptional✓ Required
XFA forms✓ Allowed✗ Prohibited

Common Mistakes to Avoid

  • Not validating the PDF/A output after conversion. Many tools claim to export PDF/A but produce non-conformant files — especially for documents containing system fonts, transparency, or embedded media. Always validate with veraPDF (free, ISO-certified) before archiving.
  • Adding a password to a PDF/A file. Encryption is explicitly prohibited by the PDF/A standard. An encrypted file is automatically non-conformant. Manage access control at the file system or DMS level, not inside the PDF.
  • Choosing PDF/A-1 for modern documents with transparency. PDF/A-1 prohibits transparency effects (drop shadows, blended layers). If your document uses transparency — common in design software output — you must use PDF/A-2 or higher, which permits it.
  • Using XFA-based forms in PDF/A. XFA (XML Forms Architecture) is prohibited in all PDF/A levels. If your workflow uses XFA forms, migrate to AcroForms before exporting to PDF/A.
  • Confusing "saved as PDF" with "saved as PDF/A." These are different export options. Saving as PDF does not create a PDF/A file. Always explicitly select the PDF/A option in your export dialog and confirm the version (PDF/A-1b, PDF/A-2a, etc.).

Frequently Asked Questions

  • PDF/A (ISO 19005) is a family of international standards for long-term PDF archiving. A PDF/A file embeds all fonts, uses device-independent colours, prohibits encryption and JavaScript, and contains no external dependencies — guaranteeing identical rendering on any conformant viewer, now or decades from now.

  • PDF/A-1 (2005): Strict baseline — no transparency, no embedded files. PDF/A-2 (2011): Adds transparency, JPEG 2000, layers. PDF/A-3 (2012): Allows any embedded file type. PDF/A-4 (2020): Based on PDF 2.0 with encrypted metadata and improved accessibility.

  • Encryption, JavaScript, audio/video (in PDF/A-1/2/3), external content references, unembedded fonts, device-dependent colours without an output intent, XFA forms, and actions that depend on the viewer environment. These rules ensure the document is fully self-contained.

  • Government archives, courts filing electronic legal documents, healthcare (patient record retention), financial services (regulatory compliance), academic publishers, and any organisation subject to document retention laws specifying electronic format requirements — in the EU, US Federal Government, and many national jurisdictions.

  • Use Adobe Acrobat (File > Save As > PDF/A), LibreOffice (Export as PDF, tick PDF/A), Foxit PDF Editor, Ghostscript, or iText. After conversion, validate with veraPDF (free, open-source, ISO-certified) — many converters produce non-conformant output that only a validator will catch.

  • PDF/A is for long-term digital archiving. PDF/X is for professional press printing (prepress exchange). Both are ISO subsets of PDF with different constraints — PDF/X prioritises print colour accuracy; PDF/A prioritises long-term digital preservation. They are not interchangeable.

Work With Archival-Grade PDFs — Free

PDFlyst helps you create, edit, and manage PDFs that meet professional archival and compliance standards.

Open PDF Editor — Free