What is the Info Dictionary?
Before advanced XML-based metadata (XMP) was invented, PDF files used a simple system called the **Document Information (Info) Dictionary**. It is a small block of text buried in the file code that answers the question: "What is this file?"
Even though modern PDFs use more complex systems, the **Info Dictionary** is still the first place that computers look for basic information. When you see a "Title" appear at the top of your web browser when opening a PDF, or when you right-click a file and select "Properties," you are likely looking at the contents of the Info Dictionary.
Common Fields in the Info Dictionary
- /Title: The human-friendly name of the document (often different from the filename).
- /Author: The person or organization that wrote the content.
- /Subject: A short one-sentence summary of the content.
- /Keywords: A list of terms used by search engines (SEO) to categorize the file.
- /Creator: The name of the original program used to create the content (e.g., Microsoft Word).
- /Producer: The name of the software that turned that content into a PDF (e.g., Adobe Acrobat).
- /CreationDate: The exact date and time the file was first generated.
- /ModDate: The date and time the file was last saved or modified.
Why You Should Manage the Info Dictionary
- SEO & Searchability: Search engines like Google use the `/Title` and `/Keywords` fields to decide how high to rank your PDF in search results.
- Security and Privacy:** The Info Dictionary can contain "Leaked Data." For example, it might show that a document was written on an ancient version of Word, or reveal the name of an employee who has since left the company. Professional "Sanitization" tools often wipe the Info Dictionary before a file is released publicly.
- Professionalism: Viewing a PDF that has "Untitled" or "Document1" as its title looks unprofessional and makes it hard for users to organize their files.
Info Dictionary vs. XMP
Starting with PDF 1.4, Adobe introduced **XMP (Extensible Metadata Platform)**, which is much more powerful and can hold thousands of custom fields. However, because many older programs (and some web browsers) only understand the **Info Dictionary**, professional PDF software keeps both systems "In Sync," writing the same information to both places so the file is compatible with everything.
Real-World Examples
A government researcher releases a 200-page report on climate change. They make sure to set the **/Title** to "2026 Climate Impact Summary" and add **/Keywords** like "Sustainability," "Carbon," and "Data." Because they filled out the **Info Dictionary**, when a journalist searches for those keywords on Google, the PDF report shows up with a professional, bold title rather than just an unhelpful link like `report_v2_final.pdf`.
A law firm is preparing for a high-stakes court case. Before they send their evidence (as PDFs) to the opposing counsel, they use a "Redaction Tool" to check the **Info Dictionary**. They discover that the files still contain the original **/Author** name—a consultant who wasn't supposed to be involved in the case. They wipe the Info Dictionary clean to protect their client's privacy and ensure no "Metadata Leak" occurs during the legal exchange.
When Should You Update the Info Dictionary?
- Before publishing any PDF to a website or public server.
- When you want your PDF to appear with a specific title in a browser tab.
- When you need to remove personal "leak" data from a file for privacy reasons.
- **Pro Tip:** Always check your metadata using a "PDF Properties" viewer before hit "Send" on an important email.