Document Lexicon.
Master the vocabulary of PDF forensics, archival standards, and secure document protocols.
AES-256 (Advanced Encryption Standard)
SecurityThe global standard for data encryption, using a 256-bit key to transform PDF data into unreadable noise, essentially unbreakable by brute force.
Annotate
InteractionThe process of adding supplementary information (comments, highlights, shapes) to a PDF without changing the underlying static content.
Bicubic Downsampling
TechnicalAn image processing technique that reduces the resolution of an image to shrink PDF file size while maintaining high visual fidelity.
CMYK (Cyan, Magenta, Yellow, Key/Black)
PrintingThe subtractive color model used in color printing. Unlike RGB (light), CMYK is optimized for physical ink on paper.
Cross-Reference Table (XRef)
ArchitectureThe internal index of a PDF that maps every object to its exact byte location in the file. A corrupt XRef is the most common cause of 'Broken' PDFs.
Digital Signature
SecurityA cryptographic 'seal' placed on a PDF that ensures the document hasn't been modified since it was signed and identifies the signer via a private key.
DPI (Dots Per Inch)
PrintingA measure of spatial printing or video dot density. Higher DPI results in sharper images but larger PDF file sizes.
Embedding (Fonts)
TechnicalThe process of including full font data inside the PDF file, ensuring the document looks the same on computers that don't have that font installed.
Flattening
InteractionThe process of merging multiple layers of a PDF (like checkboxes and text fields) into a single, static image layer that cannot be edited.
Grayscale
PrintingA range of shades of gray from white to black, used in PDFs to reduce color ink usage and decrease file size.
ICC Profile
PrintingA set of data that characterizes a color input or output device, used to ensure color consistency across different screens and printers.
JBIG2
TechnicalA specialized bi-level image compression standard used in PDFs primarily to shrink the size of scanned black-and-white documents.
Linearization (Fast Web View)
UXAn optimization that organizes a PDF file so the first page can be displayed in a browser while the rest of the file continues to download.
Lossless Compression
TechnicalA method of reducing file size that allows the original data to be perfectly reconstructed without any loss of quality.
Lossy Compression
TechnicalA method of reducing file size by discarding information that is less perceptible to the human eye, resulting in a permanent loss of detail.
Metadata
ArchitectureInvisible data stored inside a PDF, such as the author's name, creation date, and software version used to generate the file.
OCR (Optical Character Recognition)
AIThe use of AI to convert an image of text (like a scan) into searchable and selectable digital text.
PDF/A (Archival)
StandardsAn ISO-standardized version of PDF specialized for use in the archiving and long-term preservation of digital documents.
PDF/X (Exchange)
StandardsA subset of the PDF standard specialized for graphics exchange in print production, ensuring high-fidelity color and font consistency.
PKI (Public Key Infrastructure)
SecurityThe framework of roles and policies used to create, manage, and revoke digital certificates used for PDF signing and encryption.
Portability
StandardsThe ability of the PDF format to maintain its visual appearance regardless of the operating system, device, or software used to open it.
Rasterization
TechnicalThe process of converting vector graphics (perfect lines) into a grid of pixels. Scaling a rasterized image results in pixelation.
Redaction
SecurityThe permanent and irreversible deletion of sensitive information from a PDF. Unlike 'blacking out,' true redaction wipes the underlying data.
User Password
SecurityA security credential required to open and view a PDF, which triggers the file's primary encryption mechanism.
Vector Graphics
ArchitectureImages defined by mathematical points and paths rather than pixels, allowing for infinite scalability without loss of clarity.