What content types does the API detect and remove?

The API covers metadata and document properties, embedded JavaScript, hidden text and layers, markup annotations and comments, overlapping content, embedded files and attachments, and form field data.

Can I choose which elements to remove, or is it all-or-nothing?

You have full control. GetSanitizableContent() returns a report of everything found, and you specify which categories to remove before calling SanitizeDocument(). You can sanitize everything or target specific element types.
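The inspect-then-select flow described above can be sketched in a few lines. This is a minimal illustration only, not the Apryse SDK: `MockDocument`, `Category`, `get_sanitizable_content`, and `sanitize_document` are hypothetical stand-ins that mirror the two steps named in the answer (report first, then remove the chosen categories). Real signatures and option names should be taken from the SDK documentation.

```python
from dataclasses import dataclass, field
from enum import Enum, auto


class Category(Enum):
    """Content categories listed in the FAQ (names here are illustrative)."""
    METADATA = auto()
    JAVASCRIPT = auto()
    HIDDEN_TEXT_AND_LAYERS = auto()
    ANNOTATIONS = auto()
    OVERLAPPING_CONTENT = auto()
    ATTACHMENTS = auto()
    FORM_FIELD_DATA = auto()


@dataclass
class MockDocument:
    """Hypothetical stand-in for a PDF: visible pages plus hidden items by category."""
    pages: list[str]
    hidden: dict[Category, list[str]] = field(default_factory=dict)


def get_sanitizable_content(doc):
    """Report step: count hidden items per category without modifying the document."""
    return {cat: len(items) for cat, items in doc.hidden.items() if items}


def sanitize_document(doc, remove):
    """Removal step: return a copy without the selected categories; pages are untouched."""
    kept = {cat: list(items) for cat, items in doc.hidden.items() if cat not in remove}
    return MockDocument(pages=list(doc.pages), hidden=kept)


doc = MockDocument(
    pages=["Page 1 text"],
    hidden={
        Category.METADATA: ["Author: J. Doe"],
        Category.JAVASCRIPT: ["app.alert('hi')"],
        Category.FORM_FIELD_DATA: ["name=Jane"],
    },
)

# Step 1: inspect what is present before deciding anything.
report = get_sanitizable_content(doc)

# Step 2a: targeted run that removes scripts and metadata but keeps form data.
targeted = sanitize_document(doc, {Category.JAVASCRIPT, Category.METADATA})

# Step 2b: full run that removes every category the report found.
full = sanitize_document(doc, set(report))
```

In this sketch the targeted run leaves only the form field data behind, the full run leaves nothing hidden, and the visible pages are identical in both results.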

How is this different from redaction?

Redaction removes or obscures visible content, including specific text, images, or regions a reader can see. Sanitization targets hidden content that doesn't appear in the normal document view but is still present in the file structure. Both are often required; they solve different problems.

Does sanitization affect the visible content of my PDF?

No. Sanitization targets hidden and non-visible elements only. Page content, text, images, and layout remain intact.

PDF Sanitization: Share PDFs with Confidence

PDFs carry more than their visible content. The Apryse Sanitization API removes hidden metadata, embedded scripts, and sensitive data before documents leave your environment, giving compliance-driven teams a clean, auditable path to secure sharing.

Secure Sharing Starts with Certainty

Add an automated security layer to document workflows by programmatically scrubbing a PDF’s internal structure, including hidden, malicious, or sensitive metadata and layers. Meet compliance requirements without manual review.

Visibility

Scan any PDF and get a full report of hidden content before you act. No surprises, no guesswork.

Sanitization Use Cases

Regulatory Documents

Strip metadata, hidden layers, and scripts before submitting documents through official channels. Sanitization is often a hard compliance requirement, not just a best practice.

Secure Legal Sharing

Documents that pass through multiple hands, including doctors, insurers, and regulators, can surface privileged information or protected health information (PHI). Sanitize before every handoff.

Inbound Document Hygiene

PDFs arriving from external sources are an active attack vector. Sanitize on ingestion to neutralize embedded scripts before files touch internal systems.

RESOURCES

Learn more about PDF Sanitization.

Apryse Adds a PDF Sanitization Capability to the Server SDK

Apryse Spring 2026 Release: Handwriting Recognition, Improved Collaboration and More