By Garry Klooesterman | 2026 Jan 14

3 min
Tags
Smart Data Extraction
Let’s be honest, nothing kills a developer’s soul faster than the phrase, "Hey, can you write a quick script to pull the totals from these 5,000 PDFs?"
Ok, you think, "Sure, I'll just use regex to handle this. No problem." Then you realize every invoice has a different layout and some are scans. Then the noisy data starts breaking your internal AI models. Suddenly, you’re a professional copy-paster doing what feels like wrestling a swarm of bees.
We’ve all heard this story many times. So, we decided to paint these pain points into the panes of a graphic novel.
We’re releasing the first edition of the Apryse Chronicles, a graphic novel series built for the developers in the thick of document processing. This series visualizes the challenges of document workflows such as unstructured data and messy data extraction as villains our hero must tackle with heroic solutions like Smart Data Extraction.
Our first edition follows Alex. He’s a developer tasked with the impossible job of building a data extraction system that just works, doesn't leak data to the cloud, and doesn't require constant monitoring.
In Volume 1, you’ll follow Alex through the troublesome stages of his mission:
The Custom Script Trap: Trying to build unique routines for every document type and failing.
The Generic API Nightmare: Realizing most APIs just pull chunks of text without understanding the structure around it or even what a table is.
The Security Wall: The moment IT tells you that the data cannot leave your infrastructure.

Catch a sneak peek of Alex’s journey from copy-paste nightmare to Smart Data Extraction daydream below:
More than just bringing this story to life in art form, you’ll get a front-row seat to how Apryse Smart Data Extraction changes the game:
On-Prem Power: See how Alex keeps the data secure within his own infrastructure.
The JSON Solution: Watch unstructured black hole documents transform into clean, labeled data ready for AI training.
The ROI of Sanity: See what happens to a dev team when they actually get their time back.
While the Apryse Chronicles is a great read, the solution Alex discovers isn't fictional.
We wanted to show, not just tell, how Smart Data Extraction works. In the panels, you’ll see the transformation from a cluttered, messy scan into a clean, labeled JSON output. It’s the equivalent of a satisfying video for developers.

Whether you’re looking for a better way to feed your AI models or you just want to see a developer high-five a colleague, this is for you. So, take a break from wrestling those documents and start reading the Apryse Chronicles!
The saga doesn’t end here! Stay tuned for future volumes where our hero takes on other villainous challenges of the document processing world.

See how Apryse Smart Data Extraction works beyond the pages of the graphic novel.
Tags
Smart Data Extraction

Garry Klooesterman
Senior Technical Content Creator
Share this post