Pre-Purchase Insights: Everything you need to know before you buy.
By John Chow | 2023 Feb 28
Add Apryse’s AI-powered PDF Form Field Detection into your workflow and application, then leverage the JSON output to auto-generate fully interactive PDF e-forms.
Part 1 and Part 2 of our winter release series introduced the new Apryse intelligent data extraction capability, part of the IDP add-on in the Apryse Server SDK. This lets organizations automatically unlock information trapped in any PDFs and leverage content as JSON.
This blog, Part 3, introduces you to the AI-powered Form Field Detector, part of the new Apryse IDP data extraction capability. The new form field detection accurately detects and classifies PDF form fields — including radio buttons, checkboxes, signature fields, and more. Developers can then leverage the JSON output to generate fully interactive e-forms from any informal forms, whether the file is a digitally born PDF file or a scanned copy of a form.
Current PDF form field detectors involve a third-party application, which carries disadvantages:
By contrast, Apryse form field detection works within a standalone solution. With it, you can...
Form field detection uses Apryse AI technology to detect form elements in "informal" PDF forms.
Here, an "informal form” can be anything — a customer or patient intake form, a PDF form to order equipment, and so on. What such forms lack are true fillable PDF form functionality. It could be because files are scanned images of forms. They were created in Microsoft Word and saved to PDF. Or they contain non-fillable tables. As a result, users must manually fill table cells or rows using a third-party PDF editor — or by printing, filling, and scanning the document back in.
The Apryse form field detector introduces automatic PDF form field detection to change the landscape of how PDF forms are processed, making work better and life simpler.
It does so by automatically surveying the layout of an informal form and then determining the most probable arrangement of the individual fields. For example, it understands the difference between a table and a form. It then classifies the type of identified fields, including:
PDF form fields identified via the Automated Detector
Whether starting with a scanned form or a non-interactive, informal one, the process for AI-enabled field detection then looks like this:
At the end, you’ll have a proper PDF e-form with fillable fields. Let’s take a closer look:
You can extract form fields to an:
The JSON contains a list of all the detected fields in the document. Each field is made up of a type, confidence value, and bounding box coordinates. For example:
"type": string, "confidence": double, "rect": [x1, y1, x2, y2] }
With this list, you can:
The JSON output reflects the exact fields on the informal PDF — and the advantage of JSON is you can tweak the detected form fields as you wish. You can append content, delete form fields, or adjust the bounding rectangle coordinates in the JSON as required.
In low-stakes situations, you could decide to trust the Apryse AI’s confidence interval and skip reviewing.
Once satisfied with the JSON output, you can build a new e-form from the contents. The new form looks something like this, with form field boundaries indicated in color:
New PDF e-form with interactive and fillable fields indicated here in red.
As part of the IDP add-on to the Apryse Server SDK, the new form field detector runs efficiently on premises, in your application, instead of consuming costly cloud resources or requiring a third-party app. You own the entire workflow and lock down documents and data in the viewer, which improves security.
We’d love to see the forms you create using the new field detector and Apryse AI. If you have any issues or questions during your free trial, don’t hesitate to drop us a line or leave us a note in the support forum.
When you’re ready to add IDP and Form Field Detection to your existing Apryse Server SDK license, contact Sales.