Home

All Blogs

Handwriting Recognition Dev Tools Compared: Google, AWS, Azure, and On-Prem Alternatives (2026)

Isaac Maw

Technical Content Creator

Published June 10, 2026

Updated June 11, 2026

5 min

Handwriting Recognition Dev Tools Compared: Google, AWS, Azure, and On-Prem Alternatives (2026)

Isaac Maw

Technical Content Creator

Summary: Handwriting recognition software helps organizations extract data from handwritten documents such as medical forms, insurance claims, checks, shipment records, and government archives. This comparison examines five approaches: Google Document AI, AWS Textract, Azure Document Intelligence, Tesseract OCR, and Apryse Intelligent Character Recognition (ICR). While cloud-based handwriting recognition APIs offer convenience, they often require documents to leave secure environments and charge per page. Open-source OCR tools have limited handwriting support and require a significant developer lift. Apryse ICR provides a dedicated handwriting recognition engine that runs on-premises, in private clouds, or air-gapped environments, enabling secure, scalable extraction of handwritten text into searchable PDFs, JSON, XML, and other structured formats.

Rather than matching characters against fixed templates, the Apryse handwriting recognition engine uses neural networks and machine learning to analyze individual writing styles, adapt over time, and extract meaning from even the most unstructured inputs, including medical forms, insurance claims, historical archives, and more. The result is structured, searchable data from sources that were previously inaccessible for automation and the foundation for true end-to-end digitization.

handwriting icr

Handwriting Recognition has practical applications across nearly every industry that still utilizes paper:

The Banking, Financial Services, and Insurance sector is document-driven, with loan applications, checks, and account forms all containing handwritten fields that standard OCR simply can't parse reliably.
Healthcare providers handle handwritten intake forms, prescriptions, and clinical notes creating bottlenecks in patient data workflows. Apryse ICR has already demonstrated a 70% reduction in manual data entry for healthcare claims and e-prescriptions.
Logistics companies process handwritten shipment records and delivery confirmations, slowing down supply chains that are otherwise fully digital.
Government agencies are sitting on decades of census data, permit applications, and physical records that represent a largely untapped digitization opportunity.

For developers in 2026, choosing the best solution to build handwiting recognition comes down to a few major options:

open-source OCR libraries like Tesseract
API services like Google Document AI, AWS Textract, and Azure AI Document Intelligence
LLMs such as OpenAI GPT or Claude
Proprietary SDKs, such as Apryse Intelligent Character Recognition (ICR)

The choice of which solution is right for your project depends on deployment requirements, volume, and whether documents can leave your network.

API Services for Handwriting Recognition

Copied to clipboard

Handwriting Recognition API Services like AWS Textract and Google offer good accuracy, but documents leave your environment to be processed in the cloud, violating compliance requirements in certain use cases. In addition, these services are priced per page, so high-volume processing can incur eye-popping costs.

Basic OCR for Handwriting Recognition

Copied to clipboard

OCR Libraries such as Tesseract aren’t designed to process handwriting. OCR systems are designed to recognize printed text out of the box, and aren’t designed to handle the varied strokes and shapes of handwriting. Setting up open-source libraries to handle handwriting is possible, but requires more engineering effort. You can check out the tesseract documentation to see what’s required to train the model on handwriting.

LLMs for Handwriting Recognition

Copied to clipboard

LLMs such as Claude can perform handwriting recognition, but some developers report challenges using the output. For example, in recent versions of Claude, some developers reported challenges outputting usable JSON, with the model outputting written summaries of content instead, or JSON with formatting errors. In addition, using an LLM to process documents can incur high token costs, especially at high volume, similar to API services.

Why Developers Choose Apryse

Copied to clipboard

In comparison, Apryse ICR is purpose-built to process handwriting. It runs in your environment, self-hosted or on-premise.

Benefits of using Apryse ICR include:

Add support for machine‑learning‑based handwriting interpretation.
Handle highly unstructured inputs, including medical forms, insurance claims, historical and archival documents, logistics, and shipping records.
Produce JSON output suitable for downstream automation and analytics.
Unlock handwritten content previously excluded from digital workflows with true end-to-end automation.
Run ICR entirely within your secure infrastructure via the Apryse Server SDK locally, with no external APIs and no data exposure.
Build scalable architecture on Apryse’s high‑performance, cloud‑agnostic SDK.

5 Handwriting Extraction Approaches Compared

Copied to clipboard

Feature	Google Document AI	AWS Textract	Azure Document Intelligence	Apryse ICR	Tesseract
Product Type	Cloud API	Cloud API	Cloud API	SDK	Open-source OCR Library
Deployment	Cloud only	Cloud only	Cloud + Docker	On-prem, private cloud, air-gapped	Self-hosted
Pricing model	Per-page, usage-based	Per-page, usage-based	Per-page, usage-based, flat container fee for disconnected use	Custom licensing	Free (Apache 2.0)
Output format	Plain text, entities, structured JSON	Plain text, structured JSON	Plain text, structured JSON, Markdown	Plain text, structured JSON, XML, PDF text layer	Plain text, TSV, hOCR, ALTO, PDF text layer
Handwriting support	Handwriting via OCR processor, primarily English	English handwriting only, via DetectDocumentText / AnalyzeDocument	Read model handles handwriting in 9+ languages	Dedicated ICR engine, drops in to Server SDK deployment	Limited support with poor accuracy, OCR engine that is optimal for printed characters with simple layouts

Developer Experience Comparison

Copied to clipboard

Google Document AI

Copied to clipboard

Initial Setup: Developers must create a Google Cloud project, enable the Document AI API, configure authentication credentials, and create or select a document processor before processing files.

Output Format: Google Document AI returns extracted text, form fields, entities, tables, and layout information in structured JSON format.

AWS Textract

Copied to clipboard

Initial Setup: Developers must create an AWS account, configure IAM permissions, enable Textract access, and integrate with the AWS SDK or REST APIs before processing documents.

Output Format: AWS Textract returns extracted text, tables, forms, key-value pairs, and confidence scores in JSON format.

Azure Document Intelligence

Copied to clipboard

Initial Setup: Developers must provision an Azure AI Document Intelligence resource, generate API credentials, and connect through REST APIs, SDKs, or the Document Intelligence Studio (UI-based experience).

Output Format: Azure Document Intelligence returns structured JSON containing text, images and other document data.

Tesseract

Copied to clipboard

Initial Setup: Developers must install Tesseract locally, configure language packs, integrate the OCR engine into their application, and perform additional training or customization for handwriting recognition use cases.

Output Format: Tesseract can generate plain text, TSV, hOCR, ALTO XML, and searchable PDF text layers for OCR output.

Apryse ICR

Copied to clipboard

Initial Setup: Developers install the Apryse Server SDK, activate their license key, and add the Handwriting ICR module to enable handwriting recognition workflows.

Output Format: Apryse ICR can generate searchable PDFs with selectable text layers, as well as structured text, XML, and JSON output containing recognized handwriting and positional data.

How to Build Handwriting ICR with Apryse

Copied to clipboard

Using the ICR module, the Server SDK can create searchable and selectable text from images or PDFs, producing either a PDF with selectable text, or outputting just the text and position data in reusable JSON form.

Once integrated, the ICR Module enables the SDK to generate searchable PDFs with selectable text layers.

Step 1. Set up the Apryse Server SDK

Copied to clipboard

Follow the guide in our documentation to get started with the Server SDK in your required framework or language. Your trial key includes access to all the add-ons and modules available for the SDK, including handwriting ICR.

Step 2. Download the Handwriting ICR Module

Copied to clipboard

The Handwriting ICR module is packaged as a zip archive, and is meant to be expanded directly into the directory of your previous Apryse SDK download. For example, if you previously downloaded the 64 bit C/C++ package, then the ICR module would be expanded directly into the PDFNetC64 directory, overwriting files if required.

Download the module from our documentation.

Step 3. Check out the Sample Code

Copied to clipboard

There is a HandwritingICRTest sample application available in the main SDK download package that should be fully functional once this module is extracted as described above.

Here is a full code sample using the Apryse ICR module to search PDFs and extract handwritten text. This code is available in other languages including C++, C# (.Net), Java, JavaScript, PHP, Ruby and VB in our documentation.

#---------------------------------------------------------------------------------------
# Copyright (c) 2001-2026 by Apryse Software Inc. All Rights Reserved.
# Consult LICENSE.txt regarding license information.
#---------------------------------------------------------------------------------------

import site
site.addsitedir("../../../PDFNetC/Lib")
import sys
from PDFNetPython import *

sys.path.append("../../LicenseKey/PYTHON")
from LicenseKey import *

# Relative path to the folder containing test files.
input_path = "../../TestFiles/HandwritingICR/"
output_path = "../../TestFiles/Output/"

def WriteTextToFile(outputFile, text):
    # Write the contents of text to the disk
    f = open(outputFile, "w")
    try:
        f.write(text)
    finally:
        f.close()

# ---------------------------------------------------------------------------------------
# The Handwriting ICR Module is an optional PDFNet add-on that can be used to extract
# handwriting from image-based pages and apply them as hidden text.
#
# The Apryse SDK Handwriting ICR Module can be downloaded from https://dev.apryse.com/
# --------------------------------------------------------------------------------------

def main():

    # The first step in every application using PDFNet is to initialize the
    # library and set the path to common PDF resources. The library is usually
    # initialized only once, but calling Initialize() multiple times is also fine.
    PDFNet.Initialize(LicenseKey)

    # The location of the Handwriting ICR Module
    PDFNet.AddResourceSearchPath("../../../PDFNetC/Lib/")

    # Test if the add-on is installed
    if not HandwritingICRModule.IsModuleAvailable():

        print("""
        Unable to run HandwritingICRTest: Apryse SDK Handwriting ICR Module
        not available.
        ---------------------------------------------------------------
        The Handwriting ICR Module is an optional add-on, available for download
        at https://dev.apryse.com/. If you have already downloaded this
        module, ensure that the SDK is able to find the required files
        using the PDFNet.AddResourceSearchPath() function.""")

    else:

        # --------------------------------------------------------------------------------
        # Example 1) Process a PDF without specifying options
        print("Example 1: processing icr.pdf")

        # Open the .pdf document
        doc = PDFDoc(input_path + "icr.pdf")

        # Run ICR on the .pdf with the default options
        HandwritingICRModule.ProcessPDF(doc)

        # Save the result with hidden text applied
        doc.Save(output_path + "icr-simple.pdf", SDFDoc.e_linearized)
        doc.Close()

        # --------------------------------------------------------------------------------
        # Example 2) Process a subset of PDF pages
        print("Example 2: processing pages from icr.pdf")

        # Open the .pdf document
        doc = PDFDoc(input_path + "icr.pdf")

        # Process handwriting with custom options
        options = HandwritingICROptions()
        
        # Optionally, process a subset of pages
        options.SetPages("2-3")

        # Run ICR on the .pdf
        HandwritingICRModule.ProcessPDF(doc, options)

        # Save the result with hidden text applied
        doc.Save(output_path + "icr-pages.pdf", SDFDoc.e_linearized)
        doc.Close()

        # --------------------------------------------------------------------------------
        # Example 3) Ignore zones specified for each page
        print("Example 3: processing & ignoring zones")

        # Open the .pdf document
        doc = PDFDoc(input_path + "icr.pdf")

        # Process handwriting with custom options
        options = HandwritingICROptions()
        
        # Process page 2 by ignoring the signature area on the bottom
        options.SetPages("2")
        ignore_zones_page2 = RectCollection()
        # These coordinates are in PDF user space, with the origin at the bottom left corner of the page.
        # Coordinates rotate with the page, if it has rotation applied.
        ignore_zones_page2.AddRect(Rect(78, 850.1 - 770, 340, 850.1 - 676))
        options.AddIgnoreZonesForPage(ignore_zones_page2, 2)

        # Run ICR on the .pdf
        HandwritingICRModule.ProcessPDF(doc, options)

        # Save the result with hidden text applied
        doc.Save(output_path + "icr-ignore.pdf", SDFDoc.e_linearized)
        doc.Close()

        # --------------------------------------------------------------------------------
        # Example 4) The postprocessing workflow has also an option of extracting ICR results
        # in JSON format, similar to the one used by the OCR Module
        print("Example 4: extract & apply")

        # Open the .pdf document
        doc = PDFDoc(input_path + "icr.pdf")

        # Extract ICR results in JSON format
        json = HandwritingICRModule.GetICRJsonFromPDF(doc)
        WriteTextToFile(output_path + "icr-get.json", json)

        # Insert your post-processing step (whatever it might be)
        # ...

        # Apply potentially modified ICR JSON to the PDF
        HandwritingICRModule.ApplyICRJsonToPDF(doc, json)

        # Save the result with hidden text applied
        doc.Save(output_path + "icr-get-apply.pdf", SDFDoc.e_linearized)
        doc.Close()

        print("Done.")

        PDFNet.Terminate()


if __name__ == '__main__':
    main()

The API can also be used to apply ICR JSON generated by different OCR or ICR engines. The expected structure for input JSON is:

{  
   "Page":[  
    	{  
          "Word":[  
              {  
                  "font-size": 12,
                  "length": 43,
                  "text":"ABC",
                  "x": 321,
                  "y": 141
              }
         ],
         "num": 1,
         "dpi": 96,
         "origin": "TopLeft"
      	}
   ]
}

What to Do Next

Copied to clipboard

Don’t let your most valuable data stay trapped in an analog state. Whether your application is processing medical records or thousands of handwritten insurance claims, Apryse ICR gives you the power to extract intelligence locally, securely, and at scale.

Start testing ICR in your application with your free trial key. Please reach out to us with any questions!

FAQ

Copied to clipboard

Q: What is the difference between OCR and ICR?

A: OCR (Optical Character Recognition) converts printed or typed text into machine-readable data. ICR (Intelligent Character Recognition) is an advanced form of OCR designed to recognize handwritten text. While OCR works well on printed documents, ICR uses machine learning models to interpret handwriting, making it more effective for paper-based workflows.

Q: Can handwriting recognition run on-premises?

A: Yes. Some handwriting recognition solutions can be deployed entirely on-premises, in private clouds, or in air-gapped environments. While many handwriting recognition APIs process documents in the cloud, on-premises deployments such as using Apryse SDK allow organizations to keep sensitive data within their own infrastructure for security, privacy, and compliance purposes.

Q: Which handwriting recognition API is most accurate?

A: Accuracy depends on document quality, handwriting style, language support, and document structure. Structured forms with clearly defined fields generally produce the highest accuracy across all platforms.

Q: Can Large Language Models replace handwriting recognition software?

A: Large Language Models (LLMs) can often transcribe handwritten text from images and documents, but they are not purpose-built handwriting recognition systems. Organizations processing large document volumes typically require predictable output formats, consistent accuracy, structured JSON extraction, and lower processing costs, which Apryse ICR is designed to provide.

Q: Is handwriting recognition HIPAA compliant?

A: Handwriting recognition software can be used in HIPAA-compliant workflows when deployed and managed appropriately. Organizations handling protected health information (PHI) must evaluate where documents are processed, how data is stored, and whether the solution supports security and compliance requirements. Apryse SDK can help healthcare organizations maintain greater control over sensitive patient information.

View all blogs

How to Solve Six Common Problems when Getting Started with Apryse WebViewer

PDF/A Compliance: Best Practices for Long-Term Document Retention

2026 Jul 21

Choosing a React PDF SDK: Security, Speed, and No SaaS

2026 Jul 16

Handwriting Recognition Dev Tools Compared: Google, AWS, Azure, and On-Prem Alternatives (2026)

Table Of Contents

API Services for Handwriting Recognition

Basic OCR for Handwriting Recognition

LLMs for Handwriting Recognition

Why Developers Choose Apryse

5 Handwriting Extraction Approaches Compared

Developer Experience Comparison

Google Document AI

AWS Textract

Azure Document Intelligence

Tesseract

Apryse ICR

How to Build Handwriting ICR with Apryse

Step 1. Set up the Apryse Server SDK

Step 2. Download the Handwriting ICR Module

Step 3. Check out the Sample Code

What to Do Next

FAQ

Related Articles

View all blogs

How to Solve Six Common Problems when Getting Started with Apryse WebViewer

PDF/A Compliance: Best Practices for Long-Term Document Retention

Choosing a React PDF SDK: Security, Speed, and No SaaS

Ready to get started?