Glossary

Document Processing Glossary

Key terms in document digitization, OCR, intelligent document processing, and AI-powered data extraction.

Agentic Document Processing

An advanced form of intelligent document processing where AI systems autonomously classify incoming documents, route them to the appropriate processing pipeline, validate extracted data against business rules, and trigger downstream actions. Agentic systems move beyond extraction to make decisions about documents without human intervention for routine cases.

Agentic document processing explained

API-First Document Processing

A document processing platform designed primarily for programmatic integration. API-first platforms expose their full functionality through REST or GraphQL APIs, enabling developers to build custom document processing pipelines, integrate with existing business systems, and automate end-to-end workflows without manual intervention.

Document processing API guide

Batch Processing

Processing multiple documents simultaneously using identical settings and extraction rules. Batch processing is essential for large-scale digitization projects where hundreds or thousands of similar documents need to be converted, extracted, and exported using consistent Smart Flow configurations.

How to process documents in bulk

Compliance Automation

Using AI and automated workflows to support regulatory compliance — typically maintaining audit trails, enforcing role-based access, applying retention policies, and making document-processing decisions traceable. Note: this is a generic industry term. Software providing 'compliance automation' features is not itself a substitute for a vendor's compliance certification (SOC 2, HIPAA, ISO 27001). Always evaluate vendors against your specific framework requirements.

EU AI Act and document processing

Confidence Score

Technology that converts images of text into machine-readable characters. Traditional OCR works character-by-character and struggles with complex layouts, handwriting, and low-quality scans.

OCR vs AI document conversion

Intelligent Document Processing (IDP)

An evolution of OCR that combines AI-powered document understanding with structured data extraction and human review workflows. IDP systems capture, classify, extract, and validate document data — not just text.

IDP landing page

Document Digitization

The process of converting physical paper documents into digital formats. Modern digitization goes beyond scanning to include AI-powered conversion, structured data extraction, and searchable output.

Digitize paper records

Structured Data Extraction

Automatically pulling specific data fields (dates, amounts, names, line items) from documents into organized formats like JSON or CSV. Goes beyond text conversion to produce database-ready output.

AI data extraction

Confidence Score

A numerical measure of how certain an AI model is about its output. Higher scores indicate the model is more confident in the accuracy of its conversion. Used to determine which documents need human review.

Auto-approve and confidence thresholds

Human-in-the-Loop (HITL)

A workflow design where AI handles the initial processing but humans review, verify, and approve the output before it enters downstream systems. Balances automation speed with human judgment.

Human-in-the-loop OCR

Smart Flow

A reusable document processing template in PaperAI that saves your configuration: AI model selection, extraction fields, approval thresholds, and output format. Apply it to one document or thousands for consistent results.

Features — Smart Flows

Auto-Approve

A feature that automatically approves document conversions when the AI's confidence score exceeds a threshold you set. Eliminates manual review for routine documents while flagging exceptions for human attention.

Auto-approve guide

Multi-Model AI

Using multiple AI models from different providers to process documents. Different models excel at different document types — standard models handle clean text efficiently, premium models tackle handwriting and complex layouts.

How to choose an AI model

Side-by-Side Review

A review interface that displays the original document next to the AI-generated output. Lets reviewers quickly verify accuracy by comparing source and result on the same screen.

Features — Review

Document Governance

Controls and processes that govern how documents are processed inside an organization. Includes version history, audit trails, role-based access, and approval workflows. Document governance features support internal policy but are not equivalent to a vendor's formal compliance certifications.

Document governance

Batch Processing

Processing multiple documents at once using the same settings. Upload a folder of invoices, apply a Smart Flow, and let the system convert and extract data from every document using identical rules.

Digitize paper records

Credit-Based Pricing

A usage-based pricing model where document processing costs credits based on page count and AI model selected. Standard models cost fewer credits; premium models cost more but offer higher accuracy on complex documents.

Pricing

Version History

An immutable record of every change made to a document's conversion output. Each AI conversion, manual edit, or restoration creates a new version. Enables full traceability and the ability to restore any previous version.

Features — Version History

Document Capture

The initial step of scanning, photographing, or uploading documents into a processing system. Modern capture methods include flatbed scanners, automatic document feeders (ADF), phone cameras with scanning apps, email attachments, and API uploads. Capture quality directly affects downstream processing accuracy.

How to digitize paper records

Document Classification

Automatically identifying the type of a document (invoice, receipt, contract, medical form) before processing. Classification enables systems to apply the correct extraction rules and routing logic based on document type, eliminating the need for manual sorting.

Features

Document Data Pipeline

An automated workflow that takes documents from capture through AI processing, extraction, review, and export to downstream systems. A well-designed pipeline handles document ingestion, model selection, extraction, confidence-based routing, human review, and structured data export.

Build vs buy a document pipeline

Document Pre-processing

Steps taken to improve document quality before AI analysis, such as deskewing rotated pages, removing noise from scanned images, adjusting contrast, and normalizing resolution. Pre-processing can significantly improve extraction accuracy, especially for older or low-quality scans.

How to extract data from scanned PDFs

Document Workflow Automation

End-to-end automation of document processing from capture to data export. Includes automatic classification, AI-powered extraction, confidence-based approval routing, human review for exceptions, and structured data export to downstream business systems.

Document review workflow

Edge OCR

Running optical character recognition and document processing on local devices (phones, tablets, scanners) rather than sending documents to cloud servers. Edge OCR reduces latency and can address data sovereignty requirements, but typically offers limited accuracy compared to cloud-based vision AI models.

Compare approaches

ETL for Unstructured Data

Extract, Transform, Load pipelines designed for documents, images, and other unstructured data sources. Unlike traditional ETL which processes structured database records, unstructured ETL converts documents into structured data through AI processing before loading into data warehouses or business systems.

PDF to JSON

Export Formats

Output types available for processed documents. Common export formats include JSON (for APIs and databases), CSV (for spreadsheets and accounting software), Markdown (for documentation systems), and plain text. The choice of export format depends on the downstream system that will consume the data.

Features

Extraction Fields

Specific data points you configure PaperAI to extract from documents — such as invoice number, date, vendor name, total amount, and line items. Each field has a type (text, number, date, currency, array) that validates the extracted data.

Extraction flows guide

Extraction Schema

The defined set of fields, data types, and validation rules that govern what data is extracted from a document. An extraction schema specifies field names (e.g., invoice_number, total_amount), their types (text, date, currency, array), and any constraints. PaperAI stores extraction schemas within Smart Flows.

AI data extraction

Field-Level Confidence

Accuracy scores assigned to individual extracted data fields rather than the document as a whole. Field-level confidence allows more granular review — a reviewer might accept a clearly extracted vendor name (99% confidence) while verifying a potentially ambiguous total amount (78% confidence) on the same document.

Auto-approve and confidence thresholds

Form Recognition

AI capability to identify and extract data from structured forms with labeled fields, checkboxes, and designated response areas. Form recognition understands the relationship between field labels and their values, handling both printed and handwritten entries within form structures.

How to convert paper forms to digital data

Generative AI for Documents

Using large language models (LLMs) and vision-language models (VLMs) to understand and extract information from documents. Unlike traditional OCR which recognizes characters, generative AI understands document content semantically — interpreting meaning, structure, and context to produce structured output.

State of document AI in 2026

Handwriting Recognition (ICR)

Intelligent Character Recognition technology that converts handwritten text into machine-readable data. Modern handwriting recognition uses vision AI models that process handwriting as images, interpreting characters in context rather than matching individual character shapes — resulting in significantly better accuracy on cursive, mixed writing styles, and poor-quality input.

Handwriting to text

Invoice Matching

The process of comparing invoice data against purchase orders and receiving documents (three-way matching) to verify accuracy before payment. AI document processing automates the data extraction step, producing structured data from all three document types that enables automated matching and discrepancy detection.

Invoice processing use case

Key-Value Pair Extraction

Identifying labeled fields and their corresponding values in documents. For example, extracting 'Invoice Number: INV-2026-001' as a key-value pair where 'Invoice Number' is the key and 'INV-2026-001' is the value. This is fundamental to structured data extraction from forms, invoices, and other business documents.

AI data extraction

Layout-Aware OCR

OCR technology that understands page structure, columns, tables, and spatial relationships between text elements. Unlike basic OCR which reads text linearly, layout-aware OCR preserves document structure — keeping tables intact, maintaining column separation, and understanding the logical reading order of complex multi-section documents.

OCR vs AI document conversion

Multi-Language OCR

Processing documents that contain text in multiple languages simultaneously. Modern vision AI models handle multi-language documents natively because they process text visually rather than relying on language-specific character recognition rules. This is essential for international business documents, multilingual forms, and documents with mixed-language content.

Features

Page Segmentation

Dividing a document page into logical regions for processing — identifying text blocks, tables, images, headers, footers, and margins. Accurate page segmentation is a prerequisite for correct text extraction, table recognition, and structural understanding of complex document layouts.

How to extract tables from PDFs

Post-OCR Correction

Automated or manual fixes applied to OCR output errors after initial text recognition. Post-OCR correction may include spell-checking, dictionary-based validation, format verification for dates and numbers, and human review of low-confidence text segments.

Human-in-the-loop OCR

Real-Time Document Processing

Processing documents immediately upon upload without queuing or batching. Real-time processing enables instant feedback — uploading a document and seeing structured output within seconds. Most modern AI document processing platforms achieve near-real-time processing, with typical turnaround under 30 seconds per document.

How it works

Re-conversion

Processing a document again with different AI models or settings to improve accuracy. When initial conversion results are unsatisfactory, re-conversion lets you try a premium AI model, adjust extraction fields, or modify processing parameters without re-uploading the original document.

How to choose an AI model

Straight-Through Processing (STP)

Documents processed entirely by AI without any human intervention — from upload through extraction to approved output. Straight-through processing is achieved when AI confidence scores exceed the auto-approve threshold, eliminating the need for human review on routine, high-quality documents.

Auto-approve thresholds

Table Extraction

Identifying and extracting tabular data from documents while preserving row and column relationships. Table extraction is one of the hardest document processing tasks because PDFs store table content as positioned text elements, not as logical table structures. AI vision models identify tables visually, understanding borders, whitespace alignment, and header relationships.

How to extract tables from PDFs

Template-Free Extraction

AI document processing that adapts to any document layout without pre-configured templates or rules. Template-free extraction uses vision-language models to understand document structure semantically, finding relevant fields regardless of where they appear on the page. This eliminates the template maintenance burden of legacy OCR systems.

Compare approaches

Vision-Language Model (VLM)

An AI model that processes both visual and textual information simultaneously. In document processing, VLMs analyze document pages as images — understanding layout, structure, text content, and spatial relationships in a single pass. VLMs represent a fundamental advancement over character-by-character OCR, enabling template-free extraction and superior handwriting recognition.

OCR is dead: why vision AI is the future

Zonal OCR

Extracting text from predefined rectangular regions (zones) of a document. Zonal OCR works by specifying exact coordinates where specific data appears — for example, the top-right corner of a form where the date field is located. While effective for highly standardized forms, it breaks when document layouts change or vary between sources.

Compare approaches

Ready to try intelligent document processing?

See these concepts in action. Start free with 100 credits.

Get Started Free See Plans

100 free credits to start50% off first month with LAUNCH50Cancel anytime