Document Extraction API for Developers
Add AI document data extraction to your own product or workflow. Data Alchemy exposes its Intelligent Document Processing engine through a REST API, webhooks and SQL connectors: send invoices, delivery notes (DDT), orders or contracts and get back structured, validated JSON — 99.8% accuracy in about 3 seconds per document, ready to post into your ERP.
An IDP API that turns documents into structured data
The document extraction API is the programmatic side of Data Alchemy's Intelligent Document Processing platform. Instead of clicking through a web app, your systems submit documents and receive validated, structured data in return — so you can embed invoice, DDT and order extraction directly into your own software, automate your accounts-payable pipeline, or feed any ERP or CRM without manual re-keying. For REST endpoints, webhook payloads and the output data schema, see the developer API documentation.
Everything you need to integrate document extraction
REST API
Submit a document and retrieve its extracted, validated data as structured JSON — header fields, line items, totals, taxes and references — ready to map onto your ERP.
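This page does not list the actual endpoints; those live in the developer API documentation. As a rough sketch of what a submission could look like in Python, the example below prepares a request without sending it. The base URL, the `/v1/documents` path, the bearer-token header and the raw-PDF body are all placeholders assumed for illustration, not the documented API.

```python
import urllib.request

# Hypothetical values: the real base URL, path and auth scheme are defined
# in the developer API documentation, not on this page.
API_BASE = "https://api.example.com"
DOCUMENTS_PATH = "/v1/documents"


def build_submit_request(pdf_bytes: bytes, api_key: str) -> urllib.request.Request:
    """Prepare (but do not send) a POST request carrying one document."""
    return urllib.request.Request(
        url=API_BASE + DOCUMENTS_PATH,
        data=pdf_bytes,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/pdf",
            "Accept": "application/json",
        },
    )


request = build_submit_request(b"%PDF-1.7 ...", api_key="YOUR_API_KEY")
# Sending it would be urllib.request.urlopen(request); the response body
# would then be parsed as JSON.
```

Building the request separately from sending it keeps the integration easy to unit-test before you point it at a live endpoint.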
Webhooks
Build asynchronous pipelines: Data Alchemy notifies your endpoint when each document has been processed, so you never poll for results.
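On the receiving side, a webhook endpoint mostly needs to parse the notification and decide what to answer. The sketch below shows that logic as a plain Python function, independent of any web framework. The payload shape (`event`, `document_id`, `data`) and the event name `document.processed` are assumptions made for the example; the real payload is described in the developer API documentation.

```python
import json

# Hypothetical payload shape:
#   {"event": "document.processed", "document_id": "...", "data": {...}}
processed_documents: dict[str, dict] = {}


def handle_webhook(raw_body: bytes) -> int:
    """Return the HTTP status code the endpoint would answer with."""
    try:
        payload = json.loads(raw_body)
    except ValueError:  # covers invalid JSON and undecodable bytes
        return 400
    if not isinstance(payload, dict):
        return 400
    if payload.get("event") != "document.processed":
        return 204  # acknowledge, but ignore events we do not handle
    document_id = payload.get("document_id")
    if not document_id:
        return 400
    # Keyed by ID, so a redelivered notification overwrites instead of duplicating.
    processed_documents[document_id] = payload.get("data", {})
    return 200
```

Storing results by document ID makes the handler idempotent, which is a sensible default whenever a sender may retry a delivery.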
SQL connectors
Prefer database-level integration? Push extracted data straight into your system through direct SQL connectors, no middleware required.
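To give a feel for what database-level delivery means, here is a small Python sketch that writes one extracted invoice into two tables. It uses an in-memory SQLite database as a stand-in for your target system; the table names, column names and the invoice fields are invented for the example and do not describe the connector's actual schema.

```python
import sqlite3

# Hypothetical extracted document; field names are illustrative only.
invoice = {
    "number": "INV-001",
    "supplier": "ACME S.r.l.",
    "total": 122.0,
    "lines": [
        {"description": "Widget", "quantity": 10, "unit_price": 10.0},
    ],
}

conn = sqlite3.connect(":memory:")  # stand-in for the target database
conn.executescript(
    """
    CREATE TABLE invoice_header (number TEXT PRIMARY KEY, supplier TEXT, total REAL);
    CREATE TABLE invoice_line (
        invoice_number TEXT REFERENCES invoice_header(number),
        description TEXT, quantity REAL, unit_price REAL
    );
    """
)

# Header and lines go in one transaction, so a failure leaves no partial invoice.
with conn:
    conn.execute(
        "INSERT INTO invoice_header VALUES (?, ?, ?)",
        (invoice["number"], invoice["supplier"], invoice["total"]),
    )
    conn.executemany(
        "INSERT INTO invoice_line VALUES (?, ?, ?, ?)",
        [
            (invoice["number"], line["description"], line["quantity"], line["unit_price"])
            for line in invoice["lines"]
        ],
    )

line_count = conn.execute("SELECT COUNT(*) FROM invoice_line").fetchone()[0]
```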
Email acquisition
Connect a Google Workspace or Microsoft 365 mailbox and let the AI filter and ingest business documents automatically — integration with zero code.
Structured JSON output
Every document returns clean, typed JSON validated against your master data, so downstream systems consume reliable data instead of raw text.
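Typed output is easiest to appreciate from the consuming side. The Python sketch below loads a JSON document into dataclasses and keeps amounts as exact decimals. The field names (`number`, `supplier`, `total`, `lines`) are assumptions chosen for the example; the real output schema is in the developer API documentation.

```python
import json
from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class LineItem:
    description: str
    quantity: Decimal
    unit_price: Decimal


@dataclass(frozen=True)
class Invoice:
    number: str
    supplier: str
    total: Decimal
    lines: list[LineItem]


def parse_invoice(raw: str) -> Invoice:
    # Parsing numbers as Decimal keeps monetary amounts exact,
    # instead of converting them to binary floats.
    doc = json.loads(raw, parse_float=Decimal, parse_int=Decimal)
    return Invoice(
        number=doc["number"],
        supplier=doc["supplier"],
        total=doc["total"],
        lines=[LineItem(**line) for line in doc["lines"]],
    )


# Hypothetical response body.
raw = (
    '{"number": "INV-001", "supplier": "ACME S.r.l.", "total": 122.00, '
    '"lines": [{"description": "Widget", "quantity": 10, "unit_price": 10.00}]}'
)
invoice = parse_invoice(raw)
```

A missing or unexpected field raises an error at parse time, so schema drift surfaces at the boundary rather than deep inside your ERP mapping.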
A dedicated LLM per document model
Each document type is processed by the best model for the job (today Claude AI), reaching 99.8% accuracy in about 3 seconds, with no templates.
From document to structured data in four steps
Submit
Send a document to the REST API, drop it via SQL, or let the platform pull it from a connected mailbox.
Extract & classify
A dedicated LLM classifies the document and extracts every field — no layout training, no templates.
Validate
The extracted data is checked against your ERP master data in real time, flagging anomalies before they propagate.
Deliver
Receive structured JSON via API response or webhook, then post it into SAP, Zucchetti, TeamSystem or any other system.
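The Validate step above can be pictured with a small Python sketch. This page does not say which checks the platform runs, so the two shown here (a supplier lookup and a totals consistency check) are illustrative assumptions, as are the field names and the sample master data.

```python
from decimal import Decimal

# Hypothetical master data: VAT numbers of known suppliers.
KNOWN_SUPPLIERS = {"IT01234567890": "ACME S.r.l."}


def validate(doc: dict, tolerance: Decimal = Decimal("0.01")) -> list[str]:
    """Return anomaly messages; an empty list means the document passed."""
    anomalies = []
    if doc["supplier_vat"] not in KNOWN_SUPPLIERS:
        anomalies.append(f"unknown supplier {doc['supplier_vat']}")
    net = sum(
        (line["quantity"] * line["unit_price"] for line in doc["lines"]),
        Decimal("0"),
    )
    expected = net + doc["tax"]
    if abs(expected - doc["total"]) > tolerance:
        anomalies.append(
            f"total {doc['total']} does not match lines plus tax {expected}"
        )
    return anomalies


good = {
    "supplier_vat": "IT01234567890",
    "tax": Decimal("22.00"),
    "total": Decimal("122.00"),
    "lines": [{"quantity": Decimal("10"), "unit_price": Decimal("10.00")}],
}
bad = {**good, "supplier_vat": "IT99999999999", "total": Decimal("130.00")}

good_result = validate(good)
bad_result = validate(bad)
```

Returning a list of anomalies rather than raising on the first one lets a reviewer see every problem with a document in a single pass.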
Explore the platform behind the API
SAP, Zucchetti, TeamSystem
How extracted data is written into your management system via API, webhooks and SQL.
Learn more →
Reconciliation
LLM invoice reconciliation
Automated 3-way matching between invoices, orders and delivery notes.
Learn more →
IDP platform
Document management software
The AI layer that reads, understands and extracts what a DMS only stores.
Learn more →
Frequently asked questions about the document extraction API
What is the Data Alchemy document extraction API?
It is the programmatic interface to Data Alchemy's Intelligent Document Processing (IDP) engine. Instead of using the web app, your systems submit documents — invoices, delivery notes (DDT), purchase orders, contracts, price lists — and receive back structured, validated data ready to write into your ERP or CRM.
Which integration methods are available?
Data Alchemy exposes a REST API and webhooks for event-driven workflows, plus direct SQL connectors for systems that prefer database-level integration. Documents can also be acquired automatically from a Google Workspace or Microsoft 365 mailbox, with no code at all.
What format does the extracted data come back in?
Extracted fields are returned as structured JSON — header data, line items, totals, taxes and document references — already validated against your ERP master data, so it can be mapped directly onto your system of record.
How accurate and fast is the extraction?
Data Alchemy assigns a dedicated LLM to each document model (today Claude AI), reaching 99.8% accuracy in about 3 seconds per document, with no templates and no per-layout training.
Which ERPs and systems can I write the data into?
The API is system-agnostic: native integrations exist for SAP, Zucchetti and TeamSystem, and the REST API, webhooks and SQL connectors let you push structured data into any other ERP, CRM or internal application.
Build with the document extraction API
Tell us about your use case and we'll set up API access and walk you through integration on your real documents — no commitment.
Request API access