Data Alchemy — Software IDP con AI
Back to blog
Legal archivingIDPDocument search

Smart legal archiving: searching documents by content with AI

Data Alchemy · May 13, 2026 · 2 min read

Legal document archiving — in Italy known as conservazione sostitutiva, compliant digital preservation — ensures that business documents such as electronic invoices, contracts and delivery notes retain legal value over time. It's both an obligation and a valuable safeguard. But it has a huge practical limit: the document is preserved, not understood. This is where Data Alchemy makes the difference.

The problem: compliant but "blind" archives

A legal archiving system stores millions of documents securely and immutably. The problem is how to find them again. Search typically relies on just a few metadata fields:

  • >file name,
  • >preservation date,
  • >document type,
  • >a handful of fields indexed up front.

That means if you don't know exactly which document to look for, you won't find it. You can't ask for "all invoices from that supplier containing a certain item," or "the contracts that include a specific clause." The actual content of the documents — what's written inside — remains invisible to the system.

You have the archive, but you can't query it for what really matters: the content.

The solution: adding a layer of understanding

Data Alchemy doesn't replace your legal archiving system: it enhances it. It integrates with the archive and adds what's missing — the ability to understand every document.

Here's how it works:

  1. >Reading the content: Data Alchemy reads each document with AI models and understands its meaning, not just its characters.
  2. >Extracting rich data and metadata: it identifies supplier, amounts, items, clauses, order references and much more, turning the document into structured data.
  3. >Indexing by content: this data enriches the archive's index, making every document searchable by what it actually contains.
  4. >Semantic search: you can finally search documents by content — by supplier, by item, by clause — even on documents preserved years earlier.

A concrete example

You need to retrieve all preserved invoices containing a specific product code, for an audit or a dispute. With legal archiving alone you'd have to open documents one by one. With Data Alchemy integrated into the archive, that data has already been extracted and indexed: a content search returns all the relevant documents in seconds.

Preservation stays compliant and immutable; what changes is the way you query it.

Compliant preservation + content intelligence

Legal archiving answers the question "is this document legally valid?". Data Alchemy answers the question "what do my documents contain and how do I find them?". Together, they give your company an archive that is both legally compliant and genuinely searchable.


Want to make your compliant archive searchable? Book a free demo and see how Data Alchemy integrates with your legal archiving system.

Want to automate your company's documents?

Book a demo
Smart legal archiving: searching documents by content with AI