PI logo

ÉVIDENCE

The trustworthy AI that makes your supporting documents reliable and speeds up the processing of your cases

ÉVIDENCE is a PI Project solution designed to automate the extraction, verification and structuring of the information contained in supporting documents.

Designed for the public sector, it turns heterogeneous documents (PDFs, scans, images, forms, certificates and more) into directly usable data, then acts on the case: chasing missing documents and forwarding data to business software.

Automating a key step in administrative processing

Supporting documents are at the heart of many administrative processes: they provide proof, confirm a situation, secure a decision and feed business processes.

However, using them often remains manual, repetitive and costly. The useful information is scattered across varied, sometimes unstructured formats, which slows processing chains and heavily mobilizes teams.

ÉVIDENCE automates this key step: identifying the useful data, making it reliable, structuring it, then acting by chasing missing documents and feeding the processes concerned.

ÉVIDENCE handles every type of supporting document

By way of illustration, here are three supporting documents frequently used in administrative procedures:

Family record book

Extraction and structuring of the useful civil-status information: identity of household members, family ties, dates and places of birth, and any details available depending on the pages analysed.

Proof of address

Identification of the information relating to the declarant, the address, the date of the document and the elements that link a person to a place of residence.

Sworn statements

Analysis of accommodation, cohabitation or common-law statements to identify the people concerned, the nature of the declaration, the declared address, the date and the meaningful elements of the document.

These three use cases illustrate ÉVIDENCE's ability to handle varied administrative documents, often semi-structured or unstructured, and to produce data that can be used directly in a business process.

From a handwritten document to structured, justifiable data

The extraction detail shows each structured field next to the original handwritten document.

Extraction detail in ÉVIDENCE: the structured fields of a sworn statement shown next to the original handwritten document

A clear processing chain

  1. 1

    Submission or receipt

    Incoming documents are sent to the solution through the channel agreed with the client.

  2. 2

    Automatic analysis

    ÉVIDENCE identifies the type of document, spots the useful information and extracts the relevant data.

  3. 3

    Structuring

    The detected information is organized into a format that business processes can use.

  4. 4

    Verification and controls

    The solution carries out the expected consistency, completeness and compliance controls.

  5. 5

    Transmission

    The structured data is made available to feed existing workflows, applications or processing chains.

View of a case in ÉVIDENCE: the supporting documents feed the extraction report, the business analysis and then the output
In the application, each case makes the chain visible: submitted documents, extraction report, business analysis and output ready to be forwarded.

Extracting a piece of data is not enough: you also need to be able to trust it

An automatic reading engine returns the content of a document. It does not tell you whether it read it correctly. Yet, in administrative processing, the risk does not come from the reading: it comes from the decision made on the basis of a wrong piece of data, without anyone noticing.

This requirement is not a refinement: it is inherent to the technology. New-generation reading engines (OCR, vision and language models) are probabilistic models: they estimate the most likely reading, they do not certify it. However capable it may be, such a model can, occasionally, be confidently wrong. The guarantee therefore cannot come from the model itself: it has to be built around it.

ÉVIDENCE is the layer that separates a piece of data that has merely been read from data you can act on. The value is not in extracting more: it is in guaranteeing what is extracted.

What makes the difference between a reading and a reliable piece of data:

Read or not read, never guessed

When extraction fails or remains uncertain, ÉVIDENCE says so (failed reading, field to confirm) rather than presenting a guess as a fact. Whatever was not reliably read is flagged, never disguised as certain data.

No automatic action on uncertain data

The system recognizes when it is not sure and hands over to an officer at that precise moment: no decision is taken automatically on doubtful data.

Traceable and justifiable

Each piece of data stays attached to the original document. Any value can be inspected, verified and corrected next to the source document: nothing is produced without proof.

A reconstructable decision

Every processing step is recorded: traceability is ensured end to end. A decision can be explained and reconstructed case by case, which guarantees auditability and remains enforceable in the event of a control or a dispute.

Quality you measure, not just promise

The reliability of an automatic process is not declared, it is measured. ÉVIDENCE continuously tracks the quality of the data produced, by type of document and over time, and flags any drift before it spreads.

To go further, the reliability achieved can be quantified: the level of reliability stops being an intuition and becomes a verifiable, documented and enforceable value.

Continuous measurement

Quality is tracked by type of document and over time. A drop in quality is detected, not endured.

Independent control

A sized sample is re-processed by an independent human team, for a level of confidence defined in advance.

Verifiable results

The results are quantified and documented. You do not take our word for it: you check.

This is also the whole point of an initial proof step: measuring the reliability of ÉVIDENCE on your own documents, with indicators you can control, before any large-scale deployment.

Concretely: what “measuring reliability” means

Behind these principles lie established methods, not assertions. Here, in plain terms, is what they make possible.

Checking everything is not necessary, and checking at random is no better

To guarantee reliability, there is no need to re-check every case. Statistics make it possible to calculate the exact number of cases to check to reach a given level of confidence. Counter-intuitive point: that number depends very little on the total volume.

A naive approach (“check 10%”) would impose 1,000 then 10,000 checks, for a guarantee that is itself never quantified. Sizing, on the other hand, fixes the guarantee first and derives the minimum effort from it.

Population processed

10,000

cases to check

567

Population processed

100,000

cases to check

597

Multiplying the volume by 10 barely increases the checking effort. 95% confidence, ± 4 points margin.

n ≈ z²·p(1-p) / E²

n: cases to check · z: level of confidence · p: expected rate · E: accepted margin. Corrected for a finite population.

A rate without a margin means nothing

Example

We never give a figure on its own. A compliance rate always comes with the interval within which the real value lies. Without that margin, a percentage is only an assertion.

96.8% ± 1.4 points
90% 95% confidence interval 100%

Drift shows before it does harm

Example

Quality is plotted on a control chart. A point that crosses the limits triggers an alert and an inspection, before the deviation spreads into the processes.

upper limit

Flagged point: drift detected and put under review.

Defence in depth: quality obtained through several systems, not a single model

No single system is enough on its own. Potential deviations pass through several successive filters: at each stage, a share is stopped. What remains at the output is small, known and measured. It is this accumulation of controls, and not a single model, that produces reliability.

Example
Potential deviations at the input
100
Double reading
34
Automatic business controls
12
Targeted verification of flagged points
5
Autonomy threshold: no action on uncertain data
2
Human review of flagged cases
< 1

At the output, one last independent stage: the sized-sample control, which turns what remains into a quantified guarantee.

Methods used, each in its rightful place:

Confidence intervals Control charts (drift) Calibration of confidence scores Exact bounds on samples Inter-rater agreement Multiple-testing correction

Several families of statistical methods work together, each in its rightful place: none is enough on its own, and it is their combination that makes the result solid.

Indicators at the step that matters

A single overall figure is reassuring but does not guide. ÉVIDENCE tracks its indicators at several levels of granularity (by type of document, by step, by process, by period), to act precisely where the deviation is.

First pass yield (FPY)

Example

The share of cases processed correctly the first time, with no rework or correction. It is the indicator that links quality and cost.

Overall

94.2%

By type of document

Family record book96.5%
Proof of address93.1%
Sworn statement90.4%

The same indicator breaks down by process, by step and by period. Others follow the same logic: autonomy rate, final compliance rate, false-alert rate, for explicit control of false positives and false negatives, each with its margin.

Understand the cause, not just observe the error

Example

Each deviation is linked to a cause. That way you treat the origin of the problem, not just its symptom, and you know what stems from the tool or from the process.

Breakdown of deviation causes

Quality of the submitted document41%
Reading ambiguity27%
Business rule to clarify18%
Missing document9%
Other5%

Distinguishing what comes from the tool from what comes from the process is decisive: the two are not corrected in the same place.

Continuous improvement: the tool and the process, separately

The performance of a process depends on two distinct things: the quality of the tool (reading, extraction, controls) and the quality of the business process (rules, documents requested, forms). ÉVIDENCE measures them separately.

So you know whether a deviation comes from the tool or from the process, and you can improve each one without disrupting the other. Two improvement loops, one shared measurement base.

Tool loop

Reading, extraction, automatic controls and decision thresholds. We improve it without touching the business side.

Measure Analyse Adjust Verify

Process loop

Business rules, documents requested, forms and decision circuits. We improve it without depending on a change of tool.

Measure Analyse Adjust Verify

It is this decoupling that makes progress lasting: quality improves continuously on both sides, with no dependence on a single supplier or model.

Quality and compliance tracking in ÉVIDENCE: completeness on arrival, compliance rate, chase reasons and retries to inspect
The tracking makes quality measurable and traceable: completeness, compliance rate, chase reasons and retries to inspect.

For large organisations as well as for the smallest ones

ÉVIDENCE is often imagined as reserved for large organisations that already have an information system. That is indeed one of its uses: for these organisations, the solution integrates with existing applications and processing chains.

But ÉVIDENCE works just as well at small scale. A municipality, an institution or a department without a dedicated business tool can use it without deploying anything in-house, relying on an existing public service to collect supporting documents.

A simple combination, with no IT to deploy

démarche.numerique.gouv.fr is a State service, operated by DINUM, that lets public officials create online forms. The principle is close to a consumer form: users submit their request and supporting documents in a dematerialized, structured way. This service acts as the entry point, and ÉVIDENCE takes over to turn the collected documents into usable data.

Online procedure

Document collection

The user submits their supporting documents through a démarche.numerique.gouv.fr form, with no paper mailing or emails to sort.

ÉVIDENCE

Extraction and structuring

ÉVIDENCE reads the documents received, verifies the useful information and turns it into reliable, structured data.

Officer or service

Ready-to-use data

The officer gets clean, verified data to process the case, even without a business software in the local authority. ÉVIDENCE can also act: chase the user by email to obtain a corrected or additional document, or forward the extracted, structured data to a specialized software.

No development, no infrastructure to install: collection relies on an existing public service, and ÉVIDENCE processes the documents within a sovereign framework hosted in France.

Discover démarche.numerique.gouv.fr

A generic foundation, bespoke business variants

ÉVIDENCE provides a ready-to-use automation foundation, applicable well beyond the supporting documents presented above.

From this foundation, PI Project then designs a solution tailored to your environment: your documents, your business rules, your controls, your processing chains and your integration constraints.

The solution can be adapted to different contexts:

Processing of user cases

Checking of administrative documents

Verification of family or residence situations

Processing of documents related to accommodation or cohabitation

Structuring of incoming documents

Feeding business workflows or existing systems

A sovereign and compliant solution

Digital sovereignty is a decisive issue for public bodies, especially when the documents processed contain personal data.

With ÉVIDENCE, PI Project chooses concrete sovereignty: data is hosted in France, processed within a controlled European legal framework, and no supplier located outside the European Union is involved in the processing chain.

What this means in practice:

Data hosted in France

Processing chain with no supplier located outside the European Union

Model-agnostic approach

No dependence on a single supplier

Adaptation to each organisation's constraints

Client control over purposes, uses and data

ÉVIDENCE is also designed with technological independence in mind. PI Project does not depend on a single supplier, an imposed technology or a specific proprietary model. This approach limits the risks of technological lock-in and makes it possible to adapt the solution to each organisation's requirements.

PI Project acts as a GDPR processor: the client keeps control over its data, its purposes and its uses.

ÉVIDENCE strengthens your operational performance

The economic performance of a document process often hinges on tasks that are barely visible but heavily resource-consuming: reading documents, searching for useful information, matching, checking or routing cases.

By automating the extraction and structuring of the data contained in supporting documents, ÉVIDENCE reduces the time spent on these repetitive but essential operations.

The solution thus helps to control the unit cost of processed cases, to absorb activity peaks more efficiently and to reserve human intervention for specific checks, arbitration and situations that truly require business expertise.

ÉVIDENCE operational dashboard: cases in progress, main bottleneck, average age, distribution of responsibilities and bottlenecks
The dashboard makes bottlenecks, the age of cases and the load per step visible, to act where the cost concentrates.

Designed for the public sector

ÉVIDENCE is aimed at public administrations, local authorities, hospitals and, more broadly, any public operator that wants to industrialize the processing of its supporting documents without losing control over its data, its processing and its technological dependencies.

The solution is part of a logic of rigour, transparency, traceability, compliance and trust.

Public administrations

Public institutions

Local authorities

Hospitals

Public operators

Public co-funding possible for local authorities

Banque des Territoires · Territoires d'IA

ÉVIDENCE is a sovereign solution, hosted in France and designed for the public sector. On that basis, local authorities may, depending on their eligibility, draw on dedicated support schemes for adopting a sovereign AI.

Banque des Territoires (Caisse des Dépôts group) has launched the Territoires d'IA programme, which notably provides a co-funding scheme to make it easier for local authorities to acquire sovereign AI solutions.

Good to know

The eligibility conditions, the amount and the nature of the funding are a matter for Banque des Territoires and are assessed case by case. PI Project is not involved in processing these applications.

Discover the Territoires d'IA programme

Talk with PI Project

Let's evaluate ÉVIDENCE on your administrative supporting documents

Show us your documents and your use cases: we study their automation with ÉVIDENCE and send you a tailored proposal.