Demo 01 · It rots over time

Your AI agent quietly stops working.

One insurance claims agent, re-checked every week for 8 weeks. As humans stop reviewing it, its fraud guardrail crumbles — and no one notices. Press play.

① The problem

Insurers hand claims decisions to an AI agent and trust it to keep behaving.

② What goes wrong

Week by week, the agent's fraud guardrail weakens. By week 6 the last human reviewer has stepped away, and the agent approves a staged fraud that no one sees.

③ How Annexo catches it

Annexo re-tests the same agent every week and flags the exact week it slips from passing to failing.
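
The weekly re-test idea can be sketched in a few lines. This is a minimal illustration, not Annexo's implementation: the function name `first_failing_week`, the passing threshold of 60, and every score after week 1 are made up for the example. Only the week 1 score of 94 appears in the demo.

```python
from typing import Optional, Sequence


def first_failing_week(scores: Sequence[float], threshold: float) -> Optional[int]:
    """Return the 1-based week where the score first slips from passing to failing.

    A week is "passing" when its score is at or above the threshold.
    Returns None if the agent never moves from passing to failing.
    """
    previously_passing = False
    for week, score in enumerate(scores, start=1):
        passing = score >= threshold
        if previously_passing and not passing:
            return week
        previously_passing = passing
    return None


# Hypothetical guardrail-strength scores for weeks 1 to 8 (only the 94 is from the demo).
weekly_scores = [94, 90, 85, 78, 70, 55, 48, 40]

# Hypothetical passing threshold, chosen for illustration only.
print(first_failing_week(weekly_scores, threshold=60))  # prints 6
```

Comparing each week against the previous one, rather than only checking the latest score, is what lets the check name the exact week of the slip.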

Human in the loop

Approves all · Autonomous

Human approves every decision

100% of decisions reviewed

Fraud-guardrail strength

94 / 100
W1 to W8

This week

W1 · Passing

Every payout approved by a human. Aria holds firm on a staged total-loss claim.

Illustrative demonstration on a fictional agent. The weekly figures are scripted to show how an unattended agent can drift. It reports observed behaviour — not a conformity assessment, not legal advice. Annexo is not a notified body.

About Annexo

Annexo is the independent trust layer for AI agents: it verifies how a third party’s AI agent actually behaves with live tests, watches it for drift, and produces audit-ready evidence for buyers, regulators and insurers. Every result is observed behaviour at the time of testing — never a certification, conformity assessment, guarantee, or legal advice. Annexo is not a notified body.

Frequently asked questions

What is Annexo?
Annexo is an independent trust layer for AI agents. It verifies how a third party’s AI agent actually behaves with live behavioural probes, watches it for drift over time, and produces audit-ready assurance evidence a buyer, regulator or insurer can rely on. The thesis is simple: a builder cannot credibly grade its own homework, so verification has to be independent.
Who is Annexo for?
EU and DACH enterprises deploying AI agents in regulated settings — insurance, banking, industrial — and the consultancies that build agents for them. Later, insurers underwriting agent risk.
How does Annexo verify an AI agent?
Point the verify console at your own AI agent endpoint or run a built-in sample agent. A live probe battery runs against it — prompt injection, tool poisoning, guardrails under pressure, AI disclosure, PII handling, request logging — and resolves into an evidence dashboard. Your agent’s API key is held in memory for that one request only and is never stored.
Does Annexo certify or guarantee that an AI agent is compliant?
No. Annexo is not a notified body and does not certify, guarantee, or give legal advice. Every result is observed behaviour at the time of testing, reported as a status — holding, watch, or surfaced — never a pass/fail verdict or a conformity assessment.
What about EU regulations like the EU AI Act, GDPR, DORA and NIS2?
Annexo also produces done-for-you EU conformity dossiers: audit-ready evidence and technical documentation, produced from your system and mapped to the EU AI Act, GDPR, DORA and NIS2. The dossier is a deliverable, not a substitute for your own counsel or a conformity assessment body.
Where is Annexo’s data processed?
In the EU. Compute runs in the Frankfurt (fra1) region and persisted data uses an EU-region store, in line with EU data-residency expectations.