About Annexo
The independent trust layer for AI agents.
Lifts and cars get independently inspected before people rely on them. AI agents now make real decisions inside real companies — yet nothing independent observes how they actually behave. Annexo is building that independent layer for the AI agent economy: we test how an agent behaves, keep watching it live, and issue independent assurance — observed behaviour, with evidence and provenance on every finding. We didn't build the agent, so we have no stake in making it look good. That's exactly why the company running it, its buyers, and regulators can all rely on the same result.
Founder
Benjamin Hellmich
Data & AI transformation leader · 12 years
Benjamin has spent over a decade building and governing AI at scale. As a Senior Manager for Data & AI and GenAI at Accenture's strategy practice in Munich, he has led more than 100 AI use cases for large, regulated enterprises.
That work included designing BaFin- and GDPR-aligned AI governance for a global asset manager running 200+ models across 20 countries, and delivering GenAI programmes with returns above 15× for industrial and energy groups. Annexo turns that same governance discipline into independent verification of the AI agents enterprises now deploy.
Earlier he worked as a software engineer, an investment analyst and in M&A, and began his career in banking. He holds an MSc in Finance from Cranfield and a BSc in Computer Science, and advises an international humanitarian organisation on data architecture across 30+ countries.
Selected companies the team has worked with
Why Annexo
Verification is the moat.
No team can grade its own homework — the people who build an agent can't be the ones who vouch for it. Independent verification is the credible signal, and it only works if it's neutral: the company deploying the agent, the buyers relying on it, and the regulators watching all trust the same observed result. Regulators learned this long ago, and the EU AI Act, GDPR, DORA and NIS2 now define what good behaviour looks like — they expect observed evidence, not the vendor's word. Annexo tests an agent's behaviour against those expectations and traces every finding back to what we actually saw, so the person who has to defend it can.
Put your agent on the bench.
Tell us about the agent you're deploying and where it runs. We'll show you exactly how independent verification works — what we observe, what we watch live, and the evidence you walk away with. Insurer-backed assurance is on the horizon (in development); independent assurance comes first.