Blog

Notes on AI behavior evidence.

Writing

Introducing Peeld

How AI behavior requirements become modules, runs, and evidence.

What compliance-grade evals need

Why a score is not enough for regulated AI deployments.

Testing agent workflows

How to catch risky tool use before production expansion.