Layer 5

L5Assessment

Specs say what good looks like. Runs prove it. Drift catches it slipping. Replayable, signed, and bound to release gates.

Specs
3
Runs (30d)
3
2 pass · 1 marginal
Drift open
2
1 high severity
Pass rate
67%

Assessment specs

Each spec defines the dimensions that matter for a change unit and who must sign.

Triage agent v1.x

Claims Triage Agent — release gate

Dimensions
brief conformancecorrectnessoutcomesafetyregulatory
Signatories
Head of Claims · Agent Steward · Compliance
L1 copilot v0.x

L1 Adjudication Copilot — release gate

Dimensions
brief conformancecorrectnessoutcomeworkforce-intentsafety
Signatories
Head of Claims · Chief People Officer · Agent Steward
Dispatch coach v1.x

Dispatch Coach Agent — release gate

Dimensions
brief conformancecorrectnessoutcomeperformanceintegration
Signatories
Head of Dispatch · Agent Steward

Recent runs

A run is a single execution of a Spec — replayable. Each dimension gets a verdict.

Claims Triage Agent — release gate
Run on 22 Apr 2026 · signed by Aisha Patel, Priya Raman, Compliance — auto
pass
brief conformance: passcorrectness: passoutcome: passsafety: passregulatory: pass
Evidence
  • 470 backtested claims
  • 12 live shadow days
  • Compliance review note
L1 Adjudication Copilot — release gate
Run on 26 Apr 2026 · signed by Aisha Patel, Priya Raman
marginal
brief conformance: passcorrectness: marginaloutcome: passworkforce-intent: passsafety: marginal
Evidence
  • 210 supervised cases
  • Workforce-intent survey (n=12)
  • Policy-drift evaluator pack
Dispatch Coach Agent — release gate
Run on 29 Apr 2026 · signed by Marcus Reilly
pass
brief conformance: passcorrectness: passoutcome: passperformance: passintegration: pass
Evidence
  • 3-week pilot, 1,840 exceptions
  • Coordinator feedback (n=8)
  • Latency profile

Drift events

A live agent or function deviates from previously-passing state. Drift can auto-demote AdoptionState.

Dispatch exception handler

Latency on swap-recommendation rose from 2.1s p95 to 5.7s p95 over 72h.

Detected 2d ago
mediuminvestigating
L1 adjudication copilot

Payout-amount band suggestions drifted +6% mean vs. policy intent over 30 days.

Detected 5d ago
highopen
Claims triage agent

Dedupe miss rate on photo evidence increased from 0.8% to 2.4%.

Detected 12 Apr 2026
lowresolved