Layer 5
L5 · Assessment
Specs say what good looks like. Runs prove it. Drift detection catches it slipping. Everything is replayable, signed, and bound to release gates.
Specs: 3
Runs (30d): 3 (2 pass · 1 marginal)
Drift open: 2 (1 high severity)
Pass rate: 67%
Assessment specs
Each spec defines the dimensions that matter for a change unit and who must sign.
Triage agent v1.x
Claims Triage Agent — release gate
Dimensions
brief conformance · correctness · outcome · safety · regulatory
Signatories
Head of Claims · Agent Steward · Compliance
L1 copilot v0.x
L1 Adjudication Copilot — release gate
Dimensions
brief conformance · correctness · outcome · workforce-intent · safety
Signatories
Head of Claims · Chief People Officer · Agent Steward
Dispatch coach v1.x
Dispatch Coach Agent — release gate
Dimensions
brief conformance · correctness · outcome · performance · integration
Signatories
Head of Dispatch · Agent Steward
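A spec, as described above, pairs a set of dimensions with the roles that must sign. The sketch below is one hypothetical way to model that in Python. The class name `AssessmentSpec`, its field names, and both helper functions are assumptions made for illustration, not the product's actual schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AssessmentSpec:
    """Hypothetical shape of a spec: what is assessed and who must sign."""
    name: str
    dimensions: tuple[str, ...]
    signatories: tuple[str, ...]


# Values copied from the Claims Triage Agent release gate card.
TRIAGE_GATE = AssessmentSpec(
    name="Claims Triage Agent release gate",
    dimensions=("brief conformance", "correctness", "outcome", "safety", "regulatory"),
    signatories=("Head of Claims", "Agent Steward", "Compliance"),
)


def missing_signatories(spec: AssessmentSpec, signed_roles: set[str]) -> list[str]:
    """Return the required roles that have not signed yet, in spec order."""
    return [role for role in spec.signatories if role not in signed_roles]


def unscored_dimensions(spec: AssessmentSpec, verdicts: dict[str, str]) -> list[str]:
    """Return the spec dimensions that have no verdict in a run."""
    return [dim for dim in spec.dimensions if dim not in verdicts]
```

Under these assumptions, `missing_signatories(TRIAGE_GATE, {"Head of Claims", "Compliance"})` returns `["Agent Steward"]`, which is the kind of check a release gate could run before accepting a signed run.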
Recent runs
A run is a single, replayable execution of a spec. Each dimension gets a verdict.
Claims Triage Agent — release gate
Run on 22 Apr 2026 · signed by Aisha Patel, Priya Raman, Compliance — auto
brief conformance: pass · correctness: pass · outcome: pass · safety: pass · regulatory: pass
Evidence
- 470 backtested claims
- 12 live shadow days
- Compliance review note
L1 Adjudication Copilot — release gate
Run on 26 Apr 2026 · signed by Aisha Patel, Priya Raman
brief conformance: pass · correctness: marginal · outcome: pass · workforce-intent: pass · safety: marginal
Evidence
- 210 supervised cases
- Workforce-intent survey (n=12)
- Policy-drift evaluator pack
Dispatch Coach Agent — release gate
Run on 29 Apr 2026 · signed by Marcus Reilly
brief conformance: pass · correctness: pass · outcome: pass · performance: pass · integration: pass
Evidence
- 3-week pilot, 1,840 exceptions
- Coordinator feedback (n=8)
- Latency profile
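The page does not say how per-dimension verdicts roll up into a run verdict. The sketch below assumes the simplest rule, that a run takes its worst dimension verdict, and shows that this assumption reproduces the summary figures (2 pass, 1 marginal, 67% pass rate) from the three runs listed above. The `fail` level, the function names, and the rollup rule itself are assumptions.

```python
# Assumed ordering from best to worst. "fail" does not appear on the page
# and is included only so the scale is complete.
SEVERITY = {"pass": 0, "marginal": 1, "fail": 2}

# Per-dimension verdicts copied from the three runs listed above.
RUNS = {
    "Claims Triage Agent": {
        "brief conformance": "pass", "correctness": "pass", "outcome": "pass",
        "safety": "pass", "regulatory": "pass",
    },
    "L1 Adjudication Copilot": {
        "brief conformance": "pass", "correctness": "marginal", "outcome": "pass",
        "workforce-intent": "pass", "safety": "marginal",
    },
    "Dispatch Coach Agent": {
        "brief conformance": "pass", "correctness": "pass", "outcome": "pass",
        "performance": "pass", "integration": "pass",
    },
}


def overall_verdict(dimension_verdicts: dict[str, str]) -> str:
    """Roll up a run as its worst dimension verdict (assumed rule)."""
    return max(dimension_verdicts.values(), key=SEVERITY.__getitem__)


def pass_rate(runs: dict[str, dict[str, str]]) -> float:
    """Fraction of runs whose rolled-up verdict is 'pass'."""
    overall = [overall_verdict(verdicts) for verdicts in runs.values()]
    return overall.count("pass") / len(overall)


if __name__ == "__main__":
    for name, verdicts in RUNS.items():
        print(f"{name}: {overall_verdict(verdicts)}")
    print(f"Pass rate: {pass_rate(RUNS):.0%}")
```

With this rule, two marginal dimensions on the L1 Adjudication Copilot run make the whole run marginal, so two of three runs pass.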
Drift events
A drift event is recorded when a live agent or function deviates from its previously passing state. Drift can automatically demote the AdoptionState.
Dispatch exception handler
Latency on swap-recommendation rose from 2.1s p95 to 5.7s p95 over 72h.
Detected 2d ago
medium · investigating
L1 adjudication copilot
Payout-amount band suggestions drifted +6% mean vs. policy intent over 30 days.
Detected 5d ago
high · open
Claims triage agent
Dedupe miss rate on photo evidence increased from 0.8% to 2.4%.
Detected 12 Apr 2026
low · resolved
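How drift leads to demotion is not specified on the page. As a rough sketch, the code below measures drift as relative change against the previously passing baseline and demotes one step when any high-severity event is unresolved. The adoption ladder values, the demotion rule, and all names here are hypothetical. Only the numbers in the usage note come from the events above.

```python
from dataclasses import dataclass

# Hypothetical AdoptionState ladder, lowest to highest. The page names
# AdoptionState but does not list its values.
ADOPTION_LADDER = ("shadow", "supervised", "autonomous")


@dataclass(frozen=True)
class DriftEvent:
    subject: str
    severity: str  # "low", "medium" or "high", as shown on the event cards
    status: str    # "open", "investigating" or "resolved"


def relative_change(before: float, after: float) -> float:
    """Change relative to the previously passing baseline (1.0 means +100%)."""
    return (after - before) / before


def demote(state: str) -> str:
    """Step one rung down the assumed ladder, stopping at the lowest rung."""
    index = ADOPTION_LADDER.index(state)
    return ADOPTION_LADDER[max(index - 1, 0)]


def next_state(state: str, events: list[DriftEvent]) -> str:
    """Assumed policy: any unresolved high-severity drift demotes one step."""
    blocking = any(e.severity == "high" and e.status != "resolved" for e in events)
    return demote(state) if blocking else state
```

For the events listed above, `relative_change(2.1, 5.7)` is about 1.71 (p95 latency up roughly 171%), and `relative_change(0.8, 2.4)` is about 2.0 (the dedupe miss rate tripled). Under the assumed policy, only the open high-severity event on the L1 adjudication copilot would trigger a demotion.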