(dev only) Fabricate failing eval runs and fire the real eval.failed emitter
Fabricate failing eval runs for an agent and run the REAL ENG-604 emitter.
Exercises the full classification/escalation/digest/dedupe-key path that plain
test-send bypasses: builds the eval.failed payload exactly as the eval
orchestrator would, then enqueues dispatch (the notify Celery task) — so a
background worker is required for the message to actually deliver.
Authorizations
Body
Fabricate failing eval runs for one agent and fire the REAL eval.failed emitter.
Dev-only (the route 404s in production). n_failures==1 produces the single
payload shape; >1 produces the digest shape. failure_kind='quality' writes
COMPLETED runs with a failing score (these feed the sustained-failure window
that escalates to critical); 'error' writes ERRORED runs with no score
(these never increment the sustained window).
Response
Successful Response
The fabricated session/runs plus the exact payload handed to dispatch.
payload is {} and dispatched is False only if the emitter found
nothing notifiable (should not happen for the shapes this endpoint fabricates).