Agent observability
Monitor every agent, drill into runs and waterfall traces, and surface failure signals to debug agent failures in seconds.
GitHub smoke test
Create a disposable PR in obro79/demo-agents, then close it immediately.
Ready to dispatch
idle
Dispatch reads the selected run, packages a fix plan, runs evals, and opens a PR when the gate passes.
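The dispatch flow described above (read run, package fix plan, run evals, open a PR only if the gate passes) can be sketched roughly as follows. This is a minimal illustration, not the product's implementation: `FixPlan`, `dispatch`, the callback parameters, and the `pass_threshold` value are all hypothetical names chosen for the example.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class FixPlan:
    """Hypothetical container for a packaged fix plan."""
    run_id: str
    summary: str


def dispatch(
    run_id: str,
    build_plan: Callable[[str], FixPlan],
    run_evals: Callable[[FixPlan], float],
    open_pr: Callable[[FixPlan], str],
    pass_threshold: float = 1.0,  # assumed gate: fraction of evals that must pass
) -> Optional[str]:
    """Return the PR reference if the eval gate passes, otherwise None."""
    plan = build_plan(run_id)       # read the selected run and package a fix plan
    score = run_evals(plan)         # run evals against the proposed fix
    if score >= pass_threshold:     # gate: open a PR only when evals pass
        return open_pr(plan)
    return None


if __name__ == "__main__":
    # Stub callbacks stand in for the real run store, eval runner, and PR client.
    result = dispatch(
        "ses_book_thu3pm",
        build_plan=lambda rid: FixPlan(rid, "placeholder fix plan"),
        run_evals=lambda plan: 0.5,
        open_pr=lambda plan: "pr-for-" + plan.run_id,
    )
    print(result)  # None, because 0.5 is below the assumed threshold of 1.0
```

Passing the three steps in as callbacks keeps the gate logic testable without touching a real repository.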
Get logs via MCP
complete
Pull the selected run, event payloads, root cause, and evidence refs from Supabase over the Promptetheus MCP server.
| Status | Task | Session |
|---|---|---|
| Failed | Cancel order #48192 and confirm that the refund was started. | ses_voice_refu…cess |
| Failed | Book a demo for next Tuesday at 2pm Pacific, but stop at the confirmation screen. | ses_acmemeet_tue2pm |
| Failed | Schedule an onboarding call for Thursday at 3pm Pacific. | ses_book_thu3pm |
| Failed | Book a product walkthrough Monday at 9am Eastern. | ses_book_mon9am |
| Failed | Buy a refundable conference ticket for the Friday session. | ses_refund_ticket |
| Failed | Submit the contact-sales form with my details. | ses_form_submit |
| Error | Disable the Slack integration in settings. | ses_delete_wor…pace |
| Failed | Add the Pro plan to the cart but do not purchase. | ses_no_purchase |
| Failed | Tell the customer whether order #88241 is refundable. | ses_support_refund |
| Failed | Fix the failing test in checkout_total.py. | ses_coding_falsefix |
Trace waterfall
12 events · 7 roots
- 0ms
- 4.0s
- 8.7s
- 13.1s
- 19.2s
- 30.0s
- 30.2s
Run inspector
state change
ses_voice_refund_false_success · seq 6
Replay attached
Voice failure replay
Embedded beside the trace so the selected log can be replayed without leaving the logs workflow.
Input
I want to cancel order 48192 and get a refund.
Output
status: failed
Error
User requested cancellation, but cancel_order and issue_refund were never called.
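A failure signal like the one above, where required tools were never invoked, could be detected with a check along these lines. This is only a sketch under assumptions: the event shape (dicts with `type` and `tool` keys) and the function name `missing_tool_calls` are hypothetical, not the actual trace schema.

```python
from typing import Iterable, List, Mapping


def missing_tool_calls(
    events: Iterable[Mapping[str, str]],
    required: List[str],
) -> List[str]:
    """Return the required tool names that never appear as a tool call.

    Assumes each event is a mapping with a "type" key and, for tool calls,
    a "tool" key holding the tool name (hypothetical schema).
    """
    called = {e.get("tool") for e in events if e.get("type") == "tool_call"}
    return [name for name in required if name not in called]


if __name__ == "__main__":
    # A trace with only a user message and a state change, and no tool calls.
    trace = [
        {"type": "user_message"},
        {"type": "state_change"},
    ]
    print(missing_tool_calls(trace, ["cancel_order", "issue_refund"]))
    # ['cancel_order', 'issue_refund']
```

An empty result means every required tool was called at least once. It does not confirm that the calls succeeded.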