Mock fixtures

Gauntlet dashboard

You are viewing the frontend against mock fixtures. Sign in to switch to live backend data.

Surface
Design the product so insight emerges fast: status first, repeated patterns second, raw evidence last.
Mode
High-signal operator view for debugging agent behavior, docs quality, and product adoption risk.
Demo mode is active. Sign in to load your real projects and runs from the hosted backend.
Recent runs
2
Last 25 runs shown
Succeeded
1
Remote status = succeeded
Failed
0
Remote status = failed
Latest success rate
67%
Batch gauntlet_203

Recent batches

Runs submitted through the CLI or hosted backend.

View all runs
RunStatusCreatedBatchSuccessFailuresTop blame
run_demo_001succeededMay 16, 12:00 PMgauntlet_20367%3agent
run_demo_002runningMay 16, 2:40 PM0%
Top issue groups
SDK method-path mismatch
Agents emitted Steel.scrape instead of the tool's expected method path.
Repeated grounding loop
Some personas re-grounded instead of executing after docs retrieval.
Latest recommendations
Normalize SDK method-path variants
high
Map common SDK shapes onto the expected runtime contract.
Tighten finalization checks
medium
Require evidence-backed extraction answers before finalizing a run.