Start an eval run
Create and start a new run against an existing prompt set. The run enqueues one agentic job per prompt and begins executing immediately. Returns the newly created run with its initial per-prompt result rows.
Authorizations
Can be either an Organization API Key or Personal Access Token (PAT).
Include in the Authorization header as: Bearer YOUR_TOKEN
Body
The prompt set to execute.
"550e8400-e29b-41d4-a716-446655440000"
Optional human-readable description for the run. Omit or pass null to leave it unset. Max 1024 characters.
1024"Re-running after switching to gpt-4o for query generation"
Per-run configuration. Optional — omit if no overrides.
Response
Run created and jobs enqueued.
Number of per-prompt agentic jobs created for this run (one per prompt that fanned out successfully). Enqueue onto the work queue happens after creation and is best-effort, so this count reflects jobs created, not necessarily those successfully enqueued.
12
The newly created run with its initial results.