Sandbox
for a single prompt engineer testing the notebook.
- →Unlimited cells, unlimited drafts
- →Single seat · single project
- →Per-token cost subscripts (4 providers)
- →Eval gate disabled (no promotes)
- →Community Discord
No per-seat extortion. No usage caps that triple your invoice when you actually ship. One number: $0.05 for every prompt version that clears the eval gate and gets promoted to HEAD. Drafts, dry-runs, and rollbacks are free.
for a single prompt engineer testing the notebook.
the honest wedge — you pay only when a version ships.
for teams running >1K promotes/mo with SSO + SLA + audit.
A semver bump that clears the eval gate (HumanEval ≥ HEAD − 5%) and lands on HEAD. Drafts, dry-runs, rolled-back versions, and override-rejected promotes do not count.
No. The Lab tier is pure per-promote billing. Run 10 cells/day or 10,000 — the invoice only tracks promotions that shipped.
No. Cell runs, eval-set executions, and the right-rail provider projections are all free at the Lab tier. We charge when you commit a release.
At Lab, yes — upload any JSONL with prompt+expected. At Foundry, the eval gate is fully configurable, including private benchmark sets behind your own auth.