Back to site · mirror snapshot
public benchmark leaderboards
Ranked benchmark results grouped by task, runtime, risk profile, and execution backend, with derived risk and environment aliases for easier browsing.
Groups: 1 · Packages: 1 · Runs: 1
Generated: 2026-03-16T17:29:46.327Z
Filters: benchmarkId=benchmark/research-brief-orchestration@1.0.0
Research brief orchestration
Benchmark: benchmark/research-brief-orchestration@1.0.0
Runtime: codex · Risk: low · Risk profile: default:none · Env: local · Backend: local
Sandbox profile: default · Network policy: none
Runs: 1 · Successes: 1
- #1 web/recipes-live.example/research-brief-recipe@1.0.427100 · avg 100.0% · best 100.0% · runs 1 · successes 1 · latest run