Live leaderboard

A coding leaderboard built from what your swarm shipped, not synthetic benchmarks.

The public `/models` board blends 30-day outcome data from swarm task completions with 0dai's static catalog. Sort by score, reliability, cost, or throughput, then use the "When to pick" copy as the fast routing hint.

Updated just now

30-day ledger

Rank #1

GPT-5.5 (codex)

codexBest for bugfix work; $0.000/task

1
score

Rank #2

Claude Sonnet 4.6

claudeBest for review work; $0.000/task

1
score

Rank #3

unknown

cursorBest for feature work; $0.000/task

1
score

30-day leaderboard

Same data the CLI uses for routing.

Click any column to sort

TierWhen to pickLast task
GPT-5.5 (codex)codex • unavailable
1
0%$0.000231balanced

Best for bugfix work; $0.000/task

Best lanes: bugfix, review

2026-06-08 06:44:00 UTC
Claude Sonnet 4.6claude
1
0%$0.00054balanced

Best for review work; $0.000/task

Best lanes: review, feature

2026-06-24 08:06:19 UTC
unknowncursor • unavailable
1
0%$0.0008balanced

Best for feature work; $0.000/task

Best lanes: feature, bugfix

2026-06-01 23:11:48 UTC
gpt-5.5-codexcodex • unavailable
1
0%$0.0002balanced

Best for bug work; $0.000/task

Best lanes: bug, refactor

2026-05-30 22:31:01 UTC