Live leaderboard

A coding leaderboard built from what your swarm shipped, not synthetic benchmarks.

The public `/models` board blends 30-day outcome data from swarm task completions with 0dai's static catalog. Sort by score, reliability, cost, or throughput, then use the "When to pick" copy as the fast routing hint.
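
As a rough illustration of the blend-then-sort idea described above, here is a minimal Python sketch. The record shapes (`CatalogEntry`, `Outcome`), the field names, and the `blend` and `sort_rows` helpers are all assumptions made for this example. They are not 0dai's actual schema or API.

```python
from dataclasses import dataclass


@dataclass
class CatalogEntry:
    """Hypothetical static-catalog record (field names are assumptions)."""
    model: str
    provider: str
    when_to_pick: str


@dataclass
class Outcome:
    """Hypothetical 30-day aggregate of swarm task completions."""
    model: str
    provider: str
    score: float
    reliability: float
    cost_per_task: float
    tasks: int


def blend(catalog, outcomes):
    """Join catalog entries with outcome aggregates on (model, provider)."""
    by_key = {(o.model, o.provider): o for o in outcomes}
    rows = []
    for entry in catalog:
        o = by_key.get((entry.model, entry.provider))
        rows.append({
            "model": entry.model,
            "provider": entry.provider,
            "when_to_pick": entry.when_to_pick,
            # Models with no recorded tasks keep None for outcome metrics.
            "score": o.score if o else None,
            "reliability": o.reliability if o else None,
            "cost_per_task": o.cost_per_task if o else None,
            "tasks": o.tasks if o else 0,
        })
    return rows


def sort_rows(rows, column, descending=True):
    """Sort by one column; rows missing that metric always go last."""
    present = [r for r in rows if r[column] is not None]
    missing = [r for r in rows if r[column] is None]
    present.sort(key=lambda r: r[column], reverse=descending)
    return present + missing
```

Under these assumptions, `sort_rows(blend(catalog, outcomes), "cost_per_task", descending=False)` would list the cheapest models first, with models that have no 30-day data at the end.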


30-day ledger

Rank #1

GPT-5.5 (codex)

codex • Best for bugfix work; $0.000/task

1
score

Rank #2

Claude Sonnet 4.6

claude • Best for greenfield features; $0.000/task

1
score

Rank #3

DeepSeek v4 Pro

opencode • Best for review work; $0.000/task

1
score

30-day leaderboard

Same data the CLI uses for routing.


Tier • When to pick • Last task
GPT-5.5 (codex) • codex • unavailable
1
0% • $0.000 • 134 • balanced

Best for bugfix work; $0.000/task

Best lanes: bugfix, feature

2026-05-11 13:52:56 UTC
Claude Sonnet 4.6 • claude
1
0% • $0.000 • 29.1m • 13 • balanced

Best for greenfield features; $0.000/task

Best lanes: feat, delegated

2026-04-19 15:12:42 UTC
DeepSeek v4 Pro • opencode • unavailable
1
0% • $0.000 • 4 • balanced

Best for review work; $0.000/task

Best lanes: review

2026-05-01 00:39:32 UTC
Gemini 3.1 Pro • gemini
1
0% • $0.000 • 4 • balanced

Best for greenfield features; $0.000/task

Best lanes: feat, delegated

2026-04-19 05:41:37 UTC
gpt-5.5-codex • cli • unavailable
1
0% • $0.000 • 28.3m • 4 • balanced

Best for general work; $0.000/task

Best lanes: general

2026-05-10 09:48:27 UTC
gpt-5.5-codex • codex • unavailable
1
0% • $0.000 • 3 • balanced

Best for general work; $0.000/task

Best lanes: general

2026-05-10 10:08:18 UTC
qoder-native • qoder • unavailable
1
0% • $0.000 • 2 • balanced

Best for bugfix work; $0.000/task

Best lanes: bugfix, feature

2026-04-30 23:21:40 UTC
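
The page says the CLI routes from this same data, but it does not say how. One plausible reading, sketched below in Python, is to filter rows by availability and by their "Best lanes" entry, then take the highest score. The `Row` type and `pick_model` function are hypothetical names invented for this example, and the selection rule itself is an assumption, not the CLI's documented behavior.

```python
from dataclasses import dataclass, field


@dataclass
class Row:
    """Hypothetical leaderboard row (field names are assumptions)."""
    model: str
    provider: str
    score: float
    best_lanes: list = field(default_factory=list)
    available: bool = True


def pick_model(rows, lane):
    """Return the highest-scoring available row whose best lanes include `lane`.

    Returns None when no available row lists the lane. On a score tie,
    max() keeps the first matching row in input order.
    """
    candidates = [r for r in rows if r.available and lane in r.best_lanes]
    if not candidates:
        return None
    return max(candidates, key=lambda r: r.score)
```

With rows shaped like the board above, a row marked "unavailable" would be skipped even if it lists the requested lane, and a lane that no available row lists would return `None`, so the caller can fall back to a default.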