Public benchmarks · updated live
Every two-model review AntFleet ran on a benchmark-class repo.
61 benchmarks and counting
updated 16 hours ago
Benchmark-class repos are public repos with a BENCHMARK.md file at the root. PRs there are not meant to merge; they exist to run a known diff past AntFleet's two-model unanimous consensus and publish the result. Click any row to read the bot review on GitHub.
Looking for closed-finding receipts instead? /receipts.
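To make the two ideas above concrete, here is a minimal Python sketch of a benchmark-class check and a unanimous-consensus filter. The page does not say how AntFleet matches findings between its two models, so the `Finding` key (path, line, rule) and both function names are assumptions made for illustration, not AntFleet's actual implementation.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class Finding:
    """Hypothetical finding key: two findings match when every field is equal."""
    path: str
    line: int
    rule: str


def is_benchmark_class(repo_root: Path) -> bool:
    """Treat a checkout as benchmark-class when BENCHMARK.md sits at its root."""
    return (repo_root / "BENCHMARK.md").is_file()


def unanimous(first: set[Finding], second: set[Finding]) -> set[Finding]:
    """Keep only the findings that both reviewers reported (set intersection)."""
    return first & second
```

Under this assumption, a row marked "0 findings (clean)" would correspond to an empty intersection, which can happen even when one model reported something on its own.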
Latest benchmarks
showing 50 of 61
- 2 findings · 2 files · review →
  AntFleet/agent-autonomopoly-bench · PR #6
  gpt-5 · claude-opus-4-7 · commit 4904a75 · 16 hours ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/agent-autonomopoly-bench · PR #5
  gpt-5 · claude-opus-4-7 · commit e7466d8 · 16 hours ago
- 2 findings · 15 files · review →
  AntFleet/agent-openhuman-bench · PR #2
  gpt-5 · claude-opus-4-7 · commit 59e046b · 17 hours ago
- 2 findings · 1 file · review →
  AntFleet/agent-openhuman-bench · PR #1
  gpt-5 · claude-opus-4-7 · commit 9df8938 · 17 hours ago
- 1 finding · 6 files · review →
  AntFleet/aeon-bench · PR #28
  gpt-5 · claude-opus-4-7 · commit 0059094 · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #27
  gpt-5 · claude-opus-4-7 · commit 4cb226d · 2 days ago
- 0 findings (clean) · 3 files · PR →
  AntFleet/aeon-bench · PR #26
  gpt-5 · claude-opus-4-7 · commit 4be9281 · 2 days ago
- 3 findings · 2 files · review →
  AntFleet/aeon-bench · PR #25
  gpt-5 · claude-opus-4-7 · commit 37d0c07 · 2 days ago
- 1 finding · 2 files · review →
  AntFleet/aeon-bench · PR #23
  gpt-5 · claude-opus-4-7 · commit 66bb888 · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #22
  gpt-5 · claude-opus-4-7 · commit 2625e16 · 2 days ago
- 1 finding · 2 files · review →
  AntFleet/aeon-bench · PR #21
  gpt-5 · claude-opus-4-7 · commit f8cace5 · 2 days ago
- 0 findings (clean) · PR →
  AntFleet/aeon-bench · PR #20
  commit c5dc24e · 2 days ago
- 0 findings (clean) · PR →
  AntFleet/aeon-bench · PR #19
  commit 72f8589 · 2 days ago
- 0 findings (clean) · 1 file · PR →
  AntFleet/aeon-bench · PR #18
  gpt-5 · claude-opus-4-7 · commit 1ec7a89 · 2 days ago
- 0 findings (clean) · 1 file · PR →
  AntFleet/aeon-bench · PR #17
  gpt-5 · claude-opus-4-7 · commit 9b290aa · 2 days ago
- 0 findings (clean) · PR →
  AntFleet/aeon-bench · PR #16
  commit c44ee52 · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #15
  gpt-5 · claude-opus-4-7 · commit 334edff · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #14
  gpt-5 · claude-opus-4-7 · commit 3a53384 · 2 days ago
- 2 findings · 2 files · review →
  AntFleet/aeon-bench · PR #12
  gpt-5 · claude-opus-4-7 · commit 76b3059 · 2 days ago
- 0 findings (clean) · PR →
  AntFleet/aeon-bench · PR #11
  commit ba745df · 2 days ago
- 0 findings (clean) · PR →
  AntFleet/aeon-bench · PR #10
  commit f6726cf · 2 days ago
- 0 findings (clean) · 5 files · PR →
  AntFleet/aeon-bench · PR #8
  gpt-5 · claude-opus-4-7 · commit a9a960b · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #7
  gpt-5 · claude-opus-4-7 · commit b190012 · 2 days ago
- 0 findings (clean) · PR →
  AntFleet/aeon-bench · PR #5
  commit 14fb422 · 2 days ago
- 2 findings · 1 file · review →
  AntFleet/aeon-bench · PR #30
  gpt-5 · claude-opus-4-7 · commit fac2cd3 · 2 days ago
- 2 findings · 3 files · review →
  AntFleet/aeon-bench · PR #29
  gpt-5 · claude-opus-4-7 · commit 79a346f · 2 days ago
- 0 findings (clean) · 6 files · PR →
  AntFleet/aeon-bench · PR #28
  gpt-5 · claude-opus-4-7 · commit 0b9d165 · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #27
  gpt-5 · claude-opus-4-7 · commit bfcf2bf · 2 days ago
- 0 findings (clean) · 3 files · PR →
  AntFleet/aeon-bench · PR #26
  gpt-5 · claude-opus-4-7 · commit df29156 · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #25
  gpt-5 · claude-opus-4-7 · commit 65c227d · 2 days ago
- 1 finding · 2 files · review →
  AntFleet/aeon-bench · PR #24
  gpt-5 · claude-opus-4-7 · commit 1c841f9 · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #23
  gpt-5 · claude-opus-4-7 · commit e2819cd · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #22
  gpt-5 · claude-opus-4-7 · commit 92e0cb5 · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #21
  gpt-5 · claude-opus-4-7 · commit f629516 · 2 days ago
- 0 findings (clean) · PR →
  AntFleet/aeon-bench · PR #20
  commit 4fa6133 · 2 days ago
- 0 findings (clean) · PR →
  AntFleet/aeon-bench · PR #19
  commit a2c6038 · 2 days ago
- 0 findings (clean) · 1 file · PR →
  AntFleet/aeon-bench · PR #18
  gpt-5 · claude-opus-4-7 · commit cec727c · 2 days ago
- 0 findings (clean) · 1 file · PR →
  AntFleet/aeon-bench · PR #17
  gpt-5 · claude-opus-4-7 · commit ac2bbab · 2 days ago
- 0 findings (clean) · PR →
  AntFleet/aeon-bench · PR #16
  commit cfa141c · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #15
  gpt-5 · claude-opus-4-7 · commit bea20ed · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #14
  gpt-5 · claude-opus-4-7 · commit f7ac48a · 2 days ago
- 1 finding · 7 files · review →
  AntFleet/aeon-bench · PR #13
  gpt-5 · claude-opus-4-7 · commit a0cf0e1 · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #12
  gpt-5 · claude-opus-4-7 · commit 84da39b · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #11
  gpt-5 · claude-opus-4-7 · commit b487885 · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #10
  gpt-5 · claude-opus-4-7 · commit 55d430f · 2 days ago
- 1 finding · 2 files · review →
  AntFleet/aeon-bench · PR #9
  gpt-5 · claude-opus-4-7 · commit 1ae5b6c · 2 days ago
- 0 findings (clean) · 5 files · PR →
  AntFleet/aeon-bench · PR #8
  gpt-5 · claude-opus-4-7 · commit f9198d5 · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #7
  gpt-5 · claude-opus-4-7 · commit 577a4eb · 2 days ago
- 2 findings · 3 files · review →
  AntFleet/aeon-bench · PR #6
  gpt-5 · claude-opus-4-7 · commit 30c37a1 · 2 days ago
- 0 findings (clean) · 2 files · PR →
  AntFleet/aeon-bench · PR #5
  gpt-5 · claude-opus-4-7 · commit 729f73d · 2 days ago