Attested Run id run-3d3c1a3f02af

claude-haiku-4-5

on gsm8k · anthropic-claude · n=8 ⚠ statistically thin · Mon, 27 Apr 2026 01:06:26 GMT
100.0
±16.2 · 95% CI [67.6, 100.0]
claude-haiku-4-5 scored 100.0 on gsm8k across 8 problems. The transcript is committed to Merkle root sha256:7c8b12cc5d0c348… and signed by attestor benchlist-vercel-inline-1 with Ed25519 signature 869d9ee06c82b4d3c675b8…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:09a35a0a0a48f13840457c82e2c2da6a7884ec21b51154139867843c2e4da5c7
Methodology hashsha256:144e8efdcdb66a248c57935cea7c8d00cbc6c287341355ab753cc5f445238bfb
Merkle rootsha256:7c8b12cc5d0c348124648dc718e845b4190606d57d7e50ff72ba258402c1daad
Attestor pubkey042eeb98bd82298204732dcba981c64b4f329e44a13d750c104c5ec9c1de5498
Signature869d9ee06c82b4d3c675b8909b8f8808b4a5e95af417d4be0dba0e7d6da908951cbd87fc20d88021efca25d56c0a62e073eb4bfed751373bde107a55ce9f3703
Runnerbenchlist-vercel-inline@1.0.0
Started2026-04-27T01:06:22.209Z
Finished2026-04-27T01:06:26.439Z
Runner provenance · what code produced this
how to verify →
versionbenchlist-vercel-inline@1.0.0 commit3b3562b5a0ff421dfb1530aaef628667a31dbd93 repogithub.com/benchlist/runner adaptersha256:inline-js:gsm8k judgesha256:inline-js:scoreOne digestsha256:7e9f1dfe479c7831a6b11092144a02bab7c52861802c40dbca2faeaec47a46d1
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-3d3c1a3f02af
Best per benchmark → gsm8k guide → Anchor on-chain → Dispute