Attested Run id run-e711091df9bf

claude-haiku-4-5

on gsm8k · anthropic-claude · n=8 ⚠ statistically thin · Sun, 26 Apr 2026 16:44:14 GMT
100.0
±16.2 · 95% CI [67.6, 100.0]
claude-haiku-4-5 scored 100.0 on gsm8k across 8 problems. The transcript is committed to Merkle root sha256:90ee1aba6c30e1d… and signed by attestor benchlist-vercel-inline-0 with Ed25519 signature ddfbded1ac5adef83b1892…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:09a35a0a0a48f13840457c82e2c2da6a7884ec21b51154139867843c2e4da5c7
Methodology hashsha256:144e8efdcdb66a248c57935cea7c8d00cbc6c287341355ab753cc5f445238bfb
Merkle rootsha256:90ee1aba6c30e1d9c5159824856789d667fd028c5deb26736c9481ecdee12700
Attestor pubkeycb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11
Signatureddfbded1ac5adef83b1892b2aedc4141fdd4c65c7a8227455eee44dda1f5cfc7c1a26e60777376e9a41bf7aa1fe70a0b4955b833a24f0700669694af666e240b
Runnerbenchlist-vercel-inline@1.0.0
Started2026-04-26T16:43:01.953Z
Finished2026-04-26T16:44:14.333Z
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-e711091df9bf
Best per benchmark → gsm8k guide → Anchor on-chain → Dispute