Attested Run id run-5c103412a8eb

claude-sonnet-4.5

on gsm8k · anthropic-claude · n=3 ⚠ statistically thin · Sun, 26 Apr 2026 07:24:33 GMT
100.0
±28.1 · 95% CI [43.8, 100.0]
claude-sonnet-4.5 scored 100.0 on gsm8k across 3 problems. The transcript is committed to Merkle root sha256:ae009bdb2ad039f… and signed by attestor benchlist-vercel-inline-0 with Ed25519 signature 2992cd508764351b5971f3…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:09a35a0a0a48f13840457c82e2c2da6a7884ec21b51154139867843c2e4da5c7
Methodology hashsha256:144e8efdcdb66a248c57935cea7c8d00cbc6c287341355ab753cc5f445238bfb
Merkle rootsha256:ae009bdb2ad039f4c0d29ad946dfdb29ca755b3a1b8c1b7a228064cc1b75137f
Attestor pubkeycb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11
Signature2992cd508764351b5971f30b4e32f58da2d5abb9211958092ceb2988b0a71a277200ad26695e226eceba27e865c3eb122fe22aa1ac2d85e65c3c5053874d890d
Runnerbenchlist-vercel-inline@1.0.0
Started2026-04-26T07:24:21.585Z
Finished2026-04-26T07:24:33.356Z
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-5c103412a8eb
Best per benchmark → gsm8k guide → Anchor on-chain → Dispute