Attested run · ID run-54508ca4cc03

claude-opus-4-7

on gsm8k · anthropic-claude · n=8 ⚠ statistically thin · Sun, 26 Apr 2026 18:30:40 GMT
Score: 100.0
±16.2 · 95% CI [67.6, 100.0]
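The page does not say how the interval is computed. As a rough check, the sketch below assumes a Wilson score interval on 8 correct answers out of 8, which reproduces the reported bounds. Both the choice of method and the function name `wilson_interval` are assumptions for illustration, not something the run record states.

```python
import math


def wilson_interval(successes: int, n: int, z: float = 1.959964) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z defaults to ~95%)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    # Clamp to [0, 1] to absorb floating-point drift at the boundaries.
    return max(0.0, center - half), min(1.0, center + half)


# Assumption: a score of 100.0 on n=8 means 8 of 8 problems were correct.
low, high = wilson_interval(8, 8)
half_width = (high - low) / 2
print(f"95% CI [{100 * low:.1f}, {100 * high:.1f}], half-width {100 * half_width:.1f}")
```

Under that assumption the ±16.2 figure is half the width of the interval, not a symmetric margin around 100.0, which is why the upper bound stays at 100.0. With n=8 the lower bound is set almost entirely by the sample size.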
claude-opus-4-7 scored 100.0 on gsm8k across 8 problems. The transcript is committed to Merkle root sha256:d2758a26ea4d3e8… and signed by attestor benchlist-vercel-inline-0 with Ed25519 signature 2ca399f3de8e9fe22990c5…. The signature is verified in your browser, with no server round-trip required.
Dataset hash: sha256:09a35a0a0a48f13840457c82e2c2da6a7884ec21b51154139867843c2e4da5c7
Methodology hash: sha256:144e8efdcdb66a248c57935cea7c8d00cbc6c287341355ab753cc5f445238bfb
Merkle root: sha256:d2758a26ea4d3e87c767f52aa0ec28002129218f82849fc5ade390d4923a48c0
Attestor pubkey: cb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11
Signature: 2ca399f3de8e9fe22990c5310c57af681c53f9635b822b3f3d8be064f88f25ed8776e3f6d08dc0c0bf5af07b0a8df9d3d00f109f628b1d3be085b54e7d806306
Runner: benchlist-vercel-inline@1.0.0
Started: 2026-04-26T18:29:29.385Z
Finished: 2026-04-26T18:30:40.824Z
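To show what committing a transcript to a Merkle root can look like, here is a minimal sketch using only the standard library. The page does not specify the construction, so everything here is an assumption: the leaf encoding (SHA-256 of each raw record), the pairing rule (SHA-256 of the two concatenated child digests), the handling of odd levels (duplicating the last node), and the names `merkle_root` and `records`. It will not necessarily reproduce the root shown above.

```python
import hashlib


def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def merkle_root(records: list[bytes]) -> str:
    """Fold per-record hashes pairwise into a single root digest.

    Hypothetical construction: the real attestor may encode leaves,
    order children, or treat odd-sized levels differently.
    """
    if not records:
        raise ValueError("at least one record is required")
    level = [sha256(r) for r in records]  # leaf hashes
    while len(level) > 1:
        if len(level) % 2 == 1:
            level.append(level[-1])  # assumption: duplicate the last node
        level = [sha256(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
    return "sha256:" + level[0].hex()


# Illustrative records only; these are not the run's transcript entries.
records = [f"problem-{i}: answer".encode() for i in range(8)]
print(merkle_root(records))
```

The useful property is that changing any single record changes the root, so a signature over the root covers the whole transcript without signing each entry separately.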
Not yet anchored on-chain. Anyone can anchor this run for roughly $0.01–$1.40 in gas at /anchor?run=run-54508ca4cc03