Attested Run id run-3fff093406bb

claude-opus-4-7

on humaneval · anthropic-claude · n=8 ⚠ statistically thin · Mon, 27 Apr 2026 01:10:05 GMT
0.0
±16.2 · 95% CI [0.0, 32.4]
claude-opus-4-7 scored 0.0 on humaneval across 8 problems. The transcript is committed to Merkle root sha256:4e1bb61936f2a21… and signed by attestor benchlist-vercel-inline-1 with Ed25519 signature a67941cedb515120e931af…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:3e1eb278fb45e71a150b896866387eae8c5bf42c0618c1a543fd5bb03cd3edaf
Methodology hashsha256:a2c80bc70417578adad51e92cd412d7b79be24c225b1794bfabb87dd741ccf24
Merkle rootsha256:4e1bb61936f2a216b0f708b97a366bd3b2155f642d176c7c5d3bd595db0a1ed0
Attestor pubkey042eeb98bd82298204732dcba981c64b4f329e44a13d750c104c5ec9c1de5498
Signaturea67941cedb515120e931af7b076240a8bf7cfbf00296243a922d4e45f9b52c6567c2da45084326da60bcf1dad389e9a0cae5f5e68c33b7047bb20b42585f090f
Runnerbenchlist-vercel-inline@1.0.0
Started2026-04-27T01:08:12.215Z
Finished2026-04-27T01:10:05.156Z
Runner provenance · what code produced this
how to verify →
versionbenchlist-vercel-inline@1.0.0 commit3b3562b5a0ff421dfb1530aaef628667a31dbd93 repogithub.com/benchlist/runner adaptersha256:inline-js:humaneval judgesha256:inline-js:scoreOne digestsha256:a17c77357134be86c8f0dbbe13c65e4f3852e1695d3192bc7a56b82c9dd900a1
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-3fff093406bb
Best per benchmark → humaneval guide → Anchor on-chain → Dispute