Attested Run id run-d70f62c1a050

claude-opus-4.7

on mmlu-pro · anthropic-claude · n=8 ⚠ statistically thin · Sun, 26 Apr 2026 07:53:30 GMT
Score: 62.5
95% CI [30.6, 86.3] · half-width ±27.9 (the interval is not centered on the score)
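The page does not say how the interval is computed. As a rough check, a Wilson score interval reproduces the published figures, assuming the score is plain accuracy so that 62.5 on 8 problems means 5 correct answers. The function name and that reading of the score are assumptions made for this sketch, not something the page states.

```python
import math
from statistics import NormalDist


def wilson_interval(successes: int, n: int, confidence: float = 0.95) -> tuple[float, float]:
    """Return the Wilson score interval for a binomial proportion."""
    # Two-sided critical value, about 1.96 for 95% confidence.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    p = successes / n
    denom = 1 + z * z / n
    # The Wilson interval is centered on a shrunken estimate, not on p itself.
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


# Assumption: 62.5 on n=8 corresponds to 5 correct answers out of 8.
lo, hi = wilson_interval(5, 8)
print(f"95% CI [{lo * 100:.1f}, {hi * 100:.1f}], half-width {(hi - lo) / 2 * 100:.1f}")
# prints: 95% CI [30.6, 86.3], half-width 27.9
```

Under this reading the interval is centered near 58.4 rather than 62.5, which is why the ± figure does not line up with the bounds when applied to the score directly.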
claude-opus-4.7 scored 62.5 on mmlu-pro across 8 problems. The transcript is committed to Merkle root sha256:5e2118a15b77a54… and signed by attestor benchlist-vercel-inline-0 with Ed25519 signature 54a98ecdaf9ded845a704b…. The signature is verified client-side in your browser, so no server round-trip is required.
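To illustrate what committing a transcript to a Merkle root involves, here is a minimal sketch. The page does not document the tree construction, so the leaf encoding, the `0x00`/`0x01` domain-separation prefixes, and the rule for an unpaired node are all assumptions chosen for illustration. This code is not expected to reproduce the root shown on this page.

```python
import hashlib


def _sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def merkle_root(leaves: list[bytes]) -> str:
    """Compute a Merkle root over transcript entries (hypothetical construction)."""
    if not leaves:
        raise ValueError("at least one leaf is required")
    # Prefix leaves with 0x00 and inner nodes with 0x01 so that a leaf
    # can never be confused with an inner node.
    level = [_sha256(b"\x00" + leaf) for leaf in leaves]
    while len(level) > 1:
        nxt = []
        for i in range(0, len(level) - 1, 2):
            nxt.append(_sha256(b"\x01" + level[i] + level[i + 1]))
        if len(level) % 2 == 1:
            # An unpaired node is promoted unchanged to the next level.
            nxt.append(level[-1])
        level = nxt
    return "sha256:" + level[0].hex()


# Hypothetical transcript entries, one per problem.
entries = [f"problem-{i}: answer".encode() for i in range(8)]
print(merkle_root(entries))
```

Changing any single entry changes the root, which is what lets a signature over the root stand in for a signature over the whole transcript.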
Dataset hash: sha256:f83f7230d012b45f7532fd0947ca596e7de52a518e7f9edcd0df4566b409bf9a
Methodology hash: sha256:7d4179c2b699af35bc95f0bd466e9b344ae348a765fbbca15e14adbc4ceb7072
Merkle root: sha256:5e2118a15b77a54801050dcd7aea3a922e463984c48ccfd752f8465bfd8395bd
Attestor pubkey: cb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11
Signature: 54a98ecdaf9ded845a704bc616dc4108ab7d9533636ec81f3ab196c8297edef56ed27358477ae91802fd474c08728ae02dcb7c06482dbad7a8ae2e7949cedf0b
Runner: benchlist-vercel-inline@1.0.0
Started: 2026-04-26T07:43:35.410Z
Finished: 2026-04-26T07:53:30.996Z
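Before attempting any cryptographic verification, the fields above can be sanity-checked for shape: SHA-256 digests are 32 bytes, Ed25519 public keys are 32 bytes, and Ed25519 signatures are 64 bytes. The sketch below does only that, plus the run duration. The `record` dictionary and its key names are hypothetical and do not reflect the schema of the raw JSON. It does not verify the signature, since the page does not say which bytes are signed.

```python
from datetime import datetime

# Values copied from the fields above; the key names are hypothetical.
record = {
    "dataset_hash": "sha256:f83f7230d012b45f7532fd0947ca596e7de52a518e7f9edcd0df4566b409bf9a",
    "methodology_hash": "sha256:7d4179c2b699af35bc95f0bd466e9b344ae348a765fbbca15e14adbc4ceb7072",
    "merkle_root": "sha256:5e2118a15b77a54801050dcd7aea3a922e463984c48ccfd752f8465bfd8395bd",
    "attestor_pubkey": "cb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11",
    "signature": (
        "54a98ecdaf9ded845a704bc616dc4108ab7d9533636ec81f3ab196c8297edef5"
        "6ed27358477ae91802fd474c08728ae02dcb7c06482dbad7a8ae2e7949cedf0b"
    ),
    "started": "2026-04-26T07:43:35.410Z",
    "finished": "2026-04-26T07:53:30.996Z",
}


def digest_bytes(value: str) -> bytes:
    """Decode a 'sha256:<hex>' string and check that it is 32 bytes long."""
    prefix, _, hex_part = value.partition(":")
    raw = bytes.fromhex(hex_part)
    if prefix != "sha256" or len(raw) != 32:
        raise ValueError(f"not a sha256 digest: {value!r}")
    return raw


def parse_utc(ts: str) -> datetime:
    # Replace the trailing 'Z' so that older Python versions accept the string.
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


for key in ("dataset_hash", "methodology_hash", "merkle_root"):
    digest_bytes(record[key])

pubkey = bytes.fromhex(record["attestor_pubkey"])
signature = bytes.fromhex(record["signature"])
duration = (parse_utc(record["finished"]) - parse_utc(record["started"])).total_seconds()

print(len(pubkey), len(signature), duration)
# prints: 32 64 595.586
```

The run therefore took just under ten minutes from start to finish.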
Not yet anchored on-chain. Anyone can anchor this run for roughly $0.01–$1.40 in gas at /anchor?run=run-d70f62c1a050