Attested Run · id run-df07f5028fbc

claude-opus-4.7

on mmlu · anthropic-claude · n=8 ⚠ statistically thin · Sun, 26 Apr 2026 08:38:19 GMT
Score: 100.0 ±16.2 · 95% CI [67.6, 100.0]
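The page does not say how the interval is computed. The reported figures are consistent with a Wilson score interval for 8 correct answers out of 8, so the sketch below assumes that method; `wilson_interval` is a hypothetical helper name, not something taken from the benchmark runner.

```python
import math

def wilson_interval(successes: int, n: int, z: float = 1.959964) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion, as fractions in [0, 1].

    Assumption: the page's 95% CI uses this method; the source does not say so.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    p_hat = successes / n
    denom = 1 + z * z / n
    centre = (p_hat + z * z / (2 * n)) / denom
    half = z * math.sqrt(p_hat * (1 - p_hat) / n + z * z / (4 * n * n)) / denom
    # Clamp to [0, 1] to absorb floating-point overshoot at the boundaries.
    return max(0.0, centre - half), min(1.0, centre + half)

# 8 of 8 problems correct, as reported for this run.
low, high = wilson_interval(8, 8)
print(f"95% CI [{100 * low:.1f}, {100 * high:.1f}]")
print(f"half-width ±{100 * (high - low) / 2:.1f}")
```

With only 8 problems the lower bound sits near 67.6 even at a perfect score, which is why the run is flagged as statistically thin.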
claude-opus-4.7 scored 100.0 on mmlu across 8 problems. The transcript is committed to Merkle root sha256:7035cfd5d4fc19a… and signed by attestor benchlist-vercel-inline-0 (Ed25519 signature a6828a3772b04e4b67ac68…). The signature is verified in your browser below; no server round-trip is required.
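The page does not describe how the transcript is hashed into its Merkle root. As a rough illustration of the idea only, the sketch below assumes one byte string per problem, SHA-256 leaf hashes, parents formed by hashing the two concatenated child digests, and an odd node paired with itself. The function name `merkle_root` and the placeholder records are hypothetical, and the real construction may differ.

```python
import hashlib

def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def merkle_root(records: list[bytes]) -> str:
    """Merkle root over records, under the assumptions stated in the lead-in."""
    if not records:
        raise ValueError("need at least one record")
    level = [sha256(r) for r in records]  # leaf hashes
    while len(level) > 1:
        if len(level) % 2 == 1:
            level.append(level[-1])  # pair an odd trailing node with itself
        level = [sha256(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
    return "sha256:" + level[0].hex()

# Placeholder records standing in for the 8 per-problem transcripts.
records = [f"problem-{i}: placeholder transcript".encode() for i in range(8)]
root = merkle_root(records)

# Changing any single record changes the root, which is what makes the
# commitment useful: a signature over the root covers every record.
tampered = records.copy()
tampered[3] = b"problem-3: edited transcript"
print(root != merkle_root(tampered))
```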
Dataset hash: sha256:05ef744f592cd2481092a6ecdecbccaf5e515f6ac2be7d5fc77ad85b8165f15c
Methodology hash: sha256:f65dba1e549ab81ea004be624791ae7b7b3e784648c0cb2ce84b8bf930bb0457
Merkle root: sha256:7035cfd5d4fc19ac5eab731a6906ba3e7d1ca7b86b976ad0af37523097e35be7
Attestor pubkey: cb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11
Signature: a6828a3772b04e4b67ac683c13e7d96499f56d97dad0c14c935686bc7a9265cd4e4c515885fdadaf5b10e015c3d632d75d3dcb2e086dc161286f3cc994d3860e
Runner: benchlist-vercel-inline@1.0.0
Started: 2026-04-26T08:37:38.425Z
Finished: 2026-04-26T08:38:19.783Z
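Before attempting a full signature check, the fields above can be sanity-checked for shape. The sketch below is a minimal pre-check, not a verification: it confirms that the Merkle root and public key decode to 32 bytes, that the signature decodes to 64 bytes (the sizes Ed25519 and SHA-256 produce), that the truncated values in the summary are prefixes of the full ones, and it derives the run duration. The dictionary keys are hypothetical names; the values are copied from this page.

```python
from datetime import datetime

# Values copied from this page; the keys are hypothetical, not a known schema.
run = {
    "merkle_root": "sha256:7035cfd5d4fc19ac5eab731a6906ba3e7d1ca7b86b976ad0af37523097e35be7",
    "pubkey": "cb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11",
    "signature": (
        "a6828a3772b04e4b67ac683c13e7d96499f56d97dad0c14c935686bc7a9265cd"
        "4e4c515885fdadaf5b10e015c3d632d75d3dcb2e086dc161286f3cc994d3860e"
    ),
    "started": "2026-04-26T08:37:38.425Z",
    "finished": "2026-04-26T08:38:19.783Z",
}

def hex_len(value: str) -> int:
    """Byte length of a hex string, ignoring an optional 'sha256:' prefix."""
    return len(bytes.fromhex(value.split(":", 1)[-1]))

def parse_utc(ts: str) -> datetime:
    # Replace the trailing 'Z' so older Python versions can parse the timestamp.
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))

checks = {
    "merkle root is 32 bytes": hex_len(run["merkle_root"]) == 32,
    "pubkey is 32 bytes": hex_len(run["pubkey"]) == 32,
    "signature is 64 bytes": hex_len(run["signature"]) == 64,
    "summary root prefix matches": run["merkle_root"].startswith("sha256:7035cfd5d4fc19a"),
    "summary signature prefix matches": run["signature"].startswith("a6828a3772b04e4b67ac68"),
}
duration = (parse_utc(run["finished"]) - parse_utc(run["started"])).total_seconds()

for name, ok in checks.items():
    print(f"{'ok' if ok else 'FAIL'}  {name}")
print(f"duration: {duration:.3f} s")
```

These checks only catch malformed or mismatched fields. Verifying the signature itself needs an Ed25519 implementation and the exact byte string that was signed, which this page does not specify.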
Not yet anchored on-chain. Anyone can anchor this run for ~$0.01–$1.40 in gas at /anchor?run=run-df07f5028fbc