Attested Run id run-a8efa2ab0d5d

claude-sonnet-4.5

on mmlu-pro · anthropic-claude · n=8 ⚠ statistically thin · Sun, 26 Apr 2026 08:08:55 GMT
25.0
±26.0 · 95% CI [7.1, 59.1]
claude-sonnet-4.5 scored 25.0 on mmlu-pro across 8 problems. The transcript is committed to Merkle root sha256:e7c80ce1754ad43… and signed by attestor benchlist-vercel-inline-0 with Ed25519 signature d167d2a0668c85094f34f7…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:f83f7230d012b45f7532fd0947ca596e7de52a518e7f9edcd0df4566b409bf9a
Methodology hashsha256:7d4179c2b699af35bc95f0bd466e9b344ae348a765fbbca15e14adbc4ceb7072
Merkle rootsha256:e7c80ce1754ad43bafd10f5ec6b3b244ef3ae1a395d403a4daab043feca01048
Attestor pubkeycb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11
Signatured167d2a0668c85094f34f70f29b1313fe80b5f365c21b03ff98210b34794b40b5288e173d9c51ce49d1cf199957f54512cc88b1fc4c10f290262b845bc3b4a0b
Runnerbenchlist-vercel-inline@1.0.0
Started2026-04-26T08:08:42.583Z
Finished2026-04-26T08:08:55.329Z
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-a8efa2ab0d5d
Best per benchmark → mmlu-pro guide → Anchor on-chain → Dispute