Attested Run id run-920a62aadab5

claude-sonnet-4.5

on mmlu · anthropic-claude · n=8 ⚠ statistically thin · Sun, 26 Apr 2026 08:37:49 GMT
Score: 62.5 ±27.9 · 95% CI [30.6, 86.3]
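The page does not say how the interval is computed. The reported figures are, however, consistent with a Wilson score interval for 5 correct answers out of 8 at z = 1.96. The sketch below rests on that assumption: the count of 5 is inferred from 62.5 with n=8, and the function name `wilson_interval` is hypothetical.

```python
import math

def wilson_interval(correct: int, n: int, z: float = 1.96) -> tuple[float, float, float]:
    """Return (lower, upper, half_width) of the Wilson score interval, in percent."""
    p = correct / n
    denom = 1 + z**2 / n
    # The Wilson interval is centered on a shrunken estimate, not on p itself.
    center = (p + z**2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return 100 * (center - half), 100 * (center + half), 100 * half

# Assumption: 62.5 on n=8 corresponds to 5 of 8 problems answered correctly.
lower, upper, half = wilson_interval(correct=5, n=8)
print(f"{100 * 5 / 8:.1f} ±{half:.1f} · 95% CI [{lower:.1f}, {upper:.1f}]")
# prints: 62.5 ±27.9 · 95% CI [30.6, 86.3]
```

Under this reading the interval is centered near 58.4 rather than on 62.5, so the ±27.9 is the half-width of the interval and not a symmetric margin around the score. That width is what the "statistically thin" warning reflects.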
claude-sonnet-4.5 scored 62.5 on mmlu across 8 problems. The transcript is committed to Merkle root sha256:8450e2b4d565c0b… and signed by attestor benchlist-vercel-inline-0 with Ed25519 signature dac61c24204d49d4eaca4a…. The signature is verified in the browser, with no server round-trip required.
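To illustrate what committing a transcript to a Merkle root means, here is a minimal sketch. The page does not document the tree construction, so everything about the scheme is an assumption: one leaf per transcript entry, SHA-256 at every level, and duplication of the last node on odd-sized levels. The transcript entries are placeholders, not the real run data.

```python
import hashlib

def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def merkle_root(leaves: list[bytes]) -> str:
    """Fold leaves into a single root hash (assumed scheme, not the attestor's)."""
    if not leaves:
        raise ValueError("need at least one leaf")
    level = [sha256(leaf) for leaf in leaves]
    while len(level) > 1:
        if len(level) % 2:
            # Assumption: an odd level is padded by repeating its last node.
            level.append(level[-1])
        level = [sha256(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
    return "sha256:" + level[0].hex()

# Hypothetical transcript entries, one per problem (n=8).
transcript = [f"problem-{i}: placeholder transcript entry".encode() for i in range(8)]
root = merkle_root(transcript)

# Changing any single entry yields a different root.
tampered = transcript.copy()
tampered[3] = b"problem-3: altered answer"
print(root != merkle_root(tampered))  # prints: True
```

Because the signature covers the commitment, a verifier who recomputes the root from the raw transcript and checks the Ed25519 signature against the attestor's public key can detect any edit to the transcript. The exact bytes that the attestor signs are not stated on the page.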
Dataset hash: sha256:05ef744f592cd2481092a6ecdecbccaf5e515f6ac2be7d5fc77ad85b8165f15c
Methodology hash: sha256:f65dba1e549ab81ea004be624791ae7b7b3e784648c0cb2ce84b8bf930bb0457
Merkle root: sha256:8450e2b4d565c0bc39dcb1f3bc972c49bfeb756b115593b21b3c8cf5fd1eccff
Attestor pubkey: cb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11
Signature: dac61c24204d49d4eaca4aa5249959a67d8cd9545ce7f1520d73a7c509018d8a0e611af1289cffbecf9382a0ee03bbf4d7b5a60998c1c74e0c9f1c60b1a9280f
Runner: benchlist-vercel-inline@1.0.0
Started: 2026-04-26T08:37:34.940Z
Finished: 2026-04-26T08:37:49.046Z
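Before attempting a cryptographic check, it can help to confirm that the fields above have the expected shapes. The sketch below assumes only standard sizes: 32 bytes for a SHA-256 digest, 32 bytes for an Ed25519 public key, and 64 bytes for an Ed25519 signature. The variable names and the helper `byte_length` are hypothetical. It does not verify the signature itself.

```python
from datetime import datetime

# Values copied from the run page.
DIGESTS = {
    "dataset": "sha256:05ef744f592cd2481092a6ecdecbccaf5e515f6ac2be7d5fc77ad85b8165f15c",
    "methodology": "sha256:f65dba1e549ab81ea004be624791ae7b7b3e784648c0cb2ce84b8bf930bb0457",
    "merkle_root": "sha256:8450e2b4d565c0bc39dcb1f3bc972c49bfeb756b115593b21b3c8cf5fd1eccff",
}
PUBKEY = "cb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11"
SIGNATURE = (
    "dac61c24204d49d4eaca4aa5249959a67d8cd9545ce7f1520d73a7c509018d8a"
    "0e611af1289cffbecf9382a0ee03bbf4d7b5a60998c1c74e0c9f1c60b1a9280f"
)

def byte_length(hex_text: str) -> int:
    """Decode a hex string (with an optional 'sha256:' prefix) and return its size in bytes."""
    return len(bytes.fromhex(hex_text.split(":")[-1]))

def parse_utc(stamp: str) -> datetime:
    # Replace the trailing 'Z' so that older Python versions can parse the timestamp.
    return datetime.fromisoformat(stamp.replace("Z", "+00:00"))

assert all(byte_length(d) == 32 for d in DIGESTS.values())  # SHA-256 digests
assert byte_length(PUBKEY) == 32      # Ed25519 public key
assert byte_length(SIGNATURE) == 64   # Ed25519 signature

duration = (parse_utc("2026-04-26T08:37:49.046Z") - parse_utc("2026-04-26T08:37:34.940Z")).total_seconds()
print(f"run duration: {duration:.3f} s")  # prints: run duration: 14.106 s
```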
Not yet anchored on-chain. Anyone can anchor this run for roughly $0.01–$1.40 in gas at /anchor?run=run-920a62aadab5