Attested Run id run-25725c42c101

claude-opus-4.7

on commonsenseqa · anthropic-claude · n=8 ⚠ statistically thin · Sun, 26 Apr 2026 07:54:59 GMT
100.0
±16.2 · 95% CI [67.6, 100.0]
claude-opus-4.7 scored 100.0 on commonsenseqa across 8 problems. The transcript is committed to Merkle root sha256:8d5a460292d2522… and signed by attestor benchlist-vercel-inline-0 with Ed25519 signature 68b3dcb54df4166101c9cc…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:729b5c0850ac5be6b8cfbedf4d36938249bb7c0d9e9c980260037391414dd520
Methodology hashsha256:8d0b3e04740ec4f11b5e3eebe6601688d47de4830e84c15a2da6c3925212fadf
Merkle rootsha256:8d5a460292d25224cbe99b28252b25791f8af0b520e04684a2ef4f97a6c024fd
Attestor pubkeycb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11
Signature68b3dcb54df4166101c9cc6fc3f2f1e1b9b616582c3f6d24e67cf6b561a7cccfa958104cfcbc2e837750a89f10028acbba10312fcbb84bd70e2fae4235d84d0b
Runnerbenchlist-vercel-inline@1.0.0
Started2026-04-26T07:43:39.405Z
Finished2026-04-26T07:54:59.879Z
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-25725c42c101
Best per benchmark → commonsenseqa guide → Anchor on-chain → Dispute