Attested Run id run-176c3d2e6cac

claude-opus-4.7

on hellaswag · anthropic-claude · n=8 ⚠ statistically thin · Sun, 26 Apr 2026 07:56:26 GMT
100.0
±16.2 · 95% CI [67.6, 100.0]
claude-opus-4.7 scored 100.0 on hellaswag across 8 problems. The transcript is committed to Merkle root sha256:f32a75a0cb97157… and signed by attestor benchlist-vercel-inline-0 with Ed25519 signature 960438a748755c317b1ed7…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:b967f14e9705f2c1512bfecbc280340660ac60811aca2cd09789d654cb44b3ee
Methodology hashsha256:2725c767f087367a0bbb3d937db51573191931b9f2e7a805d74297244330c18f
Merkle rootsha256:f32a75a0cb97157f63f48ccd2f6c5f900702a4c4da33c2efdd637cb9a8197b07
Attestor pubkeycb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11
Signature960438a748755c317b1ed7b1d3fa44908bc1acde078d0ca47f13c1dcd9e4acae49fe9dcfa46f01625ff1ee81e454221f35cec62b6d00b7ca09ecc390c09fcf01
Runnerbenchlist-vercel-inline@1.0.0
Started2026-04-26T07:43:42.084Z
Finished2026-04-26T07:56:26.728Z
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-176c3d2e6cac
Best per benchmark → hellaswag guide → Anchor on-chain → Dispute