Attested Run id run-a8a177251ee6

claude-opus-4.7

on truthfulqa · anthropic-claude · n=8 ⚠ statistically thin · Sun, 26 Apr 2026 08:22:24 GMT
100.0
±16.2 · 95% CI [67.6, 100.0]
claude-opus-4.7 scored 100.0 on truthfulqa across 8 problems. The transcript is committed to Merkle root sha256:5d6d13182af2ab6… and signed by attestor benchlist-vercel-inline-0 with Ed25519 signature 36ee32c3d78de536c6fc4e…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:1d3a9406678cb49569834ab0a185eb50e97f8072798c1f6afd73f76a77d5f75d
Methodology hashsha256:7f598216d03d1e165c16eba8a94bcf4814bc61513eee4a9620f97110bed29d31
Merkle rootsha256:5d6d13182af2ab69d972269d50963f646ad705ff06fe52ddfd04e8b5935bd5a5
Attestor pubkeycb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11
Signature36ee32c3d78de536c6fc4ee5303e28fbe5e11789892aaf5384dc689f01dd95d4aa575159bc945e5fd42821a5d36dac22b27e1aa59a9a3c1c3dc662744db89a00
Runnerbenchlist-vercel-inline@1.0.0
Started2026-04-26T08:21:33.699Z
Finished2026-04-26T08:22:24.333Z
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-a8a177251ee6
Best per benchmark → truthfulqa guide → Anchor on-chain → Dispute