Attested Run id run-038bc5b6052e

claude-sonnet-4.5

on truthfulqa · anthropic-claude · n=8 ⚠ statistically thin · Sun, 26 Apr 2026 08:21:43 GMT
Score: 87.5 ±22.4 · 95% CI [52.9, 97.8]
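The reported interval is consistent with a Wilson score interval at z = 1.96, although the page does not name the method it uses. The sketch below reproduces the figures under two assumptions not stated in the source: that each problem is graded pass/fail, and that a score of 87.5 on 8 problems therefore means 7 correct answers. The function name `wilson_interval` is illustrative.

```python
import math


def wilson_interval(successes: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion, returned as fractions."""
    if n <= 0:
        raise ValueError("n must be positive")
    p_hat = successes / n
    denom = 1 + z**2 / n
    # The interval is centred on a value pulled toward 0.5, not on p_hat itself.
    center = (p_hat + z**2 / (2 * n)) / denom
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n + z**2 / (4 * n**2)) / denom
    return center - half_width, center + half_width


# Assumption: 87.5 on n=8 corresponds to 7 of 8 problems answered correctly.
low, high = wilson_interval(successes=7, n=8)
print(f"95% CI [{low * 100:.1f}, {high * 100:.1f}]")   # 95% CI [52.9, 97.8]
print(f"half-width ±{(high - low) / 2 * 100:.1f}")      # half-width ±22.4
```

Because the Wilson interval is not centred on the observed score, the ±22.4 figure is the half-width of the interval and not a symmetric margin around 87.5. With n=8 the interval spans about 45 points, which is what the "statistically thin" warning reflects.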
claude-sonnet-4.5 scored 87.5 on truthfulqa across 8 problems. The transcript is committed to Merkle root sha256:c2c9878c5478506… and signed by attestor benchlist-vercel-inline-0 with Ed25519 signature c742ec1c3b40d74a359a68…. The signature can be verified client-side in the browser, with no server round-trip required.
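As a rough illustration of what committing a transcript to a Merkle root means, the sketch below hashes a list of transcript entries into a single SHA-256 root. The page does not document how leaves are serialized or how nodes are combined, so everything here is an assumption: the leaf and node prefixes (0x00 and 0x01) are borrowed from RFC 6962, an unpaired node is promoted unchanged, and the sample entries are hypothetical. It will not reproduce the root shown on this page.

```python
import hashlib


def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def merkle_root(leaves: list[bytes]) -> str:
    """Fold leaves pairwise into one SHA-256 root (assumed scheme, see lead-in)."""
    if not leaves:
        raise ValueError("at least one leaf is required")
    # Prefix leaves with 0x00 and interior nodes with 0x01 so that a leaf
    # can never be mistaken for an interior node.
    level = [sha256(b"\x00" + leaf) for leaf in leaves]
    while len(level) > 1:
        next_level = []
        for i in range(0, len(level), 2):
            if i + 1 < len(level):
                next_level.append(sha256(b"\x01" + level[i] + level[i + 1]))
            else:
                next_level.append(level[i])  # unpaired node moves up unchanged
        level = next_level
    return "sha256:" + level[0].hex()


# Hypothetical transcript entries, one per benchmark problem.
transcript = [f"problem-{i}: prompt, response, grade".encode() for i in range(8)]
root = merkle_root(transcript)
print(root)

# Changing any single entry produces a different root.
tampered = transcript.copy()
tampered[3] = b"problem-3: edited response"
print(merkle_root(tampered) != root)
```

The point of the construction is that the attestor only has to sign the 32-byte root: any later change to a transcript entry changes the root and invalidates the signature.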
Dataset hash: sha256:1d3a9406678cb49569834ab0a185eb50e97f8072798c1f6afd73f76a77d5f75d
Methodology hash: sha256:7f598216d03d1e165c16eba8a94bcf4814bc61513eee4a9620f97110bed29d31
Merkle root: sha256:c2c9878c54785069313b476ac502d466397a08f4f0ae155dcb3da6a6e3503a01
Attestor pubkey: cb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11
Signature: c742ec1c3b40d74a359a68b9aaf1ef85ac805fd0b0143274dcbf06ff9b170022ac4083be149ac3fb44b8a39076daf5045dc4d0cfc0898d26f15f737fb721830d
Runner: benchlist-vercel-inline@1.0.0
Started: 2026-04-26T08:21:30.639Z
Finished: 2026-04-26T08:21:43.570Z
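Before attempting a real signature check, it can help to confirm that the fields above are well-formed. The sketch below copies the values from this page and checks that each SHA-256 digest decodes to 32 bytes, that the public key is 32 bytes and the signature 64 bytes (the sizes Ed25519 uses), and computes the run duration from the two timestamps. The dictionary keys and the helper `byte_length` are illustrative names. This is only a format check: verifying the signature itself requires the exact bytes that were signed, which this page does not specify.

```python
from datetime import datetime

# Values copied from the attestation fields; the key names are illustrative.
fields = {
    "dataset_hash": "sha256:1d3a9406678cb49569834ab0a185eb50e97f8072798c1f6afd73f76a77d5f75d",
    "methodology_hash": "sha256:7f598216d03d1e165c16eba8a94bcf4814bc61513eee4a9620f97110bed29d31",
    "merkle_root": "sha256:c2c9878c54785069313b476ac502d466397a08f4f0ae155dcb3da6a6e3503a01",
    "attestor_pubkey": "cb6e95d0f7b402e254f491b57767df3a3a93ae92f1faee3a02aa52e728f5cd11",
    "signature": (
        "c742ec1c3b40d74a359a68b9aaf1ef85ac805fd0b0143274dcbf06ff9b170022"
        "ac4083be149ac3fb44b8a39076daf5045dc4d0cfc0898d26f15f737fb721830d"
    ),
}

# SHA-256 digests and Ed25519 public keys are 32 bytes; Ed25519 signatures are 64.
expected_bytes = {
    "dataset_hash": 32,
    "methodology_hash": 32,
    "merkle_root": 32,
    "attestor_pubkey": 32,
    "signature": 64,
}


def byte_length(value: str) -> int:
    """Decode a hex string (with optional 'sha256:' prefix) and return its length."""
    hex_part = value.removeprefix("sha256:")
    return len(bytes.fromhex(hex_part))  # raises ValueError on non-hex input


for name, want in expected_bytes.items():
    got = byte_length(fields[name])
    print(f"{name}: {got} bytes ({'ok' if got == want else 'unexpected'})")

# Replace the trailing 'Z' so fromisoformat also works on Python versions before 3.11.
started = datetime.fromisoformat("2026-04-26T08:21:30.639Z".replace("Z", "+00:00"))
finished = datetime.fromisoformat("2026-04-26T08:21:43.570Z".replace("Z", "+00:00"))
print(f"duration: {(finished - started).total_seconds():.3f} s")  # duration: 12.931 s
```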
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-038bc5b6052e