Local Run id run-local-3e2f640576ac

glm-4.7-flash-30b-q4km

on commonsenseqa · openrouter · n=3 ⚠ statistically thin · Sun, 26 Apr 2026 02:18:56 GMT
33.3
±36.5 · 95% CI [6.1, 79.2]
glm-4.7-flash-30b-q4km scored 33.3 on commonsenseqa across 3 problems. The transcript is committed to Merkle root sha256:71325df3f0cfbbc… and signed by attestor benchlist-local-ollama with Ed25519 signature ca0e81f68e7342c62fb2a2…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:729b5c0850ac5be6b8cfbedf4d36938249bb7c0d9e9c980260037391414dd520
Methodology hashsha256:afaa58378ca68cf9d5d85a75f5d168088888466b4b067772f67d22c7a05da188
Merkle rootsha256:71325df3f0cfbbc112c380c4ffae30bd99bc5c144235a9dba5c4e4858fd6e059
Attestor pubkeyf82b412efec2f9bd732efd7786568ad1dde8b788c79bdba2134be01f68e8ff79
Signatureca0e81f68e7342c62fb2a2c34677f52d803b8f46e7900ef4d141bec43afc7e688302fa1e46123191bffbb8fbb055521c99a8b4011333dbebddfa679edc14ce00
Runnerbenchlist-local-ollama@1.0.0
Started2026-04-26T02:18:49Z
Finished2026-04-26T02:18:56Z
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-local-3e2f640576ac
Best per benchmark → commonsenseqa guide → Anchor on-chain → Dispute