Local Run id run-local-ff9bbc6f5000

mistral-7b-q4km

on commonsenseqa · openrouter · n=50 · Sun, 26 Apr 2026 07:44:36 GMT
56.0
±13.3 · 95% CI [42.3, 68.8]
mistral-7b-q4km scored 56.0 on commonsenseqa across 50 problems. The transcript is committed to Merkle root sha256:8e65f134127f323… and signed by attestor benchlist-local-ollama with Ed25519 signature 21c06df0fb7b4bcdfa5ce9…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:657c0ecfad0bd4dcbf062db3be6475df54a6d658c93b0493d4e0d3d86c4cb5bf
Methodology hashsha256:11c314e72c2b767f36d059911da85f213c8fa50958bc5b4e94ae94f7fb36dd77
Merkle rootsha256:8e65f134127f3238352173c64a290c1ee53509baa064c284671e173f09822132
Attestor pubkeyf82b412efec2f9bd732efd7786568ad1dde8b788c79bdba2134be01f68e8ff79
Signature21c06df0fb7b4bcdfa5ce9466eac73f6448d77fb5994d2f573cdcc6e692e2df0ba5711c7f970610671b7843a8f54cf1800d350dbe3e2dd2a2e20ea1a6446d60f
Runnerbenchlist-local-ollama@1.0.0
Started2026-04-26T07:42:49Z
Finished2026-04-26T07:44:36Z
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-local-ff9bbc6f5000
Best per benchmark → commonsenseqa guide → Anchor on-chain → Dispute