Local Run id run-local-362f4618430d

llama3-8b-q40

on commonsenseqa · openrouter · n=50 · Sun, 26 Apr 2026 08:14:35 GMT
62.0
±13.0 · 95% CI [48.2, 74.1]
llama3-8b-q40 scored 62.0 on commonsenseqa across 50 problems. The transcript is committed to Merkle root sha256:e60a0d67c84be6f… and signed by attestor benchlist-local-ollama with Ed25519 signature 98ea853ac4c2010a3fb0c8…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:657c0ecfad0bd4dcbf062db3be6475df54a6d658c93b0493d4e0d3d86c4cb5bf
Methodology hashsha256:11c314e72c2b767f36d059911da85f213c8fa50958bc5b4e94ae94f7fb36dd77
Merkle rootsha256:e60a0d67c84be6f98ad1974db83cb5995b612c0df334949e69d37d7ba86d8283
Attestor pubkeyf82b412efec2f9bd732efd7786568ad1dde8b788c79bdba2134be01f68e8ff79
Signature98ea853ac4c2010a3fb0c887355a9fc5cfb017b39647ab498a526a2ef36617263c9b3ae6800993b447c5099f24c11380a325af728cd7a220bdc8f6e281377b04
Runnerbenchlist-local-ollama@1.0.0
Started2026-04-26T08:12:46Z
Finished2026-04-26T08:14:35Z
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-local-362f4618430d
Best per benchmark → commonsenseqa guide → Anchor on-chain → Dispute