Local Run id run-local-962d5fda31c9

llama3-8b-q40

on arc-challenge · openrouter · n=50 · Sun, 26 Apr 2026 08:10:52 GMT
84.0
±10.1 · 95% CI [71.5, 91.7]
llama3-8b-q40 scored 84.0 on arc-challenge across 50 problems. The transcript is committed to Merkle root sha256:fa750f816a123f8… and signed by attestor benchlist-local-ollama with Ed25519 signature aef49805096c89afe0f6a7…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:144e9a13fb369f31007fffdcf4d7d55692677b409c8fa7b7dec4328c81a55752
Methodology hashsha256:8e84e6ffec11c082a286373b8b306600732cdf99b514079bfc0754fe4cd7a7c5
Merkle rootsha256:fa750f816a123f8520c0fecc628ea79e02019e9356f344e469c471ac245824d4
Attestor pubkeyf82b412efec2f9bd732efd7786568ad1dde8b788c79bdba2134be01f68e8ff79
Signatureaef49805096c89afe0f6a7e2ee6cd07b0520c2d1ed7bff999445ec63a9a28475bf1ff2b5b3623eeef56419db730493f324b6e31b4cddc94e735e55e42eaf2a0a
Runnerbenchlist-local-ollama@1.0.0
Started2026-04-26T08:09:03Z
Finished2026-04-26T08:10:52Z
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-local-962d5fda31c9
Best per benchmark → arc-challenge guide → Anchor on-chain → Dispute