Attested Run id run-mbpp-7f87239d91

meta-llama/llama-3.3-70b

on mbpp · openrouter · n=20 ⚠ statistically thin · Fri, 24 Apr 2026 17:42:43 GMT
76.4
±17.6 · 95% CI [54.6, 89.7]
meta-llama/llama-3.3-70b scored 76.4 on mbpp across 20 problems. The transcript is committed to Merkle root sha256:d92eb70a6c781b7… and signed by attestor benchlist-runner-0 with Ed25519 signature 2838a1bd26d235b607db42…. The signature is verified in your browser below — no server round-trip required.
Raw JSON ↗ Replay for $0.50
Dataset hashsha256:fa4746e4dbf616502400a2547bac67774b519b8dbd314749181832b59d939c23
Methodology hashsha256:3d09dd53dce94bc6d9727753f14950bbf1fd7ee93413ab117c1ff0d3898d3ac9
Merkle rootsha256:d92eb70a6c781b75d6fa9d57c0eb3ae018e8c0ac906a2be8ffa32f809cd80cc3
Attestor pubkeyf38712fae5f11a2fc2fe3f7541264f04cd90974affdf1cce05163ecdaf35d457
Signature2838a1bd26d235b607db42736c2db98af58d3999c69fa05ba366c20203a6279fb85e208cc531c86be223ca45d69f60a08846a36b56bf3668c25b5e1403f38005
Runnerbenchlist-runner@1.0.0
Started2026-04-24T17:40:43Z
Finished2026-04-24T17:42:43Z
Not yet anchored on-chain. Anyone can anchor for ~$0.01–$1.40 in gas → /anchor?run=run-mbpp-7f87239d91
Best per benchmark → mbpp guide → Anchor on-chain → Dispute