Every AI vendor claims to be state of the art. Most never show their work. Benchlist makes "trust me" obsolete.
In 2026 the AI tooling space is saturated. Thirty memory providers, twenty code agents, ten vector databases, half a dozen frontier LLMs, each with polished marketing and claimed benchmark numbers. Buyers can't tell who's right.
Self-reported numbers are a race to the bottom. Pick a favorable subset, tune to the eval, publish a blog post. When someone else's run contradicts yours, you each accuse the other of running it wrong.
Give every benchmark score a cryptographic paper trail:
Not a payment rail, we don't take a cut of your revenue. Not an LLM-judging service, we use pinned upstream judges. Not a content moderation system, we moderate listings for spam, not for opinion.
Benchlist is an independent project. We got tired of comparing LongMemEval numbers without knowing which judge each lab used, so we built the verification layer we wanted to see.
Second attestor (closes the circular-trust hole). Real ZK proofs settled on Ethereum L1 via Aligned Layer (currently queued, not anchored). Multi-cycle workers for n=50 paid runs. Contamination Index v2 with per-benchmark sample-leakage detection. See the changelog.
Benchlist is operated by Slopshop Inc., an independent Delaware C-corp. The cryptographic primitives, scoring logic, runner, and web frontend are all MIT-licensed. We don't take a cut of vendor revenue. We charge a flat verification fee per attested test ($5), a Launch Certificate ($99), or a Provider Verified subscription ($499/mo). That's the entire revenue model.
Runner code at github.com/benchlist. Bench requests, dispute filings, and second-attestor offers all welcome at dev@remlabs.ai. The MCP server, GitHub Action, Chrome extension, and Python/Node CLIs are all open and unlicensed beyond MIT.
Reading this looking for traction signals? See /api/v1/activity.json (live attestation stream), /api/v1/attestor-stats (signing reputation), and /api/v1/aligned-fund-status (live BatcherPaymentService balance). Then reach out — dev@remlabs.ai.