Show overall cross-category ranking
Combined leaderboard with the penalty rule (missing/failed scripts get the slowest competitor's time). Useful for a single-number view, but cross-category numbers are not directly comparable — see
Execution Model below.
Geometric Mean visualised
Execution Model why categories matter
VMs differ in how they execute UPLC. A UPLC-level JIT compiler is doing fundamentally different work than a CEK interpreter, even when both happen to run on a JIT-compiled host runtime like JVM or .NET. Within-category numbers are like-for-like; cross-category numbers are not directly comparable.
Method
flat-decode + CEK evaluate on 89 Plutus scripts. Each VM uses its native benchmark framework. Sequential execution, single Docker container. Geometric means within each category are computed only from scripts where every VM in that category produced a successful result — that is the only fair like-for-like comparison.