Run Archive
All benchmark runs, newest first. Download raw data for any run.
| Date | Scenario Set | Models | Scenarios | Status | Data |
|---|---|---|---|---|---|
| 2026-05-11latest | 2.1 | 23 | 58 | complete | scored.jsonreport.md |
All benchmark runs, newest first. Download raw data for any run.
| Date | Scenario Set | Models | Scenarios | Status | Data |
|---|---|---|---|---|---|
| 2026-05-11latest | 2.1 | 23 | 58 | complete | scored.jsonreport.md |