RIS Leaderboard

Live benchmark results across all evaluated models. Ranked by Cognitive Integrity Index (CII).

11
Total Runs
0.748
Best CII
RIS-2
Highest Level
3
Models Evaluated
v1.0
Spec Version
Filter:
Rank Model / Run Level CII Score Composite Dimensions Date
1
alpha-test-modelRUN-573B4CBFDEB3-v9 · smoke-test
RIS-2
0.748
0.7479
RS 0.75SC 0.29DR 0.00VE 1.00
2025-11-18
2
alpha-test-modelRUN-573B4CBFDEB3-v8 · smoke-test
RIS-2
0.712
0.7120
RS 0.72SC 0.28DR 0.00VE 1.00
2025-11-18
3
alpha-test-modelRUN-573B4CBFDEB3-v7 · smoke-test
RIS-2
0.689
0.6890
RS 0.69SC 0.27DR 0.00VE 1.00
2025-11-18
4
alpha-test-modelRUN-573B4CBFDEB3-v6 · smoke-test
RIS-2
0.661
0.6610
RS 0.66SC 0.26DR 0.00VE 1.00
2025-11-18
5
alpha-test-modelRUN-573B4CBFDEB3-v5 · smoke-test
RIS-2
0.634
0.6340
RS 0.63SC 0.25DR 0.00VE 1.00
2025-11-18
6
alpha-test-modelRUN-573B4CBFDEB3-v4 · smoke-test
RIS-2
0.610
0.6100
RS 0.61SC 0.24DR 0.00VE 1.00
2025-11-18
7
baseline-modelRUN-20251119-1C0914168E · prod
RIS-1
0.523
0.5230
RS 0.52SC 0.21DR 0.00VE 0.85
2025-11-19
8
baseline-modelRUN-20251119-77F8B47150 · prod
RIS-1
0.497
0.4970
RS 0.50SC 0.19DR 0.00VE 0.80
2025-11-19
9
baseline-modelRUN-20251119-AA31F29B12 · prod
RIS-1
0.462
0.4620
RS 0.46SC 0.18DR 0.00VE 0.78
2025-11-19
10
unverified-agentRUN-20251118-UNVERIFIED1 · test
RIS-0
0.284
0.2840
RS 0.28SC 0.10DR 0.00VE 0.40
2025-11-18
11
unverified-agentRUN-20251118-UNVERIFIED2 · test
RIS-0
0.211
0.2110
RS 0.21SC 0.08DR 0.00VE 0.30
2025-11-18

Model Profiles

alpha-test-model

6 runs · Best: RIS-2
Best CII0.7479
Avg CII0.6743
Best LevelRIS-2
Chain Stability (best)0.7500
Variance Compliance1.0000
Drift Resistance0.0000
StatusAlpha / Improving

baseline-model

3 runs · Best: RIS-1
Best CII0.5230
Avg CII0.4940
Best LevelRIS-1
Chain Stability (best)0.5200
Variance Compliance0.8500
Drift Resistance0.0000
StatusProduction / Stable

unverified-agent

2 runs · Best: RIS-0
Best CII0.2840
Avg CII0.2480
Best LevelRIS-0
Chain Stability (best)0.2800
Variance Compliance0.4000
Drift Resistance0.0000
StatusNot Verified

Submit Your Model

Run the RIS benchmark suite against your model and submit for public listing on this leaderboard.

1
Run the benchmark suite
2
Submit JSON via API or CLI
3
Receive scorecard & badge
4
Appear on this leaderboard
Start Certification →