Judge Prediction

Predict prompt scores using existing optimization data — without spending tokens on a new run

Step 1 — Select Optimization History

#47 — Optimization 2026-04-12 12:57
4 prompts · Apr 12, 2026 12:57
8.7
#46 — Optimization 2026-04-12 12:37
4 prompts · Apr 12, 2026 12:37
7.5
#43 — Optimization 2026-04-12 11:54
4 prompts · Apr 12, 2026 11:54
8.5
#42 — Optimization 2026-04-12 11:35
4 prompts · Apr 12, 2026 11:35
8.3
#41 — Optimization 2026-04-12 11:24
4 prompts · Apr 12, 2026 11:24
7.6
#39 — Optimization 2026-04-12 11:06
4 prompts · Apr 12, 2026 11:06
8.4
#38 — Optimization 2026-04-12 10:31
4 prompts · Apr 12, 2026 10:31
8.4
#37 — Optimization 2026-04-12 09:41
4 prompts · Apr 12, 2026 09:41
8.2
#36 — Optimization 2026-04-12 09:16
4 prompts · Apr 12, 2026 09:16
8.1
#35 — Optimization 2026-04-11 08:55
4 prompts · Apr 11, 2026 08:55
8.2
#34 — Optimization 2026-04-10 16:01
4 prompts · Apr 10, 2026 16:01
8.0
#33 — Optimization 2026-04-10 14:54
4 prompts · Apr 10, 2026 14:54
8.7
#32 — Optimization 2026-04-10 06:35
4 prompts · Apr 10, 2026 06:35
8.5
#31 — Optimization 2026-04-09 20:11
4 prompts · Apr 09, 2026 20:11
8.3
#30 — Optimization 2026-04-09 18:04
4 prompts · Apr 09, 2026 18:04
8.1
#29 — Optimization 2026-04-09 16:25
4 prompts · Apr 09, 2026 16:25
8.1
#28 — Optimization 2026-04-09 16:10
4 prompts · Apr 09, 2026 16:10
8.4
#27 — Optimization 2026-04-09 16:03
4 prompts · Apr 09, 2026 16:03
8.8
#26 — Optimization 2026-04-09 15:22
4 prompts · Apr 09, 2026 15:22
8.3
#22 — Optimization 2026-04-09 13:55
4 prompts · Apr 09, 2026 13:55
8.6
#20 — Optimization 2026-04-09 13:32
4 prompts · Apr 09, 2026 13:32
8.2
#19 — Optimization 2026-04-09 11:40
4 prompts · Apr 09, 2026 11:40
8.3
#17 — Optimization 2026-04-09 10:42
4 prompts · Apr 09, 2026 10:42
8.1
#16 — Optimization 2026-04-09 10:26
4 prompts · Apr 09, 2026 10:26
7.6
#15 — Optimization 2026-04-09 10:15
4 prompts · Apr 09, 2026 10:15
8.0
#12 — Optimization 2026-04-09 07:26
4 prompts · Apr 09, 2026 07:26
8.6
#9 — Optimization 2026-04-09 06:59
4 prompts · Apr 09, 2026 06:59
#8 — Optimization 2026-04-09 06:31
4 prompts · Apr 09, 2026 06:31

Prediction Results

Select an optimization and run prediction

Prediction Template Ranking

Ranked by average rank-match accuracy across all runs

#TemplateRunsAvg AccuracyAvg Rank Match
1 Specificity and Outcome Predictability Judge 2 62.5% 2.5/4 correct
2 Clarity and Communicative Effectiveness Judge 4 56.3% 2.3/4 correct
3 Structural Completeness Judge 1 25.0% 1.0/4 correct

Past Predictions

Optimization Template Accuracy Date
Optimization 2026-04-12 09:16 Clarity and Communicative Effectiveness Judge 100.0% Apr 12, 13:11
Optimization 2026-04-12 11:35 Clarity and Communicative Effectiveness Judge 50.0% Apr 12, 13:09
Optimization 2026-04-12 12:57 Structural Completeness Judge 25.0% Apr 12, 13:08
Optimization 2026-04-12 12:57 Clarity and Communicative Effectiveness Judge 25.0% Apr 12, 13:07
Optimization 2026-04-12 12:57 Specificity and Outcome Predictability Judge 25.0% Apr 12, 13:07
Optimization 2026-04-09 16:10 Specificity and Outcome Predictability Judge 100.0% Apr 10, 22:07
Optimization 2026-04-10 14:54 Clarity and Communicative Effectiveness Judge 50.0% Apr 10, 15:52
# N/A - Apr 10, 14:49
# N/A - Apr 10, 14:44
# N/A - Apr 10, 14:28