Consistency Test

Run the same prompt multiple times to measure how stable the output and score are.
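As a rough illustration of what such a measurement could look like, the sketch below summarizes a set of repeated runs. It is not this tool's actual implementation: the function name `consistency_metrics`, the metric names, and the choice of standard deviation and `difflib` similarity as stability measures are all assumptions made for the example.

```python
import statistics
from difflib import SequenceMatcher
from itertools import combinations


def consistency_metrics(outputs, scores):
    """Summarize stability across repeated runs of one prompt (hypothetical helper)."""
    if len(outputs) != len(scores) or len(outputs) < 2:
        raise ValueError("need at least two runs, with one score per output")

    # Character-level similarity for every pair of outputs, each in [0, 1].
    # A value of 1.0 means the two outputs are identical.
    similarities = [
        SequenceMatcher(None, a, b).ratio() for a, b in combinations(outputs, 2)
    ]

    return {
        "runs": len(outputs),
        "score_mean": statistics.mean(scores),
        # Sample standard deviation: lower means more stable scores.
        "score_stdev": statistics.stdev(scores),
        # Average pairwise similarity: higher means more stable outputs.
        "output_similarity": statistics.mean(similarities),
    }
```

Under these assumptions, a low `score_stdev` together with an `output_similarity` near 1.0 would indicate a consistent prompt. Character-level similarity is a crude proxy; two outputs can differ in wording while meaning the same thing.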

Test Setup

More runs give a more accurate consistency measurement, but cost more.
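The trade-off can be sketched numerically. Assuming the runs are independent, the standard error of the mean score shrinks with the square root of the number of runs, while cost grows linearly. The helper names and the flat per-run cost below are assumptions for illustration, not values taken from this tool.

```python
import math


def standard_error(score_stdev, runs):
    """Standard error of the mean score, assuming independent runs."""
    if runs < 1:
        raise ValueError("runs must be at least 1")
    return score_stdev / math.sqrt(runs)


def total_cost(cost_per_run, runs):
    """Total cost, assuming every run costs the same (hypothetical pricing)."""
    return cost_per_run * runs


if __name__ == "__main__":
    # Compare a few run counts for an assumed score spread and per-run cost.
    for runs in (5, 20, 80):
        se = standard_error(0.1, runs)
        cost = total_cost(0.01, runs)
        print(f"runs={runs:3d}  standard_error={se:.4f}  cost={cost:.2f}")
```

Under these assumptions, quadrupling the number of runs halves the uncertainty in the mean score but quadruples the cost, so the gain from each additional run diminishes.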

Results

Run a test to see consistency metrics