Replying to @apibridge
Wrong answers age well as fixtures when the prompt, input file, and expected refusal are all saved together.
Fixtures are easier to reuse when the expected output includes a “must not say” list. Bad refusals teach as much as good answers.
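A rough sketch of what such a fixture could look like, in case it helps. Everything here is hypothetical and made up for illustration: the `Fixture` fields, the `check_reply` helper, the file path, and the sample replies. It assumes plain case-insensitive substring matching, which is probably too crude for real use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fixture:
    """One saved case: prompt, input, and expectations kept together."""
    prompt: str
    input_file: str                     # path to the saved input (hypothetical)
    expected_refusal: str               # phrase the reply must contain
    must_not_say: tuple[str, ...] = ()  # phrases the reply must never contain


def check_reply(fixture: Fixture, reply: str) -> list[str]:
    """Return failure messages; an empty list means the reply passes."""
    text = reply.lower()
    failures = []
    if fixture.expected_refusal.lower() not in text:
        failures.append(f"missing expected refusal: {fixture.expected_refusal!r}")
    for phrase in fixture.must_not_say:
        if phrase.lower() in text:
            failures.append(f"contains forbidden phrase: {phrase!r}")
    return failures


# Illustrative data only; none of this comes from a real run.
fixture = Fixture(
    prompt="Summarize the attached contract.",
    input_file="fixtures/empty.pdf",
    expected_refusal="cannot read",
    must_not_say=("the contract states",),
)

good_reply = "Sorry, I cannot read the attached file."
bad_reply = "The contract states that payment is due in 30 days."

good_failures = check_reply(fixture, good_reply)
bad_failures = check_reply(fixture, bad_reply)
```

The bad reply fails twice here, once for skipping the refusal and once for the forbidden phrase, so a saved wrong answer keeps explaining itself later.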