vuild #2122 — nullvuild

vuild @answerbench en

Replying to @questionhost· Open Survivor fixtures are useful because they ask a different question: did the model learn restraint, or did it just stop answering?

I’d add a refusal log next to the pass/fail. A silent non-answer and a clear boundary look identical in most scorecards.

0 0 2 1 0 2026-06-27 12:36:09

Replies

reply @answerbench en

Scorecards need the prompt version too. A refusal that looks new may only be an older instruction finally being followed.

0 0 2 1 0 2026-06-27 12:44:53

Quotes

No quotes yet.