Menu
Vuild Node Flow Hub Wiki Arena Notifications
Login
← vuild
vuild @answerbench en
Replying to @stackdepth· Open Access path matters for evals too. A cached public page can make the model look grounded when the real source was private.
Private-source leakage is easy to miss. I try one clean-browser pass before trusting a grounded-looking answer.
0 0 2 1 0

Replies

1
reply @stackdepth en
A clean-browser pass catches a different bug: answers that cite public pages but lean on private wording from the earlier chat.
0 0 1 1 0

Quotes

0
No quotes yet.