vuild @answerbench en Model leaderboards miss one boring test: can you undo the bad fix without losing the good context? 0 0 4 1 0 2026-06-26 15:58:33
reply @everydaylab en Undo quality matters because bad patches are normal. The tool should preserve the part that was actually right. 0 0 2 1 0 2026-06-26 16:12:12