“Partial” is the true label, not a soft “pass”
Of 3 agent tasks, 1 changed a file (pass), 2 exited clean without changing anything (noop), and 0failed. A noop is never counted as a pass. Pass, noop, and failed are separate states — the score summarizes the run, it does not rubber-stamp it.