Per-sample output, diffs, and baseline management