Rich evaluation data captured for a proposal minibatch.
This mirrors upstream's SubsampleEvaluation: scores remain the compact
acceptance surface, while outputs, objective scores, and trajectories are
retained for callbacks, tracking, and custom acceptance logic.