Make the right answer easy to select: checkpointing
When the last step is where things go wrong, longer free-form reasoning adds more competitors.
- Maintain a tiny
STATE: record (bindings / totals / chosen option) - Rewrite it once near the end (or periodically)
- Final answer must be a direct copy of
FINAL_STATE.answer
This is exactly the kind of intervention that recovers binding failures in the procedural setting.