R&B-EnCoRe is a self-improvement pretraining cycle for embodied reasoning VLAs. Instead of relying on a fixed reasoning template, the model generates its own candidate reasoning, refines it by measuring how much each piece helps predict the correct action, and bootstraps a stronger policy.