Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment
CoVer-VLA introduces a contrastive verifier for vision-language-action alignment, demonstrating that scaling test-time verification yields larger gains than scaling policy pre-training.