RoboMonkey: Scaling Test-Time Sampling and Verification for Vision-Language-Action Models

Jacky Kwok Stanford University

Christopher Agia Stanford University

Rohan Sinha Stanford University

Matt Foutter Stanford University

Shulu Li UC Berkeley

Ion Stoica UC Berkeley

Azalia Mirhoseini Stanford University

Marco Pavone Stanford, NVIDIA

Preprint, 2025


A monkey with robotic arms