Overcoming Dynamics-Blindness: Training-Free Pace-and-Path Correction for VLA Models
Summary
This paper addresses a critical limitation in Vision-Language-Action (VLA) models: their inability to handle dynamic environments due to 'dynamics-blindness' during action chunk execution. The authors propose Pace-and-Path Correction (PPC), a training-free mathematical wrapper that corrects for object motion without retraining. PPC decomposes motion compensation into two orthogonal channels: pace compression (adjusting execution timing) and spatial path offsets (compensating for perpendicular drift). The method requires only external velocity signals and works with any existing VLA model. Evaluated on their new MOVEBENCH benchmark, PPC improves success rates by up to 28.8% in dynamic environments while preserving static performance.
Key findings
- Dynamics-blindness in VLA models stems from intra-chunk execution gaps, not inference latency - faster re-planning doesn't solve the fundamental problem
- Motion compensation can be mathematically decomposed into orthogonal pace and path channels with closed-form solutions requiring no learning
- PPC consistently improves all tested foundational VLA models (π0, π0.5, GR00T, SmolVLA) across uniform, accelerated, and irregular motion patterns
- Accelerated motion causes steeper performance degradation than faster uniform motion, revealing regime complexity matters more than raw speed
How to implement
- Wrap existing warehouse automation VLA models with PPC to handle conveyor belt picking without retraining - requires only adding depth camera velocity tracking to current systems
- Integrate PPC into autonomous kitchen robots to maintain manipulation performance while ingredients or dishes move during cooking tasks, using existing visual tracking pipelines
- Deploy PPC on manufacturing assembly lines where parts arrive on moving platforms - enables immediate improvement of current VLA-based robotic arms without model replacement