Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models Paper • 2603.15618 • Published 19 days ago • 21
Robobench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain Paper • 2510.17801 • Published Oct 20, 2025 • 2