RoboGround: Robotic Manipulation with Grounded Vision-Language Priors Paper • 2504.21530 • Published Apr 30
CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation Paper • 2506.19816 • Published Jun 24
GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation Paper • 2506.10966 • Published Jun 12
MM-ACT: Learn from Multimodal Parallel Generation to Act Paper • 2512.00975 • Published 26 days ago • 6
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy Paper • 2510.13778 • Published Oct 15 • 16