VINCIE: Unlocking In-context Image Editing from Video Paper • 2506.10941 • Published Jun 12, 2025 • 4
Article • Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot • 6 days ago • 18
ProEdit: Inversion-based Editing From Prompts Done Right Paper • 2512.22118 • Published 17 days ago • 17
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 14 days ago • 64
The Ultra-Scale Playbook 🌌 • Running • 3.64k • The ultimate guide to training LLM on large GPU Clusters
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models Paper • 2512.20557 • Published 20 days ago • 49
Article • SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data • Jun 3, 2025 • 307