PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models Paper • 2601.11087 • Published 5 days ago • 8
VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice Paper • 2601.05175 • Published 13 days ago • 32
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published 10 days ago • 202
VINCIE: Unlocking In-context Image Editing from Video Paper • 2506.10941 • Published Jun 12, 2025 • 4
view article Article Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot 15 days ago • 18
view article Article NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI 15 days ago • 58
ProEdit: Inversion-based Editing From Prompts Done Right Paper • 2512.22118 • Published 26 days ago • 17
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 23 days ago • 64
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models Paper • 2512.20557 • Published 29 days ago • 49
view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 Jun 3, 2025 • 311
view article Article Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚 Jul 10, 2024 • 92
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents Paper • 2507.04009 • Published Jul 5, 2025 • 51