OneThinker: All-in-one Reasoning Model for Image and Video Paper • 2512.03043 • Published Dec 2, 2025 • 32
SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization Paper • 2512.02631 • Published Dec 2, 2025 • 8
AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition Paper • 2512.03794 • Published Dec 3, 2025 • 3