Reasoning Shift: How Context Silently Shortens LLM Reasoning Paper • 2604.01161 • Published 1 day ago • 22 • 3
QuitoBench: A High-Quality Open Time Series Forecasting Benchmark Paper • 2603.26017 • Published 7 days ago • 26 • 3
Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification Paper • 2603.26648 • Published 7 days ago • 33 • 3
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners? Paper • 2603.25823 • Published 7 days ago • 36 • 3
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome Paper • 2603.28407 • Published 4 days ago • 53 • 5
ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers Paper • 2603.24414 • Published 9 days ago • 167 • 4
Lingshu-Cell: A generative cellular world model for transcriptome modeling toward virtual cells Paper • 2603.25240 • Published 8 days ago • 73 • 4
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published 4 days ago • 299 • 4
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 5 days ago • 125 • 4
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 14 days ago • 304 • 7
Gen-Searcher: Reinforcing Agentic Search for Image Generation Paper • 2603.28767 • Published 4 days ago • 51 • 3
TAPS: Task Aware Proposal Distributions for Speculative Sampling Paper • 2603.27027 • Published 6 days ago • 137 • 4
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Paper • 2603.25158 • Published 8 days ago • 45 • 14
PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference Paper • 2603.25730 • Published 8 days ago • 47 • 3
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling Paper • 2603.25746 • Published 8 days ago • 150 • 6
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models Paper • 2603.25716 • Published 8 days ago • 147 • 4
Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs? Paper • 2603.24472 • Published 9 days ago • 47 • 7
UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience Paper • 2603.24533 • Published 9 days ago • 45 • 4