SAGE: Scalable Agentic 3D Scene Generation for Embodied AI Paper • 2602.10116 • Published Feb 10 • 13
💧 LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 23 items • Updated about 11 hours ago • 112
Semi-Supervised Reward Modeling via Iterative Self-Training Paper • 2409.06903 • Published Sep 10, 2024 • 1
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks Paper • 2601.02439 • Published Jan 5 • 18
MergeBench Collection A Benchmark for Merging Domain-Specialized LLMs • 2 items • Updated Oct 21, 2025 • 1
MergeBench: A Benchmark for Merging Domain-Specialized LLMs Paper • 2505.10833 • Published May 16, 2025 • 2
O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL Paper • 2601.03743 • Published Jan 7 • 3
Draft-Conditioned Constrained Decoding for Structured Generation in LLMs Paper • 2603.03305 • Published Feb 8 • 1
RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation Paper • 2603.25804 • Published 5 days ago • 21
UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing Paper • 2602.02437 • Published Feb 2 • 80
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing Paper • 2602.12205 • Published Feb 12 • 81
PRIMO R1 Collection Official release of PRIMO R1, a 7B video MLLM for robotic process reasoning featuring RL-optimized models, SFT/RL datasets, and a cross-domain benchmark • 3 items • Updated 14 days ago • 4
From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation Paper • 2603.15600 • Published 15 days ago • 7
GenMask: Adapting DiT for Segmentation via Direct Mask Paper • 2603.23906 • Published 7 days ago • 6
Clipping-Free Policy Optimization for Large Language Models Paper • 2601.22801 • Published Jan 30 • 3
Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers Paper • 2602.18292 • Published Feb 20 • 11
The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus Paper • 2603.20105 • Published 11 days ago • 36