- Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video
  Paper • 2605.15182 • Published • 39
- STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?
  Paper • 2605.06527 • Published • 44
- Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis
  Paper • 2605.14392 • Published • 8
- World Action Models: The Next Frontier in Embodied AI
  Paper • 2605.12090 • Published • 67
Collections including paper arxiv:2605.12090
- FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
  Paper • 2402.10986 • Published • 83
- Aria Everyday Activities Dataset
  Paper • 2402.13349 • Published • 31
- Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
  Paper • 2403.04132 • Published • 41
- SaulLM-7B: A pioneering Large Language Model for Law
  Paper • 2403.03883 • Published • 92

- Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
  Paper • 2605.15980 • Published • 36
- NGRPO: Negative-enhanced Group Relative Policy Optimization
  Paper • 2509.18851 • Published • 2
- CEPO: RLVR Self-Distillation using Contrastive Evidence Policy Optimization
  Paper • 2605.19436 • Published • 14
- Delta Attention Residuals
  Paper • 2605.18855 • Published • 8

- WorldVLA: Towards Autoregressive Action World Model
  Paper • 2506.21539 • Published • 40
- LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation
  Paper • 2509.05263 • Published • 11
- VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators
  Paper • 2510.00406 • Published • 68
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model
  Paper • 2510.19430 • Published • 53