CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models Paper • 2509.09675 • Published Sep 11, 2025 • 28 • 2
R1-RE: Cross-Domain Relationship Extraction with RLVR Paper • 2507.04642 • Published Jul 7, 2025 • 6 • 1
Learning to Reason via Mixture-of-Thought for Logical Reasoning Paper • 2505.15817 • Published May 21, 2025 • 18 • 7
Learning to Reason via Mixture-of-Thought for Logical Reasoning Paper • 2505.15817 • Published May 21, 2025 • 18 • 7
Learning to Reason via Mixture-of-Thought for Logical Reasoning Paper • 2505.15817 • Published May 21, 2025 • 18 • 7