Learning from examples - training/inference - a shoaibmohd Collection

shoaibmohd 's Collections

Self Supervision

Memory

NBA/Recommenders

Computer Use Agent

Learning from examples - training/inference

OCR

Data Analysis Papers

Learning from examples - training/inference

updated Nov 22, 2025

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2, 2025 • 80
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning

Paper • 2510.01132 • Published Oct 1, 2025 • 5
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 127
MixReasoning: Switching Modes to Think

Paper • 2510.06052 • Published Oct 7, 2025 • 21
Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 270
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks

Paper • 2510.08002 • Published Oct 9, 2025 • 23
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published Oct 8, 2025 • 48
Dr.LLM: Dynamic Layer Routing in LLMs

Paper • 2510.12773 • Published Oct 14, 2025 • 31
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 108
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Paper • 2511.14460 • Published Nov 18, 2025 • 20