Beyond Turn Limits: Training Deep Search Agents with Dynamic Context Window • Paper • 2510.08276 • Published Oct 9 • 9
RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback • Paper • 2507.15024 • Published Jul 20 • 14
ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases • Paper • 2306.05301 • Published Jun 8, 2023 • 2
Self-Retrieval: Building an Information Retrieval System with One Large Language Model • Paper • 2403.00801 • Published Feb 23, 2024 • 2
Retentive or Forgetful? Diving into the Knowledge Memorizing Mechanism of Language Models • Paper • 2305.09144 • Published May 16, 2023
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization • Paper • 2410.08815 • Published Oct 11, 2024 • 47
A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models • Paper • 2410.13841 • Published Oct 17, 2024 • 16