Locas: Your Models are Principled Initializers of Locally-Supported Parametric Memories Paper • 2602.05085 • Published 10 days ago • 4
UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory Paper • 2602.10652 • Published 4 days ago • 2
Table-as-Search: Formulate Long-Horizon Agentic Information Seeking as Table Completion Paper • 2602.06724 • Published 9 days ago • 2
The Era of Agentic Organization: Learning to Organize with Language Models Paper • 2510.26658 • Published Oct 30, 2025 • 29
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30, 2025 • 117
Multi-modal Retrieval Augmented Multi-modal Generation: Datasets, Evaluation Metrics and Strong Baselines Paper • 2411.16365 • Published Nov 25, 2024 • 1
HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application Paper • 2510.19631 • Published Oct 22, 2025 • 28
DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking Paper • 2510.20168 • Published Oct 23, 2025 • 28
Generating Human-level Text with Contrastive Search in Transformers 🤗 Article • Nov 8, 2022 • 17
Inference-Time Scaling for Generalist Reward Modeling Paper • 2504.02495 • Published Apr 3, 2025 • 58
Constraint Back-translation Improves Complex Instruction Following of Large Language Models Paper • 2410.24175 • Published Oct 31, 2024 • 18
T2I-Eval Collection Open-source toolkit for automatic evaluation of text-to-image generation task, including training & test datasets and a distilled MLLM. • 6 items • Updated Feb 17, 2025 • 1
hammerllm-1.4b Collection Intermediate checkpoints of hammerllm-1.4b • 20 items • Updated Sep 13, 2024 • 2
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published Nov 12, 2024 • 65
🦋SEALONG Collection Large Language Models Can Self-Improve in Long-context Reasoning • 7 items • Updated Nov 14, 2024 • 7
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22, 2024 • 134
Contrastive Decoding Improves Reasoning in Large Language Models Paper • 2309.09117 • Published Sep 17, 2023 • 39