Running on CPU Upgrade Featured 2.85k The Smol Training Playbook ๐ 2.85k The secrets to building world-class LLMs
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand Dec 4, 2025 โข 63
view article Article Tricks from OpenAI gpt-oss YOU ๐ซต can use with transformers +5 Sep 11, 2025 โข 177
view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 โข 282
4KAgent: Agentic Any Image to 4K Super-Resolution Paper โข 2507.07105 โข Published Jul 9, 2025 โข 105
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper โข 2507.07996 โข Published Jul 10, 2025 โข 35
Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence Paper โข 2505.23747 โข Published May 29, 2025 โข 68 โข 3