FrontiersMind/Nandi-Mini-V1.1-600M-Intermediate-Checkpoint-400GT Text Generation β’ 0.6B β’ Updated 13 days ago β’ 263 β’ 8
FrontiersMind/Nandi-Mini-600M-GuardRails Text Generation β’ 0.6B β’ Updated 25 days ago β’ 9.06k β’ 14
FrontiersMind/Nandi-Mini-600M-Early-Checkpoint Text Generation β’ 0.6B β’ Updated 26 days ago β’ 17.3k β’ 104
FrontiersMind/Nandi-Mini-150M-Tool-Calling Text Generation β’ 0.2B β’ Updated 25 days ago β’ 21.9k β’ 52
view article Article How I contributed a new model to the Transformers library using Codex nielsr β’ Mar 30 β’ 52
FrontiersMind/Nandi-Mini-150M-Instruct Text Generation β’ 0.2B β’ Updated 25 days ago β’ 833 β’ 52
Running on CPU Upgrade 246 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens π 246 Explore synthetic data benchmarks via an interactive bookshelf
view article Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix codelion β’ Nov 3, 2025 β’ 65
Running Featured 1.36k FineWeb: decanting the web for the finest text data at scale π· 1.36k Explore and download the FineWeb webβscale text dataset
Running 3.88k The Ultra-Scale Playbook π 3.88k The ultimate guide to training LLM on large GPU Clusters
The Instruction Gap: LLMs get lost in Following Instruction Paper β’ 2601.03269 β’ Published Dec 19, 2025 β’ 8
Running on CPU Upgrade Featured 3.2k The Smol Training Playbook π 3.2k The secrets to building world-class LLMs
view reply You don't really have to clone the repo. The FastAPI code is just there for demonstration, and you can code the way you like. The main takeaway is the Dockerfile.