GPT-OSS General (4.2B to 20B) Collection Collection of pruned GPT-OSS models spanning 1-32 experts, maintaining general capabilities across domains while reducing computational requirements. • 29 items • Updated Aug 13, 2025 • 10
GPT-OSS Pruned Experts (4.2B-20B) [IF, Science, Math, etc.] Collection Complete collection of domain-specialized GPT-OSS models (1-32 experts) optimized for science, math, medicine, law, safety, and instruction following. • 8 items • Updated Aug 13, 2025 • 10
Roleplaying Collection Creativity at the cost of context & knowledge • 5 items • Updated Dec 13, 2025 • 14
Kimi-Linear-A3B Collection Moonshot's experimental MoE model with Kimi Delta Attention • 3 items • Updated Nov 1, 2025 • 18
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional at agentic intelligence • 5 items • Updated Nov 14, 2025 • 162
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 85
💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for on-device deployment. • 27 items • Updated 5 days ago • 136
Quantized Olmo 3 Collection Verified models. All compatible with vLLM for very fast inference. Use the 3.1 models as they are more recent. • 23 items • Updated Dec 15, 2025 • 4
AI PC: Text Generation Collection Text generation LLMs that have been validated to run on the AI PC Intel® Core™ Ultra CPU and iGPU. • 186 items • Updated Aug 28, 2024 • 12
Source files for GGUF, EXL2, AWQ, GPTQ, HQQ etc etc Collection Safetensor source files (by David_AU) to use directly and/or create different quants and/or merges. Link to GGUFS/full model card on each. • 371 items • Updated Dec 9, 2025 • 20
OpenVINO NPU Collection Models specifically tested on Intel's NPU with OpenVINO • 16 items • Updated 19 days ago • 2
OpenVINO GPU Collection Models tested on Ultra 7+ AIPC GPUs • 5 items • Updated 19 days ago • 1
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 192