Sometimes I fine-tune models specifically to take on expert roles in a MoE configuration; sometimes I find interesting models others have fine-tuned.
Rasmus Rasmussen
theprint
AI & ML interests
Agentic and small language model experiments, data sets and tools.
Recent Activity
liked a dataset about 15 hours ago: nvidia/AceReason-Math
liked a dataset about 15 hours ago: meta-math/MetaMathQA
updated a model about 20 hours ago: theprint/Llama3.2-1B-ThinkMix-Full-GGUF