mradermacher/inframind-0.5b-dapo-GGUF Reinforcement Learning • 0.5B • Updated about 1 month ago • 210
mradermacher/inframind-0.5b-grpo-GGUF Reinforcement Learning • 0.5B • Updated about 1 month ago • 220