OsakanaTeishoku/qwen3-4b-structured-output-20260108_cot3_epoch3_T4_merged_DPO Text Generation • 4B • Updated 7 days ago • 16
OsakanaTeishoku/gpt-oss-120b-distill-sarashina2.2-3b-cot-sft-step1000-test-20251006-lora-exp Updated Oct 12, 2025
OsakanaTeishoku/sarashina2.2-3b-instruct-v0.1-grpo-exp-v0.1 Text Generation • 3B • Updated Mar 7, 2025 • 1
OsakanaTeishoku/mixtral_2x300m_wikipython_step1000_0408 Text Generation • 0.5B • Updated Jun 1, 2024 • 6