ThinkMix 1B
Fine-tuned to use <think></think> tags for reasoning.
Works best around temperature 0.5-0.6.
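As a rough illustration of working with this output style, the sketch below separates the reasoning inside `<think></think>` tags from the rest of a completion. The helper name `split_reasoning`, the `SAMPLING` dict, and the sample string are hypothetical and not part of the model's release; the exact shape of real completions is an assumption. The dict keys mirror the usual Hugging Face `generate` keyword names, with the temperature picked from the range recommended above.

```python
import re

# Hypothetical sampling settings; the temperature comes from the card's
# suggested 0.5-0.6 range. Keys mirror common `generate` keyword names.
SAMPLING = {"do_sample": True, "temperature": 0.6}

# Non-greedy match so multiple <think> blocks are captured separately;
# DOTALL lets the reasoning span several lines.
_THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Return (reasoning, answer) for a completion that may contain <think> blocks."""
    reasoning = "\n".join(block.strip() for block in _THINK_RE.findall(text))
    answer = _THINK_RE.sub("", text).strip()
    return reasoning, answer


# Made-up completion used only to demonstrate the helper.
sample = "<think>2 + 2 is 4.</think>The answer is 4."
reasoning, answer = split_reasoning(sample)
print(reasoning)
print(answer)
```

If a completion contains no tags at all, the helper returns an empty reasoning string and the full text as the answer, so it can be applied to any output without extra checks.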
- Developed by: theprint
- License: apache-2.0
- Fine-tuned from model: unsloth/llama-3.2-1b-instruct-unsloth-bnb-4bit
This Llama model was trained 2x faster with Unsloth and Hugging Face's TRL library.
