purbeshmitra/semantic-soft-bootstrapping
Text Generation
•
Updated
•
5
•
1
A self-distillation based training method for long context reasoning in a single LLM without reinforcement learning