wxzhang/dpo-selective-buffer-spo-shift

Text Generation

Generated from Trainer

text-generation-inference


14.5 GB

1 contributor

History: 41 commits

Latest commit: "Model save" (5d84c8e, verified, about 1 year ago)