GeorgiaTech
/

0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_3

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Model card Files Files and versions

0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_3

16.1 GB

1 contributor

History: 4 commits

ZhangShenao's picture

End of training

fa65ef7 verified over 1 year ago