TitleOS/rlaif_training_fictional_patriot_experiment
Research into RLAIF (Reinforcement Learning from AI Feedback), aimed at Constitutional AI and sycophancy resistance.