RLAIF Experimentation

updated 4 days ago

Research into RLAIF (Reinforcement Learning from AI Feedback), aimed at Constitutional AI and sycophancy resistance.

  • TitleOS/rlaif_training_fictional_patriot_experiment

    Viewer • Updated 5 days ago • 255 • 24

  • TitleOS/RLAIF_Patriot_Experiment_LoRA

    Updated 5 days ago • 19

  • TitleOS/RLAIF_Patriot_Experiment_Q8_0-GGUF

    38.4M • Updated 4 days ago • 8

  • TitleOS/RLAIF_Patriot_Experiment_F16-GGUF

    38.4M • Updated 4 days ago • 13