Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nicholasKluge
/
RewardModel
like
1
Text Classification
Transformers
PyTorch
Safetensors
nicholasKluge/reward-aira-dataset
English
bert
reward model
alignment
preference model
RLHF
Carbon Emissions
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
main
RewardModel
1.75 GB
3 contributors
History:
67 commits
nicholasKluge
Update README.md
9144139
verified
7 months ago
.gitattributes
1.48 kB
initial commit
over 2 years ago
LICENSE
10.8 kB
Upload 11 files
over 2 years ago
README.md
8.45 kB
Update README.md
7 months ago
RewardModel_emissions.csv
784 Bytes
Upload 13 files
over 2 years ago
config.json
762 Bytes
Update config.json
about 2 years ago
model.safetensors
433 MB
xet
Adding `safetensors` variant of this model (#2)
over 2 years ago
optimizer.pt
867 MB
xet
Upload 13 files
over 2 years ago
pytorch_model.bin
433 MB
xet
Upload 13 files
over 2 years ago
rng_state.pth
14.6 kB
xet
Upload 13 files
over 2 years ago
scheduler.pt
627 Bytes
xet
Upload 13 files
over 2 years ago
special_tokens_map.json
125 Bytes
Upload 15 files
over 2 years ago
tokenizer.json
669 kB
Upload 13 files
over 2 years ago
tokenizer_config.json
315 Bytes
Upload 15 files
over 2 years ago
trainer_state.json
1.11 kB
Upload 13 files
over 2 years ago
training_args.bin
4.09 kB
xet
Upload 13 files
over 2 years ago
vocab.txt
213 kB
Upload 15 files
over 2 years ago
webgpt_eval.parquet
14.6 MB
xet
Upload webgpt_eval.parquet
over 2 years ago