-
-
-
-
-
-
Inference Providers
Active filters:
reward
li-jay-cs/test2-rlhf-rm-checkpoint
li-jay-cs/gpt2-medium-rlhf-rm-checkpoint
li-jay-cs/test3-rlhf-rm-checkpoint
li-jay-cs/gpt2-rlhf-rm-checkpoint
li-jay-cs/gpt2-training-full-rlhf-rm-checkpoint
Updated
•
12
li-jay-cs/gpt2-last_token_reward_and_full_training-rlhf-rm-checkpoint
li-jay-cs/1gpu-gpt2-myepoch1-gcp-reward-model
Text Classification
•
0.3B
•
Updated
•
13
ZhangNy/2024-11-18_10-58-28
0.2B
•
Updated
•
7
8B
•
Updated
•
377
•
6
33B
•
Updated
•
22
•
8
eth-nlped/Qwen2.5-1.5B-pedagogical-rewardmodel
Text Classification
•
2B
•
Updated
•
43
•
3
NiuTrans/GRAM-Qwen3-1.7B-RewardModel
2B
•
Updated
•
8
•
6
NiuTrans/GRAM-Qwen3-14B-RewardModel
15B
•
Updated
•
9
•
3
NiuTrans/GRAM-LLaMA3.2-3B-RewardModel
3B
•
Updated
•
10
•
3
NiuTrans/GRAM-Qwen3-4B-RewardModel
4B
•
Updated
•
14
•
2
NiuTrans/GRAM-Qwen3-8B-RewardModel
8B
•
Updated
•
9
•
4
prithivMLmods/GRAM-LLaMA3.2-3B-RewardModel-GGUF
Text Ranking
•
3B
•
Updated
•
126
prithivMLmods/GRAM-Qwen3-4B-RewardModel-GGUF
Text Ranking
•
4B
•
Updated
•
47
mradermacher/GRAM-LLaMA3.2-3B-RewardModel-GGUF
3B
•
Updated
•
106
mradermacher/GRAM-LLaMA3.2-3B-RewardModel-i1-GGUF
3B
•
Updated
•
1.63k
TIGER-Lab/EditReward-MiMo-VL-7B-SFT-2508
Image-to-Text
•
Updated
•
256
•
1
TIGER-Lab/EditReward-Qwen2.5-VL-7B
Image-Text-to-Text
•
Updated
•
43
•
3