[ICLR'24 Spotlight] Tool-Augmented Reward Modeling
AI & ML interests
Large Language Models
Recent Activity
View all activity
Papers
View all Papers models 12
ernie-research/Themis-7b
Updated • 3 • 4
ernie-research/APPS-Gemma-7B-MA-PPO-Fixed10
9B • Updated • 2
ernie-research/APPS-Gemma-2B-MA-PPO-Fixed10
3B • Updated • 10
ernie-research/HH-RLHF-Gemma-2B-MA-PPO-Fixed5
3B • Updated • 8
ernie-research/HH-RLHF-Gemma-7B-MA-PPO-Fixed5
9B • Updated • 2
ernie-research/TLDR-Gemma-7B-MA-PPO-Fixed5
9B • Updated
ernie-research/TLDR-Gemma-2B-MA-PPO-Fixed5
3B • Updated • 1 • 1
ernie-research/TLDR-Gemma-2-27B-MA-PPO-Fixed5
27B • Updated • 9
ernie-research/ernie-code-560m
Updated • 82 • 10
ernie-research/MonoGPT
Text Generation • 0.4B • Updated • 5 • 2
datasets 7
ernie-research/MEnvData-SWE-Trajectory
Viewer • Updated • 3.92k • 183 • 25
ernie-research/MEnvData-SWE
Preview • Updated • 680 • 3
ernie-research/MEnvBench
Viewer • Updated • 1k • 18 • 2
ernie-research/TARA
Preview • Updated • 13 • 1
ernie-research/GPTDynamics
Preview • Updated • 55 • 1
ernie-research/rendered_xnli
Updated • 8 • 1
ernie-research/rendered_GLUE
Updated • 27 • 1