Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Cornell-AGI
university
Activity Feed
Follow
9
AI & ML interests
Reinforcement Learning from Human Feedback
Team members
1
Cornell-AGI
's models
20
Sort: Recently updated
Cornell-AGI/apo_math_qwen2.5_1.5b
Text Generation
•
2B
•
Updated
May 5
•
13
Cornell-AGI/ppo_math_qwen2.5_1.5b
Text Generation
•
2B
•
Updated
May 5
•
16
Cornell-AGI/rebel_math_qwen2.5_1.5b
Text Generation
•
2B
•
Updated
May 5
•
20
Cornell-AGI/grpo_math_qwen2.5_3b
Text Generation
•
3B
•
Updated
May 5
•
13
Cornell-AGI/grpo_math_qwen2.5_1.5b
Text Generation
•
2B
•
Updated
May 5
•
16
Cornell-AGI/ppo_math_qwen2.5_3b
Text Generation
•
3B
•
Updated
May 5
•
16
Cornell-AGI/rebel_math_qwen2.5_3b
Text Generation
•
3B
•
Updated
May 5
•
11
Cornell-AGI/apo_math_qwen2.5_3b
Text Generation
•
3B
•
Updated
May 5
•
13
Cornell-AGI/grpo_math_qwen2.5_7b
Text Generation
•
8B
•
Updated
May 5
•
15
Cornell-AGI/ppo_math_qwen2.5_7b
Text Generation
•
8B
•
Updated
May 5
•
14
Cornell-AGI/rebel_math_qwen2.5_7b
Text Generation
•
8B
•
Updated
May 4
•
10
Cornell-AGI/apo_math_qwen2.5_7b
Text Generation
•
8B
•
Updated
May 4
•
17
•
1
Cornell-AGI/REFUEL-Llama-3-Armo-iter_2
8B
•
Updated
Oct 8, 2024
•
9
Cornell-AGI/REFUEL-Llama-3-Armo-iter_1
8B
•
Updated
Oct 8, 2024
•
10
Cornell-AGI/REBEL-Llama-3-Armo-iter_3
8B
•
Updated
Sep 2, 2024
•
8
•
2
Cornell-AGI/REBEL-Llama-3-Armo-iter_2
8B
•
Updated
Sep 2, 2024
•
15
•
1
Cornell-AGI/REBEL-Llama-3-Armo-iter_1
8B
•
Updated
Sep 2, 2024
•
10
•
1
Cornell-AGI/REBEL-Llama-3-epoch_2
Text Generation
•
Updated
Sep 1, 2024
•
17
•
3
Cornell-AGI/REBEL-Llama-3
Text Generation
•
Updated
Sep 1, 2024
•
23
•
1
Cornell-AGI/REBEL-OpenChat-3.5
Text Generation
•
Updated
Sep 1, 2024
•
19
•
1