HFXM/Entropy_final_reward_model-FsfairX-LR1e-5-Epoch2 • 8B • 4
HFXM/Entropy_final_reward_model-FsfairX-LR2e-5-Epoch2 • 8B • 5
HFXM/Entropy_final_reward_model-FsfairX-LR2e-5-Epoch1 • 8B • 4
HFXM/Entropy_final_reward_model-FsfairX-LR1e-5-Epoch1 • 8B • 5
HFXM/Entropy_final_reward_model-Llama3_1-8B-LR2e-5-Epoch2 • 8B • 4
HFXM/Entropy_final_reward_model-Skywork-LR1e-5-Epoch2 • 8B • 4
HFXM/Entropy_final_reward_model-Skywork-LR1e-4-Epoch2 • 8B • 4
HFXM/Entropy_final_reward_model-Skywork-LR2e-5-Epoch2 • 8B • 4
HFXM/Entropy_final_reward_model-Llama3_1-8B-LR1e-4-Epoch2 • 8B • 6
HFXM/Entropy_final_reward_model-Llama3_1-8B-LR1e-5-Epoch2 • 8B • 4
HFXM/Entropy_final_reward_model-Skywork-LR2e-5-Epoch1 • 8B • 4
HFXM/Entropy_final_reward_model-Skywork-LR1e-4-Epoch1 • 8B • 4
HFXM/Entropy_final_reward_model-Skywork-LR1e-5-Epoch1 • 8B • 4
HFXM/Entropy_final_reward_model-Llama3_1-8B-LR1e-5-Epoch1 • 8B • 4
HFXM/Entropy_final_reward_model-Llama3_1-8B-LR2e-5-Epoch1 • 8B • 5
HFXM/Entropy_final_reward_model-Llama3_1-8B-LR1e-4-Epoch1 • 8B • 5
HFXM/Entropy_final_reward_model • 8B • 5
HFXM/RM_HHRLHF_Rule18_Seed2029 • Text Classification • 8B • 7
HFXM/RM_HHRLHF_Rule19_Seed2029 • Text Classification • 8B • 4
HFXM/RM_HHRLHF_Rule19_Seed2026 • Text Classification • 8B • 6
HFXM/RM_HHRLHF_Rule19_Seed2028 • Text Classification • 8B • 5
HFXM/RM_HHRLHF_Rule13_Seed2028 • Text Classification • 8B • 4
HFXM/RM_HHRLHF_Rule16_Seed2027 • Text Classification • 8B • 6
HFXM/RM_HHRLHF_Rule11_Seed2027 • Text Classification • 8B • 6
HFXM/RM_HHRLHF_Rule11_Seed2028 • Text Classification • 8B • 7
HFXM/RM_HHRLHF_Rule17_Seed2026 • Text Classification • 8B • 3
HFXM/RM_HHRLHF_Rule10_Seed2028 • Text Classification • 8B • 4
HFXM/RM_HHRLHF_Rule17_Seed2025 • Text Classification • 8B • 4
HFXM/RM_HHRLHF_Rule10_Seed2026 • Text Classification • 8B • 4
HFXM/RM_HHRLHF_Rule14_Seed2026 • Text Classification • 8B • 5