Med-PRM Collection This collection hosts the Med-PRM series introduced in the paper "Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards" • 7 items • Updated Aug 16, 2025 • 4
CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction Paper • 2508.03159 • Published Aug 5, 2025 • 22
jhyun0414/20250717_Llama-3.1-8B-Instruct_norag_e1_lr2e-6_after0_conv_bce_stateless Text Generation • 8B • Updated Jul 17, 2025
jhyun0414/20250711-Llama-3.1-8B-Instruct-gemini_label-norag-lr5e-05-e1 Text Generation • 8B • Updated Jul 10, 2025 • 4
jhyun0414/20250711-Llama-3.1-8B-Instruct-gemini_label-norag-lr2e-06-e1 Text Generation • 8B • Updated Jul 10, 2025 • 1
jhyun0414/20250710-Llama-3.1-8B-Instruct-gemini_label-norag-lr2e-06-e1 Text Generation • 8B • Updated Jul 10, 2025 • 4
jhyun0414/20250630-Qwen3-4B-gemini_label-filter-rag-e3 Text Generation • 4B • Updated Jun 30, 2025 • 1
jhyun0414/Qwen3-1.7B-gemini_label-filter_yes-ep3-20250627_205805-RAG_yes 2B • Updated Jun 29, 2025 • 1
jhyun0414/20250629-Llama-3.1-8B-Instruct-gemini_score-filter-rag-e3 Text Generation • 8B • Updated Jun 29, 2025