Yofuria/UltraFeedback-binarized-ms-swift-hard-1024-v5-Llama-3.2-3B Viewer • Updated 3 days ago • 31.8k • 16
Yofuria/UltraFeedback-binarized-ms-swift-hard-1024-v5-Llama-3.2-3B Viewer • Updated 3 days ago • 31.8k • 16
Yofuria/UltraFeedback-binarized-ms-swift-hard-1024-v5-Qwen2.5-v2 Viewer • Updated 7 days ago • 18.8k • 20
Yofuria/UltraFeedback-binarized-ms-swift-hard-1024-v5-Qwen2.5-v2 Viewer • Updated 7 days ago • 18.8k • 20
\$OneMillion-Bench: How Far are Language Agents from Human Experts? Paper • 2603.07980 • Published Mar 9 • 27