Running 108 Open Japanese LLM Leaderboard 🌸 108 Explore and compare LLM models with interactive filters and visualizations
Running 37 Polish Information Retrieval Benchmark (PIRB) 📈 37 View evaluation results on a leaderboard