Belebele
Berkeley Function-Calling Leaderboard| Launched | 2023 |
| Pricing Model | Free |
| Starting Price | |
| Tech used | |
| Tag | Text Analysis |
| Launched | |
| Pricing Model | Free |
| Starting Price | |
| Tech used | Google Analytics,Google Tag Manager,cdnjs,Fastly,Google Fonts,Bootstrap,GitHub Pages,Gzip,Varnish,YouTube |
| Tag | Llm Benchmark Leaderboard,Data Analysis,Data Visualization |
| Global Rank | 0 |
| Country | |
| Month Visit | 0 |
| Global Rank | |
| Country | |
| Month Visit |
Estimated traffic data from Similarweb
LiveBench - LiveBench is an LLM benchmark with monthly new questions from diverse sources and objective answers for accurate scoring, currently featuring 18 tasks in 6 categories and more to come.
ZeroBench - ZeroBench: The ultimate benchmark for multimodal models, testing visual reasoning, accuracy, and computational skills with 100 challenging questions and 334 subquestions.
AI2 WildBench Leaderboard - WildBench is an advanced benchmarking tool that evaluates LLMs on a diverse set of real-world tasks. It's essential for those looking to enhance AI performance and understand model limitations in practical scenarios.
The Pile - Discover the power of The Pile, an 825 GiB open-source language dataset by EleutherAI. Train models with broader generalization abilities.
ModelBench - Launch AI products faster with no-code LLM evaluations. Compare 180+ models, craft prompts, and test confidently.