Belebele vs. LiveBench: 2025 Comparison

Belebele


The repository for the Belebele dataset, a massively multilingual reading comprehension dataset.

LiveBench


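LiveBench is a large language model benchmark that draws new questions with objective answers from a variety of sources every month, so that responses can be scored accurately. It currently comprises 18 tasks across 6 categories, with more tasks to be added over time.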

Belebele

Launched	2023
Pricing Model	Free
Starting Price
Tech used
Tag	Text Analysis

LiveBench

Launched	2024-05
Pricing Model	Free
Starting Price
Tech used	Google Analytics,Google Tag Manager,Fastly,GitHub Pages,Gzip,Progressive Web App,Varnish
Tag	LLM Benchmark Leaderboard

Belebele Rank/Visit

Global Rank	0
Country
Monthly Visits	0


LiveBench Rank/Visit

Global Rank	111818
Country	United States
Monthly Visits	409857

Top 5 Countries

United States	23.78%
China	10.9%
United Kingdom	4.8%
Canada	4.33%
Taiwan	4.32%

Traffic Sources

Social	4.16%
Paid Referrals	0.56%
Mail	0.07%
Referrals	6.71%
Search	36.53%
Direct	51.95%

Estimated traffic data from Similarweb

What are some alternatives?

When comparing Belebele and LiveBench, you can also consider the following products:

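ZeroBench - ZeroBench: the ultimate benchmark for multimodal models, testing visual reasoning, accuracy, and computational ability with 100 challenging questions and 334 subquestions.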

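AI2 WildBench Leaderboard - WildBench is an advanced benchmarking tool for evaluating how LLMs perform on a wide range of real-world tasks. It is an essential tool for anyone who wants to improve AI performance and understand a model's limitations in practical settings.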

The Pile - Explore the power of The Pile, an 825 GiB open-source language dataset from EleutherAI. Train models with broader generalization ability.

ModelBench - Speed up your AI product launches with no-code LLM evaluation. Compare more than 180 models, design prompts, and test with confidence.

