Launched | 2023 |
Pricing Model | Free |
Starting Price | |
Tech used | |
Tag |
Launched | |
Pricing Model | Free |
Starting Price | |
Tech used | |
Tag |
Global Rank | 0 |
Country | |
Month Visit | 0 |
Global Rank | |
Country | |
Month Visit |
TruthfulQA - Measure language model truthfulness with TruthfulQA, a benchmark of 817 questions across 38 categories. Avoid false answers based on misconceptions.
MMStar - MMStar, a benchmark test set for evaluating large-scale multimodal capabilities of visual language models. Discover potential issues in your model's performance and evaluate its multimodal abilities across multiple tasks with MMStar. Try it now!
Lebesgue - Supercharge your marketing strategies with Lebesgue, the AI tool that analyzes data, provides recommendations, and offers benchmarking and competitive analysis. Start your free trial now!
BenchLLM by V7 - BenchLLM: Evaluate LLM responses, build test suites, automate evaluations. Enhance AI-driven systems with comprehensive performance assessments.