Best MiniCPM-2B Alternatives in 2025
-

With a total of 8B parameters, the model surpasses proprietary models such as GPT-4V-1106, Gemini Pro, Qwen-VL-Max and Claude 3 in overall performance.
-

MiniCPM3-4B is the third generation of the MiniCPM series. Its overall performance surpasses Phi-3.5-mini-Instruct and GPT-3.5-Turbo-0125 and is comparable to many recent 7B-9B models.
-

SmolLM is a series of state-of-the-art small language models available in three sizes: 135M, 360M, and 1.7B parameters.
-

Build AI models from scratch! MiniMind offers fast, affordable LLM training on a single GPU. Learn PyTorch & create your own AI.
-

OpenBMB: Building a large-scale pre-trained language model center and tools to accelerate training, tuning, and inference of big models with over 10 billion parameters. Join our open-source community and bring big models to everyone.
-

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
-

PolyLM, a revolutionary polyglot LLM, supports 18 languages, excels in tasks, and is open-source. Ideal for devs, researchers, and businesses for multilingual needs.
-

JetMoE-8B was trained for less than $0.1 million, yet it outperforms LLaMA2-7B from Meta AI, which has multi-billion-dollar training resources. LLM training can be much cheaper than generally thought.
-

Yuan2.0-M32 is a Mixture-of-Experts (MoE) language model with 32 experts, of which 2 are active.
-

MiniMax-M1: Open-weight AI model with 1M token context & deep reasoning. Process massive data efficiently for advanced AI applications.
-

Explore InternLM2, an AI tool with open-sourced models! Excel in long-context tasks, reasoning, math, code interpretation, and creative writing. Discover its versatile applications and strong tool utilization capabilities for research, application development, and chat interactions. Upgrade your AI landscape with InternLM2.
-

Discover PaLM 2, Google's advanced language model for reasoning, translation, and coding tasks. Built with responsible AI practices, PaLM 2 excels in multilingual collaboration and specialized code generation.
-

OLMo 2 32B: Open-source LLM rivals GPT-3.5! Free code, data & weights. Research, customize, & build smarter AI.
-

To speed up LLM inference and improve the model's perception of key information, this tool compresses the prompt and KV cache, achieving up to 20x compression with minimal performance loss.
-

Qwen2.5 series language models offer enhanced capabilities with larger datasets, more knowledge, better coding and math skills, and closer alignment to human preferences. Open-source and available via API.
-

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
-

MiniMax is the latest generation of large-scale Chinese language models, and its main goal is to help humans write efficiently, stimulate creativity, acquire knowledge, and make decisions.
-

Discover StableLM, an open-source language model by Stability AI. Generate high-performing text and code on personal devices with small and efficient models. Transparent, accessible, and supportive AI technology for developers and researchers.
-

A high-throughput and memory-efficient inference and serving engine for LLMs
-

CentML streamlines LLM deployment, reduces costs up to 65%, and ensures peak performance. Ideal for enterprises and startups. Try it now!
-

Mini-Gemini supports a series of dense and MoE Large Language Models (LLMs) from 2B to 34B with image understanding, reasoning, and generation simultaneously. We build this repo based on LLaVA.
-

A Trailblazing Language Model Family for Advanced AI Applications. Explore efficient, open-source models with layer-wise scaling for enhanced accuracy.
-

Mistral Small 3 (2501) sets a new benchmark in the "small" Large Language Models category below 70B, boasting 24B parameters and achieving state-of-the-art capabilities comparable to larger models!
-

OpenBioLLM-8B is an advanced open source language model designed specifically for the biomedical domain.
-

nanochat: Master the LLM stack. Build & deploy full-stack LLMs on a single node with ~1000 lines of hackable code, affordably. For developers.
-

WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art open-source models.
-

VoxCPM: Realistic, tokenizer-free AI Text-to-Speech. Get context-aware speech generation & true-to-life voice cloning for natural audio.
-

Phi-2 is an ideal model for researchers to explore different areas such as mechanistic interpretability, safety improvements, and fine-tuning experiments.
-

StableLM Zephyr 3B is a new chat model that represents the latest addition to the StableLM series of lightweight Large Language Models (LLMs) from Stability AI.
-

ChatGLM-6B is an open Chinese-English bilingual model with 6.2B parameters (currently optimized for Chinese QA and dialogue).
