Best Mini-Gemini Alternatives in 2025
-

Discover Gemini, Google's advanced AI model designed to revolutionize AI interactions. With multimodal capabilities, sophisticated reasoning, and advanced coding abilities, Gemini empowers researchers, educators, and developers to uncover knowledge, simplify complex subjects, and generate high-quality code. Explore the potential and possibilities of Gemini as it transforms industries worldwide.
-

Gemma 3: Google's open-source AI for powerful, multimodal apps. Build multilingual solutions easily with flexible, safe models.
-

Enhance vision-language understanding with MiniGPT-4. Generate image descriptions, create websites, identify humor elements, and more! Discover its versatile capabilities.
-

GLM-4.5V: Empower your AI with advanced vision. Generate web code from screenshots, automate GUIs, & analyze documents & video with deep reasoning.
-

Gemma 3 270M: Compact, hyper-efficient AI for specialized tasks. Fine-tune for precise instruction following & low-cost, on-device deployment.
-

Gemma 3n brings powerful multimodal AI to the edge. Run image, audio, video, & text AI on devices with limited memory.
-

With a total of 8B parameters, the model surpasses proprietary models such as GPT-4V-1106, Gemini Pro, Qwen-VL-Max and Claude 3 in overall performance.
-

Gemma 2 offers best-in-class performance, runs at incredible speed across different hardware and easily integrates with other AI tools, with significant safety advancements built in.
-

DeepGemini: Multi-model AI orchestration. Integrate DeepSeek, Claude, OpenAI & more. Flexible workflows, OpenAI API compatible. Open-source!
-

GLM-4-9B is the open-source version of the latest generation of pre-trained models in the GLM-4 series launched by Zhipu AI.
-

EmbeddingGemma: On-device, multilingual text embeddings for privacy-first AI apps. Get best-in-class performance & efficiency, even offline.
-

Yi Visual Language (Yi-VL) model is the open-source, multimodal version of the Yi Large Language Model (LLM) series, enabling content comprehension, recognition, and multi-round conversations about images.
-

Build AI models from scratch! MiniMind offers fast, affordable LLM training on a single GPU. Learn PyTorch & create your own AI.
-

CogVLM and CogAgent are powerful open-source visual language models that excel in image understanding and multi-turn dialogue.
-

MiniCPM is an End-Side LLM developed by ModelBest Inc. and TsinghuaNLP, with only 2.4B parameters excluding embeddings (2.7B in total).
-

MiniCPM3-4B is the 3rd generation of MiniCPM series. The overall performance of MiniCPM3-4B surpasses Phi-3.5-mini-Instruct and GPT-3.5-Turbo-0125, being comparable with many recent 7B~9B models.
-

Gemini CLI: Get AI power right in your terminal. Open-source agent for developers. Enhance coding, research, & automation workflows.
-

Discover MiniAutoGen, the open-source library for Large Language Models. Empower your conversational AI research with lightweight and customizable agents.
-

DeepSeek-VL2, a vision - language model by DeepSeek-AI, processes high - res images, offers fast responses with MLA, and excels in diverse visual tasks like VQA and OCR. Ideal for researchers, developers, and BI analysts.
-

ChatGLM-6B is an open CN&EN model w/ 6.2B paras (optimized for Chinese QA & dialogue for now).
-

Gemini Robotics: Discover adaptable AI robots powered by Gemini. Intelligent & versatile for homes, factories, and beyond. The future is here!
-

BAGEL: Open-source multimodal AI from ByteDance-Seed. Understands, generates, edits images & text. Powerful, flexible, comparable to GPT-4o. Build advanced AI apps.
-

WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art opensource models.
-

Gemma is a family of lightweight, open models built from the research and technology that Google used to create the Gemini models.
-

Nano Banana: AI image editing & creation with Gemini 2.5 Flash. Achieve precise, text-based transformations & unmatched character consistency, fast.
-

ggml is a tensor library for machine learning to enable large models and high performance on commodity hardware.
-

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
-

CM3leon: A versatile multimodal generative model for text and images. Enhance creativity and create realistic visuals for gaming, social media, and e-commerce.
-

A high-throughput and memory-efficient inference and serving engine for LLMs
-

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
