Modelmultimodalv2.6

MiniCPM-V 2.6

by Tsinghua University (ModelBest) · free · Last verified 2026-03-17

MiniCPM-V 2.6 is a compact yet capable vision-language model from Tsinghua University, designed for deployment on edge devices and mobile platforms while achieving GPT-4V-level performance on several visual benchmarks. It supports high-resolution images and multi-image inputs, making it remarkably capable relative to its small footprint.

https://huggingface.co/openbmb/MiniCPM-V-2_6 ↗

C—Below Average

Adoption: C+Quality: AFreshness: ACitations: FEngagement: F

Specifications

License: Apache 2.0
Pricing: free
Capabilities: vision, visual-question-answering, ocr, image-captioning, multi-image-understanding
Integrations: Hugging Face, Ollama, LM Studio
Use Cases: edge-deployment, mobile-ai, visual-qa, document-analysis, on-device-vision
API Available: Yes
Parameters: 8B
Context Window: 128K
Modalities: text, image
Training Cutoff: 2024
Tags: tsinghua, efficient, vision-language, open-source, edge-deployment
Added: 2026-03-17
Completeness: 80%

Index Score

Adoption

Quality

Freshness

Citations

Engagement

Need help choosing the right model?

Get Expert Guidance

Explore the full AI ecosystem on Agents as a Service