Qwen-VL Max
by Alibaba Cloud · paid · Last verified 2026-03-17
Alibaba Cloud's flagship vision-language model capable of understanding images, documents, charts, and diagrams alongside text. Excels at OCR, visual question answering, and document comprehension tasks across multiple languages.
https://qwenlm.github.io ↗C+
C+—Average
Adoption: BQuality: AFreshness: ACitations: BEngagement: F
Specifications
- License
- Proprietary
- Pricing
- paid
- Capabilities
- image-understanding, document-comprehension, ocr, visual-qa, chart-analysis, text-generation
- Integrations
- alibaba-cloud, dashscope, langchain
- Use Cases
- document-digitization, visual-qa, chart-analysis, content-moderation, accessibility
- API Available
- Yes
- Parameters
- Undisclosed
- Context Window
- 32K tokens
- Modalities
- text, image
- Training Cutoff
- Mid 2024
- Tags
- multimodal, vision-language, proprietary, ocr, document-understanding
- Added
- 2026-03-17
- Completeness
- 95%
Index Score
57.4Adoption
65
Quality
82
Freshness
80
Citations
60
Engagement
0