Open Source Model Open Source

Granite 4.0 3B Vision

A compact vision-language model designed for enterprise-grade document data extraction, focusing on charts, tables, and key-value pairs.

Price Free / Open Source
Category Open Source Model
Model Open Source
Company International Business Machines Corporation (IBM)
Team Size 284,500

Granite 4.0 3B Vision is a vision-language model (VLM) engineered specifically for enterprise document understanding and reliable information extraction from complex documents, forms, and structured visuals. It excels at tasks such as accurately parsing complex table structures, converting charts into structured machine-readable formats or code, and identifying semantic key-value pairs across diverse document layouts. The model is delivered as a 0.5B parameter LoRA adapter on top of the 3.5B parameter Granite 4.0 Micro base language model, enabling modular deployment for both multimodal and text-only workloads.

5/10

Granite 4.0 3B Vision fills a specific niche for enterprise document extraction with Apache 2.0 licensing, but remains a specialized tool with limited broader market impact. Despite IBM's resources, it holds a niche position in a competitive landscape dominated by larger, more versatile models.

Free Tier
Try Granite 4.0 3B Vision

Visit the official Granite 4.0 3B Vision website