TOOL UPDATES

IBM Releases Granite-4.0-3B-Vision Model for Document Data Extraction

M megaone_admin Mar 29, 2026 2 min read
Engine Score 8/10 — Important

This is a new vision model release from IBM, a significant industry player, offering high actionability for developers. While not a revolutionary breakthrough, it represents a solid update to available AI tools.

Editorial illustration for: IBM Releases Granite-4.0-3B-Vision Model for Document Data Extraction

IBM has released Granite-4.0-3B-Vision, a compact vision-language model designed specifically for enterprise document data extraction tasks. The 3-billion parameter model is now available on Hugging Face and focuses on specialized extraction capabilities that smaller models typically struggle with.

The model targets three primary use cases: chart extraction, table extraction, and semantic key-value pair extraction from document images. For chart processing, it can convert visual charts into structured formats including CSV data (Chart2CSV), descriptive summaries (Chart2Summary), and executable code that recreates the chart (Chart2Code).

For table extraction, Granite-4.0-3B-Vision can process complex table layouts and output them in multiple structured formats. The model supports JSON output with detailed cell-level metadata including row and column indices, span information, and content type classification. It also generates HTML tables and OTSL (a specialized table markup format) that uses specific tags like “<fcel>” for filled cells, “<ecel>” for empty cells, and “<lcel>” for merged cells.

The model includes built-in chat templates with task-specific prompts. For CSV extraction, it instructs users that the output should “include a header row with clear column names” and “represent all data series/categories shown in the chart” while using “numeric values that match the chart as closely as possible.” The JSON extraction prompt specifies a detailed schema structure with dimensions, cell properties, and content classification.

IBM positions this as an “enterprise-grade” solution for document processing workflows, though the company has not disclosed training data details, benchmark performance metrics, or availability timeline beyond the current Hugging Face release.

Share

Enjoyed this story?

Get articles like this delivered daily. The Engine Room — free AI intelligence newsletter.

Join 500+ AI professionals · No spam · Unsubscribe anytime

M
MegaOne AI Editorial Team

MegaOne AI monitors 200+ sources daily to identify and score the most important AI developments. Our editorial team reviews 200+ sources with rigorous oversight to deliver accurate, scored coverage of the AI industry. Every story is fact-checked, linked to primary sources, and rated using our six-factor Engine Score methodology.

About Us Editorial Policy