Mistral OCR 4 targets the enterprise back office

Mistral OCR 4 reads a document like a structured map, not a wall of text. It is cheap, speaks 170 languages, and can run entirely on your own servers. Europe's AI champion is going after the enterprise back office.


Mistral OCR 4 targets the enterprise back office Image by: Mistral

Mistral OCR 4 reads a document like a structured map, not a wall of text. It is cheap, speaks 170 languages, and can run entirely on your own servers. Europe’s AI champion is going after the enterprise back office.

Mistral has a new model, and it is not a chatbot. The French company on 23 June released Mistral OCR 4, a system that turns documents into structured data, it said in a blog post. The model stays small and focused, chasing one huge target: the world’s paperwork.

Optical character recognition has been around for decades. The pitch here is what the model returns. Older systems convert a page into clean text. OCR 4 hands back a map of the page, with each block labelled and located. Independent annotators preferred it to every rival system tried, Mistral said, with an average win rate of 72%.

From page to structured map

OCR 4 does three new things at once. It draws bounding boxes around every element, so software knows exactly where each line sits. It classifies each block by type, marking titles, tables, equations and even signatures. And it adds a confidence score, per page and per word, so a human knows which parts to double-check.

Customers asked for bounding boxes more than any other feature, Mistral said. They let an app point to the exact source of an answer. Paired with block types and confidence scores, they enable citations, redactions and human review. The output also arrives as clean markdown.

The 💜 of EU tech

The latest rumblings from the EU tech scene, a story from our wise ol' founder Boris, and some questionable AI art. It's free, every week, in your inbox. Sign up now!

The shift matters because of what comes next. A chatbot can summarise a contract. An agent has to file it. For that, software needs to tell a signature from a sub-total, and know where each one sits. OCR 4 supplies that scaffolding, where older tools handed back a flat block of words.

It marks a clear break from the last version. OCR 3 focused on turning a page into clean text and tidy tables. OCR 4 returns the whole structure instead. Each block carries a location, a type and a score. Downstream systems then learn not just what a document says, but how it is built.

Built for the back office

OCR 4 targets enterprise drudgery. It feeds retrieval systems, the “RAG” pipelines that let chatbots answer from a company’s own files. It also gives AI agents the structure they need to act, not just read. That means filling forms, processing invoices and running compliance checks.

Its reach runs wide. The model handles PDFs, Word, PowerPoint and OpenDocument files, and reads 170 languages across 10 groups. Mistral says it holds up on low-resource languages where rivals fall away. Early users are digitising archives, turning invoices into fields and pulling clean text from scientific reports.

OCR 4 also plugs into Mistral’s new Search Toolkit, an open-source framework the firm unveiled at its AI Now Summit. The model’s structured output feeds straight into that pipeline. The aim is to hand developers citation-ready inputs, so an answer can point back to the page it came from.

The speed claims form part of the sell. Anaqua, which manages intellectual-property filings, said the model runs about four times faster per page than its previous tool. For high-volume docketing, where deadlines are unforgiving, that pace decides whether a workflow scales.

It slots into Mistral’s push beyond chatbots. The company already sells industrial AI to Airbus, BMW and EDF, and document work is the same enterprise bet by another name.

The sovereignty pitch

The headline feature for European buyers is where the model runs. OCR 4 is small enough to fit in a single container. So a company can host it on its own infrastructure and keep sensitive documents in-house.

That lands on Mistral’s core message. The firm sells itself as Europe’s sovereign alternative to American AI, and self-hosting answers the data-residency worries that come with Europe’s tightening sovereignty rules. For banks, hospitals and governments, keeping the paperwork on home soil is the point.

Cheap, and nearly everywhere

The price looks aggressive. The API costs $4 per 1,000 pages, halving to $2 in batch mode. A higher-level Document AI product, which reshapes output into custom fields, runs $5 per 1,000 pages. One customer, financial-research firm Rogo, claimed similar accuracy to its old provider at roughly eight times lower cost.

Distribution runs broad too. OCR 4 is live through Mistral’s own studio, Amazon SageMaker and Microsoft’s Foundry, with Snowflake support coming. Mistral, now valued near €20bn in fresh funding talks, is making sure its tools sit inside the clouds its customers already use.

Microsoft called the launch a milestone in its partnership with Mistral. That endorsement carries weight. It routes the model toward the enterprise buyers who already sit inside Microsoft’s cloud, and gives Mistral a distribution channel it could never build alone.

The strategy stays consistent. Over the past year, Mistral has wired itself into enterprise software rather than chasing consumer hype. A cheap, self-hostable document reader fits that plan neatly, because it pulls customers into the rest of its stack.

The case for caution

The benchmarks deserve a careful read. Mistral tops the public OlmOCRBench (85.20) and its own multilingual test. But the company calls those scores “directional”. It admits the benchmarks misjudge maths and multi-column text, and that it reproduced every competitor figure itself. The 72% win rate looks firmer, because humans judged real documents.

There are limits on use, too. Mistral is blunt that OCR 4 reads documents, it does not decide on them. It says the model is not for medical diagnosis, legal judgment or high-stakes finance. It extracts the words; a human still makes the call.

The market looks crowded as well. Google, AWS and a wave of startups all sell document AI. Mistral’s edge comes from the combination: structured output, low cost and a version you can run yourself. Whether that wins the back office, against far bigger clouds, remains the open question. For now, Europe’s AI champion has decided the boring documents are worth fighting for.

Get the TNW newsletter

Get the most important tech news in your inbox each week.