Details
- Mistral AI has launched OCR 3, its latest optical character recognition system designed for both structured and unstructured documents.
- Company testing shows it offers greater character-level accuracy and lower latency compared to established enterprise solutions and recent AI-based OCR offerings.
- OCR 3 is available now via a REST API and a no-code Document AI playground inside Mistral AI Studio.
- Pricing is published in a blog post, starting around $0.0003 per page, alongside throughput numbers for documents such as invoices, IDs, and multi-language scans.
- The product supports features like table extraction, document layout parsing, and automatic redaction—all accessible through the same API endpoint as Mistral’s language models.
- This marks Mistral’s first standalone perception tool, broadening its scope beyond large language models like Mixtral and positioning the company as a comprehensive document intelligence platform.
Impact
This move intensifies competition with leaders like Google Cloud Vision and AWS Textract by claiming better accuracy at a lower cost. If adopted widely, it could help drive uptake of sovereign, GDPR-compliant AI across Europe and prompt rivals to accelerate their integration of multimodal capabilities.
