Mistral AI

B Tier · 7.5/10

European AI lab with open and commercial models -- Mistral Medium 3.5 SHIPPED 2026-04-29 (128B dense, 256k context, 77.6% SWE-Bench Verified) plus Vibe Remote Agents + Le Chat Work Mode. Earlier 2026 line: Small 4 (Mar 2026 119B MoE Apache 2.0 unified), Medium 3 (Apr 9 2026), Voxtral TTS (Mar 2026 open-source speech)

Last updated: 2026-05-04Free tier available

Score Breakdown

6.0

Ease of Use

8.0

Output Quality

9.0

Value

7.0

Features

Benchmark Scores

Benchmarks for Mistral Medium 3.5 (vendor-published; third-party verification pending)

Benchmark	Description	Score
MMLU	Knowledge across 57 subjects	86%
HumanEval	Python code generation	92%
MATH	Math problem solving	69%
SWE-Bench Verified		77.6%

Last updated: 2026-04-29

Visit Mistral AI

Personality & Tone

The European pragmatist

Tone: Efficient, terse, and slightly blunt. Mistral answers in fewer words than Claude or ChatGPT, especially on factual questions, and rarely hedges or softens its take.

Quirks: Trained with less Anglocentric data than Llama, so it handles French, German, and Spanish notably better than US-origin models. Refusal rates are lower than ChatGPT or Gemini on most gray-area prompts.

The Good and the Bad

What we like

+Mistral Medium 3.5 (April 29 2026) is Mistral's first 'flagship merged' model -- 128B dense, 256k context, 77.6% on SWE-Bench Verified, in public preview at $1.5/$7.5 per million tokens. Closes most of the coding-benchmark gap to Claude Opus / GPT-5.5 at materially lower API cost
+Vibe Remote Agents (also 4/29) lets you launch cloud-based coding sessions that run asynchronously and in parallel via CLI or Le Chat -- file diffs, tool calls, and the ability to teleport a local session to the cloud while preserving history and approval state. Unique in the category as of today
+Le Chat Work Mode (4/29) is the first agentic mode shipped at the consumer-chat tier -- multi-step task completion, cross-tool workflows, research synthesis, inbox triage, with explicit approval gates for sensitive operations
+Mistral Small 4 (March 2026) unifies the previously-split Small/Magistral/Pixtral/Devstral lines into one 119B MoE Apache-2.0 model. Voxtral TTS (March 2026) fills the speech gap with a competent open-source 4B-param model that runs on consumer hardware
+Extremely competitive API pricing remains the moat -- Small 4 at $0.20/1M tokens, Medium 3.5 at $1.5/$7.5 per million tokens, against frontier-class quality

What could be better

−Le Chat web interface is bare-bones compared to ChatGPT or Claude
−Smaller ecosystem -- fewer integrations and community resources
−Less brand recognition means less community help when you get stuck
−Documentation could be better, especially for newer models

Pricing

Le Chat (Free)

✓Web chat interface with Mistral models
✓Mistral Small 4 + Medium 3 available
✓Basic features, limited rate

API (Mistral Small 4)

$0.20/per 1M tokens

✓119B MoE, Apache 2.0 open-weight
✓Unifies Small/Magistral/Pixtral/Devstral into one model
✓Fast, efficient, 128K context

API (Mistral Medium 3.5)

$1.5 / $7.5/per 1M tokens (input/output)

✓Public preview SHIPPED 2026-04-29 -- Mistral's first 'flagship merged' model
✓128B dense, 256k context, 77.6% SWE-Bench Verified
✓Underlies new Vibe Remote Agents + Le Chat Work Mode

API (Mistral Medium 3 -- predecessor)

$1/per 1M tokens

✓Launched April 9, 2026
✓EU AI Act compliance metadata
✓Balanced price/performance, superseded by 3.5 for new workloads

API (Mistral Large 3)

$2/per 1M tokens

✓Flagship sparse MoE
✓256K context
✓MRL license (paid for commercial self-hosting)

Voxtral TTS

✓4B-param open-source speech model, March 2026
✓9 languages, runs on consumer hardware
✓Apache 2.0

System Requirements

Hardware needed to self-host. Min = smallest viable setup (usually heavy quantization). Max = full-precision / production-grade.

Model variant	Min	Max
Mistral Small 3 / Devstral 2 (24B dense, Apache 2.0)	10 GB VRAM (Q4)	1× A100 40 GB FP16
Mistral 14B / 8B / 3B (Apache 2.0)	6 / 4 / 2 GB VRAM (Q4)	24 / 16 / 8 GB VRAM FP16
Mixtral 8x22B (legacy)	64 GB RAM + 24 GB GPU (Q3)	2× A100 80 GB FP16
Mistral Large 3 (flagship)	Not self-hostable under free terms -- MRL license	Requires paid commercial license to self-host

Known Issues

ENTERPRISE PRODUCT (2026-04-28 public preview): Mistral Workflows -- a Temporal-powered durable orchestration engine for AI workloads. Built on the same Temporal core that backs Netflix / Stripe / Salesforce, with Mistral-added streaming, payload handling, multi-tenancy, and observability. Python SDK v3.0, Helm-deployable workers, customer-perimeter data residency. Human-in-the-loop approvals via simple Python (wait_for_input()), full execution tracking in Studio, deploys cloud / on-prem / hybrid. Distinct from Vibe Remote Agents (the consumer-facing async coding sessions); Workflows is the enterprise infra layer that makes them and other AI workloads durable at scale. Live customers cited at preview: ASML, ABANCA, CMA-CGM, France Travail, La Banque Postale, Moeve. Pricing during preview not disclosedSource: Mistral AI blog (mistral.ai/news/workflows) · 2026-04-28
Mistral Medium 3.5 SHIPPED 2026-04-29 in public preview, accompanied by two net-new agentic offerings: Vibe Remote Agents (cloud-based coding sessions, async + parallel, CLI or Le Chat entry) and Le Chat Work Mode (agentic chat for multi-step tasks across tools). The model is 128B dense, 256k context, and posts 77.6% on SWE-Bench Verified. Pricing is $1.5/$7.5 per million tokens (input/output). 'Flagship merged' framing means Medium 3.5 supersedes Medium 3 for new workloads -- existing Medium 3 deployments continue to workSource: Mistral AI blog (mistral.ai/news/vibe-remote-agents-mistral-medium-3-5) · 2026-04-29
Le Chat occasionally slower than competitors during European business hoursSource: Reddit r/MistralAI · 2026-03
Voxtral TTS English output is competent but trails ElevenLabs v3 on expressiveness -- it's positioned as an open-source alternative, not a quality leaderSource: TechCrunch Voxtral coverage · 2026-03

Best for

Developers who want cheap, high-quality API access. Also strong for multilingual applications and European companies that prefer an EU-based AI provider for data residency.

Not for

Non-technical users looking for a polished chat experience. ChatGPT and Claude are much better as consumer products.

Our Verdict

Mistral is the scrappy underdog that keeps surprising people. Their models are impressively efficient -- you get near-GPT-4 quality at a fraction of the API cost. But the consumer experience (Le Chat) is rough. This is primarily a developer's tool. If you're building AI applications on a budget, Mistral should be on your shortlist.

Sources

Mistral AI: Workflows public preview (2026-04-28) (accessed 2026-05-04)
Mistral AI: Vibe Remote Agents + Mistral Medium 3.5 (2026-04-29) (accessed 2026-04-30)
Mistral AI official site (accessed 2026-04-30)
TechCrunch: Mistral releases Voxtral TTS (accessed 2026-04-16)
SiliconANGLE: hardware-efficient language models (accessed 2026-04-16)
LMSYS Chatbot Arena rankings (accessed 2026-04-16)
API testing (accessed 2026-04-16)