Mistral Medium 3.5 logo

Mistral Medium 3.5

Mistral Medium 3.5 is a dense 128B parameter large language model with a 256K context window that combines instruction-following, reasoning, and coding in one unified model. It features multimodal vision input, configurable reasoning effort, and scores 77.6% on SWE-Bench Verified.

Mistral Medium 3.5 screenshot

Mistral Medium 3.5 is a Large Language Models (LLMs) tool. Mistral Medium 3.5 is a dense 128B parameter large language model with a 256K context window that combines instruction-following, reasoning, and coding in one unified model. It features multimodal vision input, configurable reasoning effort, and scores 77.6% on SWE-Bench Verified. Best for software developers and engineers, data scientists and analysts and scientists and researchers.

6 key features6+ alternatives →

About Mistral Medium 3.5

Open-weight 128B model for coding, reasoning, and instruction-following with vision

Key Features

**Unified Model Architecture.** Combines instruction-following, reasoning, and coding capabilities in a single 128B dense model, replacing three separate specialized models with configurable reasoning effort per request.
**Large Context Window.** Supports 256K tokens of context, allowing you to process entire codebases, long documents, and complex multi-turn conversations without losing information.
**Multimodal Vision Input.** Includes a vision encoder trained from scratch that handles variable image sizes and aspect ratios for document analysis, diagram understanding, and visual question answering.
**Strong Coding Performance.** Scores 77.6% on SWE-Bench Verified and 91.4% on τ³-Telecom agentic benchmarks, making it suitable for complex coding tasks and autonomous agent workflows.
**Open Weights for Self-Hosting.** Released under a modified MIT license with weights available on Hugging Face, runs on as few as 4 GPUs at Q4 quantization for self-hosted deployments.
**Remote Cloud Agents.** Powers Mistral Vibe CLI with asynchronous cloud execution, parallel coding sessions, and GitHub pull request integration for hands-off development workflows.

Frequently Asked Questions

Mistral Medium 3.5 is a 128 billion parameter large language model released by Mistral AI in April 2026. It's a dense transformer model with a 256K context window that combines instruction-following, reasoning, and coding in one unified architecture. The model is released as open weights under a modified MIT license and can be self-hosted or accessed via API.

Mistral Medium 3.5 costs $1.50 per million input tokens and $7.50 per million output tokens through the Mistral API. This is half the input cost of Claude Sonnet 4.6 and 40% cheaper than GPT-4o. For self-hosting, the open weights are free to download under a modified MIT license, though high-revenue enterprises may need a commercial agreement.

Yes, Mistral Medium 3.5 includes multimodal vision capabilities. It has a vision encoder trained from scratch that accepts both text and image inputs with text output. The encoder handles variable image sizes and aspect ratios, making it suitable for document parsing, diagram understanding, screenshot analysis, and visual question answering tasks.

Mistral Medium 3.5 scores 77.6% on SWE-Bench Verified, which is close to Claude Sonnet 4.6 at 79.6% but ahead of smaller models like Qwen 3.6 at 72.4%. It's designed as a unified model that handles coding, reasoning, and general tasks together, with configurable reasoning effort. The model also powers Mistral Vibe CLI for autonomous coding agents that can open pull requests.

User Reviews

Similar Tools

View all →