Mistral Small 4 is a large language model (LLM): a hybrid AI model that combines reasoning, coding, and vision in one open-source system. It is best suited to software developers and engineers, data scientists and analysts, and scientists and researchers.
About Mistral Small 4
Key Features
Frequently Asked Questions
Mistral Small 4 unifies reasoning, coding, and vision capabilities in a single model instead of requiring separate specialized models. It also features configurable reasoning depth per request, letting you choose between fast responses and deep reasoning without switching models.
Mistral Small 4 is released under the Apache 2.0 license with full commercial use rights. You can self-host it, but you'll need significant GPU infrastructure: at minimum 4x NVIDIA H100 or 2x H200 GPUs. The full weights are available on Hugging Face.
Through the Mistral API, Mistral Small 4 costs $0.15 per million input tokens and $0.60 per million output tokens. If you self-host under the Apache 2.0 license, there are no licensing fees, but you'll need to cover your own infrastructure costs.
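As a rough illustration of how the per-token API prices translate into a bill, the sketch below multiplies token counts by the rates quoted above. The function name and the sample workload are hypothetical, and the rates are copied from this page rather than fetched from Mistral, so check current pricing before relying on the numbers.

```python
# Rates quoted on this page, in USD per million tokens (may change over time).
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.60


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated API cost in USD for the given token counts."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    print(f"${estimate_cost(2_000_000, 500_000):.2f}")  # prints "$0.60"
```

Output tokens cost four times as much as input tokens at these rates, so workloads that generate long responses are dominated by the output side of the bill.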
Mistral Small 4 supports a 256K-token context window, which is roughly 190,000 words of usable context. This is large enough to handle full codebase analysis, lengthy legal documents, and multi-session conversations without aggressive chunking.
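The word figure above follows from a common rule of thumb of about 0.75 English words per token (256,000 × 0.75 ≈ 192,000). The sketch below uses that ratio to guess whether a document fits in the window. The ratio, the reading of "256K" as 256,000, the helper names, and the output reserve are all assumptions for illustration; a real check should count tokens with the model's own tokenizer.

```python
import math

CONTEXT_WINDOW_TOKENS = 256_000  # "256K" read as 256,000 (assumption)
WORDS_PER_TOKEN = 0.75           # rough English rule of thumb (assumption)


def estimated_tokens(text: str) -> int:
    """Estimate the token count from a whitespace word count."""
    words = len(text.split())
    return math.ceil(words / WORDS_PER_TOKEN)


def fits_in_context(text: str, reserved_for_output: int = 4_000) -> bool:
    """Return True if the text plus an output reserve fits in the window."""
    return estimated_tokens(text) + reserved_for_output <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    short_doc = "word " * 1_000
    long_doc = "word " * 200_000
    print(fits_in_context(short_doc))  # True
    print(fits_in_context(long_doc))   # False
```

Code and non-English text usually tokenize less efficiently than English prose, so the estimate is likely optimistic for codebase analysis.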





