Mistral Small 3.2
Mistral
Rank #23 overall · 21 evaluations
Score: 7.61
Performance Metrics
Metric Breakdown
Metrics: Relevance, Style/Tone, Semantic Consistency, Human Likeness, Readability, Ensemble Agreement, Factual Accuracy
Strengths
Relevance: 8.17
Style/Tone: 8.05
Areas for Improvement
Factual Accuracy: 6.54
Ensemble Agreement: 6.62
Performance by Domain
Head-to-Head Record
| Opponent | Wins | Losses | Ties | Avg Diff |
|---|---|---|---|---|
| Jamba Mini | 4 | 0 | 9 | +2.00 |
| Gemini 2.5 Flash | 0 | 0 | 9 | -0.17 |
| GPT-5 Mini | 0 | 2 | 7 | -0.22 |
| Mistral Nemo | 0 | 0 | 6 | -0.04 |
| Mistral Large | 0 | 0 | 6 | -0.02 |
| Mistral Medium | 0 | 1 | 3 | -0.13 |
| Sonar Reasoning Pro | 3 | 0 | 1 | +1.50 |
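
The head-to-head rows above can be summed into a single record. The sketch below is only an illustration: the `records` list and the `tally` helper are hypothetical names, and the numbers are copied by hand from the table rather than pulled from any leaderboard API.

```python
# Head-to-head rows copied from the table above: (opponent, wins, losses, ties).
# `records` and `tally` are illustrative names, not part of any leaderboard API.
records = [
    ("Jamba Mini", 4, 0, 9),
    ("Gemini 2.5 Flash", 0, 0, 9),
    ("GPT-5 Mini", 0, 2, 7),
    ("Mistral Nemo", 0, 0, 6),
    ("Mistral Large", 0, 0, 6),
    ("Mistral Medium", 0, 1, 3),
    ("Sonar Reasoning Pro", 3, 0, 1),
]


def tally(rows):
    """Sum wins, losses, and ties across all opponents."""
    wins = sum(r[1] for r in rows)
    losses = sum(r[2] for r in rows)
    ties = sum(r[3] for r in rows)
    return wins, losses, ties


wins, losses, ties = tally(records)
total = wins + losses + ties

print(f"{wins} wins, {losses} losses, {ties} ties over {total} comparisons")
# -> 7 wins, 3 losses, 41 ties over 51 comparisons
print(f"tie rate: {ties / total:.1%}")
# -> tie rate: 80.4%
```

Read this way, most listed comparisons ended in ties, and the wins come entirely from the Jamba Mini and Sonar Reasoning Pro rows.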