Mistral Small 3.2

Mistral Mistral

Rank #23 overall · 21 evaluations

7.61
Performance Metrics
Metric Breakdown
Relevance
8.17
Style/Tone
8.05
Semantic Consistency
7.91
Human Likeness
7.50
Readability
7.40
Ensemble Agreement
6.62
Factual Accuracy
6.54
Strengths
Relevance: 8.17
Style/Tone: 8.05
Areas for Improvement
Factual Accuracy: 6.54
Ensemble Agreement: 6.62
Performance by Domain
Head-to-Head Record
OpponentWinsLossesTiesAvg Diff
Jamba Mini 4 0 9 +2.00
Gemini 2.5 Flash 0 0 9 -0.17
GPT-5 Mini 0 2 7 -0.22
Mistral Nemo 0 0 6 -0.04
Mistral Large 0 0 6 -0.02
Mistral Medium 0 1 3 -0.13
Sonar Reasoning Pro 3 0 1 +1.50