Mistral Nemo
Mistral
Rank #30 overall · 12 evaluations
Overall score: 6.53
Performance Metrics
Metric Breakdown (chart): Relevance, Semantic Consistency, Style/Tone, Human Likeness, Readability, Factual Accuracy, Ensemble Agreement
Strengths
Relevance: 7.21
Semantic Consistency: 6.92
Areas for Improvement
Ensemble Agreement: 5.50
Factual Accuracy: 5.77
Head-to-Head Record
| Opponent | Wins | Losses | Ties | Avg Diff |
|---|---|---|---|---|
| Mistral Large | 0 | 0 | 7 | +0.01 |
| Mistral Small 3.2 | 0 | 0 | 6 | +0.04 |
| Mistral Medium | 0 | 0 | 5 | -0.06 |
| Claude Sonnet 4.5 | 0 | 1 | 2 | -0.09 |
| Jamba Large | 1 | 0 | 2 | +2.47 |