Mistral Nemo

Mistral Mistral

Rank #30 overall · 12 evaluations

6.53
Performance Metrics
Metric Breakdown
Relevance
7.21
Semantic Consistency
6.92
Style/Tone
6.92
Human Likeness
6.63
Readability
6.54
Factual Accuracy
5.77
Ensemble Agreement
5.50
Strengths
Relevance: 7.21
Semantic Consistency: 6.92
Areas for Improvement
Ensemble Agreement: 5.50
Factual Accuracy: 5.77
Performance by Domain
Head-to-Head Record
OpponentWinsLossesTiesAvg Diff
Mistral Large 0 0 7 +0.01
Mistral Small 3.2 0 0 6 +0.04
Mistral Medium 0 0 5 -0.06
Claude Sonnet 4.5 0 1 2 -0.09
Jamba Large 1 0 2 +2.47