Jamba Large

AI21 Jamba

Rank #15 overall · 11 evaluations

8.20
Performance Metrics
Metric Breakdown
Relevance
8.82
Style/Tone
8.64
Semantic Consistency
8.55
Human Likeness
8.14
Readability
8.05
Ensemble Agreement
7.45
Factual Accuracy
6.10
Strengths
Relevance: 8.82
Style/Tone: 8.64
Areas for Improvement
Factual Accuracy: 6.10
Ensemble Agreement: 7.45
Performance by Domain
Head-to-Head Record
OpponentWinsLossesTiesAvg Diff
Gemini 3 Flash 4 0 1 +1.10
GPT-4.1 Nano 0 2 3 -0.17
Sonar 3 0 2 +0.34
Mistral Nemo 0 1 2 -2.47