Jamba Large
AI21 Jamba
Rank #15 overall · 11 evaluations
Overall score: 8.20
Performance Metrics
Metric Breakdown
[Chart axes: Relevance, Style/Tone, Semantic Consistency, Human Likeness, Readability, Ensemble Agreement, Factual Accuracy]
Strengths
Relevance: 8.82
Style/Tone: 8.64
Areas for Improvement
Factual Accuracy: 6.10
Ensemble Agreement: 7.45
Performance by Domain
Head-to-Head Record
| Opponent | Wins | Losses | Ties | Avg Diff |
|---|---|---|---|---|
| Gemini 3 Flash | 4 | 0 | 1 | +1.10 |
| GPT-4.1 Nano | 0 | 2 | 3 | -0.17 |
| Sonar | 3 | 0 | 2 | +0.34 |
| Mistral Nemo | 0 | 1 | 2 | -2.47 |
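
The head-to-head rows can be rolled up into a single record. The sketch below is illustrative only: the numbers are copied from the table above, while the `RECORDS` list and the `tally` helper are hypothetical names, not part of the leaderboard. Counting ties as non-wins is an assumption, not the site's stated method.

```python
# Head-to-head rows copied from the table above: (opponent, wins, losses, ties).
RECORDS = [
    ("Gemini 3 Flash", 4, 0, 1),
    ("GPT-4.1 Nano", 0, 2, 3),
    ("Sonar", 3, 0, 2),
    ("Mistral Nemo", 0, 1, 2),
]


def tally(records):
    """Sum wins, losses, and ties across all opponents."""
    wins = sum(r[1] for r in records)
    losses = sum(r[2] for r in records)
    ties = sum(r[3] for r in records)
    return wins, losses, ties


wins, losses, ties = tally(RECORDS)
total = wins + losses + ties

# Assumption: ties count as non-wins when computing the win rate.
print(f"{wins}-{losses}-{ties} over {total} matchups")
print(f"win rate: {wins / total:.1%}")
```

With the four opponents listed, this comes to 7 wins, 3 losses, and 8 ties across 18 matchups.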