GPT-5 vs Grok 4 (Reasoning)

54 head-to-head matchups on identical queries

8.83
32 Wins 21 Ties 1 Losses
VS
8.08
1 Wins 21 Ties 32 Losses
Metric-by-Metric Comparison