GPT-5.2 vs Grok 4.1 (Reasoning)

16 head-to-head matchups on identical queries

8.71
9 Wins 7 Ties 0 Losses
VS
7.68
0 Wins 7 Ties 9 Losses
Metric-by-Metric Comparison