GPT-5.2 (Thinking) vs Gemini 2.5 Flash

16 head-to-head matchups on identical queries

GPT-5.2 (Thinking): 8.71 (7 wins, 8 ties, 1 loss)
Gemini 2.5 Flash: 8.67 (1 win, 8 ties, 7 losses)
Metric-by-Metric Comparison