Claude Sonnet 4.5 (Thinking) vs GPT-5.2 (Thinking)

9 head-to-head matchups on identical queries

Claude Sonnet 4.5 (Thinking): 7.60 (0 Wins, 5 Ties, 4 Losses)
GPT-5.2 (Thinking): 8.71 (4 Wins, 5 Ties, 0 Losses)

Metric-by-Metric Comparison