Claude Sonnet 4.5 vs GPT-5.2 (Thinking)

80 head-to-head matchups on identical queries

8.40
8 Wins 33 Ties 39 Losses
VS
8.71
39 Wins 33 Ties 8 Losses
Metric-by-Metric Comparison