Claude Sonnet 4 vs GPT-5.2 (Thinking)

5 head-to-head matchups on identical queries

Claude Sonnet 4: 7.05 (1 Win, 3 Ties, 1 Loss)
VS
GPT-5.2 (Thinking): 8.71 (1 Win, 3 Ties, 1 Loss)
Metric-by-Metric Comparison