Claude Sonnet 4.5 vs GPT-4.1

13 head-to-head matchups on identical queries

Claude Sonnet 4.5: 8.40 (1 win, 6 ties, 6 losses)
vs.
GPT-4.1: 8.64 (6 wins, 6 ties, 1 loss)
Metric-by-Metric Comparison