GPT-4.1 vs Grok 3 Mini

3 head-to-head matchups on identical queries

GPT-4.1: 8.64 (2 wins, 1 tie, 0 losses)
vs.
Grok 3 Mini: 7.76 (0 wins, 1 tie, 2 losses)
Metric-by-Metric Comparison