SpaceXAI Cursor jumps up the rankings
Two big takeaways.
Composer 2.5 is way better than most people think.
A 63.2% score at $0.55 per task. That nearly matches Opus 4.7 Max and GPT 5.5 Extra High at roughly 1/20th the cost. This is insane value.
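Quick back-of-the-envelope on that value claim, using only the numbers quoted here. The per-task cost of the top models is not given, so `implied_frontier_cost` is an assumption derived by taking the 20x figure at face value, and all variable names are just for illustration.

```python
# Figures quoted in the post.
composer_score = 63.2   # percent
composer_cost = 0.55    # USD per task
cost_ratio = 20         # the "20x" claim, assumed to be exact here

# Implied per-task cost of the top models IF the 20x figure holds exactly.
# This is an inference, not a reported number.
implied_frontier_cost = composer_cost * cost_ratio

# Dollars spent per percentage point of score for Composer 2.5.
composer_cost_per_point = composer_cost / composer_score

print(f"Implied top-model cost: ${implied_frontier_cost:.2f} per task")
print(f"Composer 2.5: ${composer_cost_per_point:.4f} per point")
```

Cost per point is only one rough way to look at value. It says nothing about which tasks each model gets right.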
Gemini 3.5 Flash is #10 at 49.8%.
Below GPT 5.5 Low.
Below Opus 4.7 Low.
Google's newest model can't even beat budget-tier competition.
Composer 2.5 is the sleeper.
Gemini 3.5 Flash is the disappointment.